Implementation of Meta-Search Engine by: Antony Pranata http://antonypr.pair.com antonypr@pair.com
Motivation • Most Internet users use search engines to find information on the Web. • Explore the idea behind meta-search engines. • Build a “new” client-side meta-search engine, SimpleFind (http://antonypr.pair.com/simplefind.html).
What are Search Engines? • Search engines are interactive tools to help people locate information available via the World Wide Web. • Search engines are actually databases that contain references to thousands of resources. • Search engines provide interfaces between the users and the underlying databases.
Types of Search Engines • Robot-Driven Search Engines. – Examples: AltaVista, Excite, HotBot, Lycos. • Web Directory Services Search Engines. – Examples: Yahoo!, Snap, LookSmart. • Meta-Search Engines. – Examples: MetaCrawler, Mamma, SavvySearch.
How do People Find New Web Sites? Source: GVU Center at Georgia Institute of Technology.
Search Engines Used Source: GVU Center at Georgia Institute of Technology.
Why do We Need Meta-Search Engines? • Each search engine provides its own database, interface, and special features. • Each search engine collects resources differently, so the same query typed into several search engines is likely to produce different results. • Most search engines contain less than 20% of the data on the Web.
What are Meta-Search Engines? • Meta-search engines search the databases of other search engines and directories. • Meta-search engines do not create their own databases of information. • The results are a compilation of the results from all the search engines queried.
What is SimpleFind? • A “new” client-side meta-search engine implemented in my thesis. • Developed with C++Builder + STL + ICS. • Minimum system requirements: Pentium PC with Windows 95/98/2000, 32 MB RAM, 2 MB of hard-disk space, and an Internet connection.
Features of SimpleFind • Currently supports seven major search engines: AltaVista, Excite, HotBot, Infoseek (Go), Lycos, WebCrawler, and Yahoo! • Sends the query to multiple search engines simultaneously. • Duplicate links are merged into one link. • Customizable sort method.
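The merging and sorting features can be illustrated with a minimal STL sketch. This is not SimpleFind's actual code: the `Hit` and `MergedHit` structures, the `normalizeUrl` rule (dropping only a trailing slash), and the "more engines first" sort order are all assumptions made for illustration.

```cpp
#include <algorithm>
#include <cstddef>
#include <map>
#include <string>
#include <vector>

// One hit as returned by a single search engine (hypothetical structure).
struct Hit {
    std::string url;
    std::string title;
    std::string engine;
};

// One merged result: a URL plus every engine that reported it.
struct MergedHit {
    std::string url;
    std::string title;
    std::vector<std::string> engines;
};

// Assumption: two URLs are "the same link" if they differ only by a
// trailing slash. A real tool would likely need stricter normalization.
std::string normalizeUrl(std::string url) {
    if (url.size() > 1 && url.back() == '/') url.pop_back();
    return url;
}

// Merge duplicate links, then sort so that links reported by more
// engines come first (ties keep their original order).
std::vector<MergedHit> mergeHits(const std::vector<Hit>& hits) {
    std::map<std::string, std::size_t> indexByUrl;  // normalized URL -> index in merged
    std::vector<MergedHit> merged;
    for (const Hit& h : hits) {
        const std::string key = normalizeUrl(h.url);
        auto it = indexByUrl.find(key);
        if (it == indexByUrl.end()) {
            indexByUrl[key] = merged.size();
            merged.push_back(MergedHit{key, h.title, {h.engine}});
        } else {
            merged[it->second].engines.push_back(h.engine);
        }
    }
    std::stable_sort(merged.begin(), merged.end(),
                     [](const MergedHit& a, const MergedHit& b) {
                         return a.engines.size() > b.engines.size();
                     });
    return merged;
}
```

Calling `mergeHits` on the combined hit lists of all queried engines yields one entry per distinct link. A different comparator passed to `std::stable_sort` would give a different sort method.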
Features of SimpleFind • Saves the results in SimpleFind format, as HTML files, or as CSV. • Customizable search engines. • Supports the AND, OR, and NOT operators as well as the + and - operators. • Customizable title and description weights.
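One plausible reading of the + and - operators and of the title and description weights is sketched below. The slides do not describe SimpleFind's real scoring rule, so the `Query` structure, the `parseQuery` and `score` functions, and the additive formula are hypothetical. Matching here is a plain case-sensitive substring test.

```cpp
#include <sstream>
#include <string>
#include <vector>

// Query terms split by operator (hypothetical structure).
struct Query {
    std::vector<std::string> required;  // terms written as +term
    std::vector<std::string> excluded;  // terms written as -term
    std::vector<std::string> optional;  // plain terms
};

// Split a query on whitespace and classify each term by its prefix.
Query parseQuery(const std::string& text) {
    Query q;
    std::istringstream in(text);
    std::string tok;
    while (in >> tok) {
        if (tok.size() > 1 && tok[0] == '+') {
            q.required.push_back(tok.substr(1));
        } else if (tok.size() > 1 && tok[0] == '-') {
            q.excluded.push_back(tok.substr(1));
        } else {
            q.optional.push_back(tok);
        }
    }
    return q;
}

bool contains(const std::string& text, const std::string& term) {
    return text.find(term) != std::string::npos;
}

// Assumed scoring rule: each term adds titleWeight if it occurs in the
// title and descWeight if it occurs in the description. A result that
// contains an excluded term, or lacks a required one, gets -1 (rejected).
double score(const Query& q, const std::string& title, const std::string& desc,
             double titleWeight, double descWeight) {
    for (const std::string& t : q.excluded) {
        if (contains(title, t) || contains(desc, t)) return -1.0;
    }
    double total = 0.0;
    for (const std::string& t : q.required) {
        const bool inTitle = contains(title, t);
        const bool inDesc = contains(desc, t);
        if (!inTitle && !inDesc) return -1.0;
        total += (inTitle ? titleWeight : 0.0) + (inDesc ? descWeight : 0.0);
    }
    for (const std::string& t : q.optional) {
        total += (contains(title, t) ? titleWeight : 0.0) +
                 (contains(desc, t) ? descWeight : 0.0);
    }
    return total;
}
```

With a title weight larger than the description weight, results whose titles match the query rank above results that only mention it in the description.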
Company Name Test
Company Name Test (Query: satelindo) Test was conducted on January 15, 2000
Company Name Test (Query: toefl) Test was conducted on January 15, 2000
Company Name Test (Query: inprise) Test was conducted on January 15, 2000
Company Name Test (Query: nokia) Test was conducted on January 15, 2000
Company Name Test (Query: gadjah mada university) Test was conducted on January 15, 2000
Phrase Test
Phrase Test (Query: indonesia programmer) Test was conducted on August 1, 1999
Phrase Test (Query: tip trick delphi programming) Test was conducted on August 1, 1999
Phrase Test (Query: download free mp3 music) Test was conducted on January 15, 2000
Reality Check • SimpleFind has been distributed to and tested by more than 100 users worldwide. • “Amazing program. One of the most cogently designed, effective web utility programs developed to date.” (William P. Welty, M.Div., Executive Director of The ISV Foundation)
SimpleFind Distribution
Summary • Meta-search engines save time by searching multiple search engines at once. • Searching with a meta-search engine does not always give the best results; however, it is a recommended starting point for a query.
Further Development • The list of search engines can be extended by modifying the SimpleFind.ini file, for example to add Google, GoTo, Northern Light, etc. • The program can be improved to search for other information, such as e-mail addresses (WhoWhere, Four11), software (TuCows, Download, HotFiles), etc.
Thank you Antony Pranata http://antonypr.pair.com antonypr@pair.com