Major Internet Retrieval Tools


1. Singular Internet Retrieval Tools
2. Collective Internet Retrieval Tools
3. Retrieval Tools for Specific Types of Information
4. Retrieval Tools for Images and Videos
5. Retrieval Tools for Blogs, News, or Subject-Specific Information
6. Retrieval Tools for Non-Web Materials

1. Singular Internet Retrieval Tools

1.1A. Focus on searching (The Big Four)

Google http://www.google.com

Created by Larry Page and Sergey Brin, the beta version of Google was released in February 1999. It presents a most relevant site for any searches based on its ranking algorithm with its "I'm feeling lucky" button. The relevance of a site is measured by, among other things, examining how many other sites are pointing to it and their importance. It has been rated as the top search engine in recent years. At present, it is said to conduct more than 50 percent of all Internet searches with regular introduction of new services. Other services and products Google offers include Google Book Search and Google Scholar.

Yahoo! http://www.yahoo.com

Yahoo! is the best known and most popular Internet directory service. Tens of thousands of Internet sites are listed within an easy-to-use, comprehensive subject hierarchy or taxonomy. The lists feature short descriptions of the sites. Yahoo! is one of the best places to start locating information on the Internet. Its search service is increasingly used by many people. Y!Q Beta for contextual search and Creative Commons Search Beta are some services Yahoo! recently introduced. Yahoo! also participates in the Open Content Alliance project. If you think the classic Yahoo! interface appears too busy, try This too is Yahoo! - called Alpha but in Beta version.

Live Search (Formerly known as MSN Search) http://www.live.com

Live Search, also called Windows Live Search or Live.com, is gaining popularity on the Internet with strong technology support from Microsoft. It offers some unique features at its Advance Search interface. In addition, it supports other types of searching such as local search and products search.

Ask http://www.ask.com

Made its debut on the Web in 1998, Ask, known as Ask Jeeves in the past, is the first search tool that allows users to enter a question in natural language as a search query although its initial setup was based on questions and answers developed by human beings. It has experienced ups and downs since its establishment. It also hosts a search site designed specifically for kids at http://www.askforkids.com.

1.1B. Focused on searching (The Old Magnificent Seven)

AltaVista http://www.altavista.com

It was developed at the Digital's Research Laboratories in California, and delivered to the Web on December 15, 1995. It received favorable reviews for years before other major search engines came out. It is the first site that introduced the language translation feature. It has been one of the major search tools in its kind. It switched to Yahoo!'s database in March 2004.

Excite http://www.excite.com

A multi-purpose site from Architect. It used to support concept searching. Excite also suggested terms for search modification, which was called "Zoom In". It ceased functioning as an individual search engine in December 2001, and becomes a property of IAC Search & Media.

HotBot - RIP

HotBot was owned by Lycos. But, it had simply become an interface for tools such as Ask.com since February 2005. As a newcomer from the University of California in the mid 1990s, it used to be a comprehensive search engine without sacrificing speed. It also provided a variety of options for limiting searches then.

Go Network (Infoseek) http://www.go.com

Infoseek offered both free and a subscription-based services in the beginning. It was voted by PC Computing magazine's editors as the most valuable search tool in 1995. Purchased by the Go Network, it becomes a search tool with few distinctive features. Go, as an independent search tool, is gone because of the decision made by its parent company in January 2001. The searches at its site are now powered by Yahoo!.

Lycos http://www.lycos.com

Lycos used to be a huge search tool. It has undergone great changes in the past years. One weak spot of Lycos remains to be that it indexes only part of the sites it covers. Sponsored sites are listed before search results, which would effectively scare its users away. Lycos is currently powered by Ask.com.

Open Text - RIP

Open Text was a major Web-based search tool. Later, it became a site for business information under a new name Livelink Pinstripe, and then BusinessWeb. Now it disappeared entirely from the Internet as a retrieval tool.

WebCrawler http://webcrawler.com

WebCrawler is small, fast and relatively easy to use. It has been keeping a low profile since the very beginning. Yet, it stays in the very competitive field of information retrieval on the Internet. Since January 1999, it became part of the Excite@Home portal. Now, it is owned by InfoSpace as a meta search tool. Its SearchSpy displays real-time queries of WebCrawler users.

1.1C. Focused on searching (Search Boomers)

AlltheWeb (Formerly known as Fast) http://www.alltheweb.com

Introduced to the public in May 1999, the main features of Fast include size, speed and search filters. Multimedia searching and other features are added later. It was physically located in Norway. It switched to Yahoo!'s database after 2003, and is becoming a Yahoo! company with few of its own identities left.

Exalead http://www.exalead.com/search

Geographically located in France, Exalead was created in 2000 with a host of features that are usually not available from other search sites. Those features include phonetic search and approximate spelling. It is rapidly gaining reputation among Internet searchers. Its coverage for European resources appears to be better than other tools based in America.

Gigablast http://www.gigablast.com

Making its debut in the giant field of Internet searching, Gigablast gained fame graduately by introducing natural lanaguage searching (Ask a question) and suggesting related terms or concepts for further search with its Giga Bits. While the latter is still supported, the former has been removed. It also offers four options (i.e., Entire Directory, Entire Category, Pages in Base Categories, and Sites in Base Category) in searching its directory.

mozDex http://www.mozdex.com

Lauched its beta version in early spring of 2004, it uses open source search technologies to operate an open and fair search engine. It also indicates that data from Dmoz , aka Open Directory, offers the resources for starting mozDex. The Explain link after each search result lists the weighting algorithm in addition to other information. It was not working in August 2008 because, according to a message displayed at its site when one tries a search, "We are doing some updates to our search engine. Please come back later!"

1.2. Focused on browsing

About.com http://about.com

About.com includes over 470 highly targeted environments (i.e., topics), each overseen by a professional guide. The guide is normally displayed along with the full name and photo of the individual who is responsible for its development. It was acquired by The New York Times Company in 2005.

Galaxy http://www.galaxy.com

Formerly EINet Galaxy. One of the first browsing tools available and one of the largest. A well organized and easy-to-browse topical arrangement to a wide variety of Internet resources. Now it focuses on classified and directories using a search mechanism.

LookSmart http://www.looksmart.com

Looksmart, a category-based Web directory, was founded in 1995 by an Australian husband and wife team, Evan Thornley and Tracey Ellery. LookSmart had supplied its directory to such sites as Excite and AltaVista in the past. Now headquartered in California, it becomes an advertising and technology company and keeps little search component in its new development.

Open Directory http://dmoz.org

It is a volunteer-based project as well as self-regulating republic where experts collect their recommendations, without including noise and misinformation. Its goal is to produce the most comprehensive directory of the Web, by relying on a vast army of volunteer editors. However, it does not seem well-kept in recent years based on what it can offer nowadays.

qbsearch.com (Formerly QuickBrowse) http://www.qbsearch.com/

Its predecessor, QuickBrowse, was created by a freelance journalist, Marc Fest, to invent a way by literally "stitching together" multiple webpages for faster viewing. More than 16 various retrieval tools were listed for selection but final results were not merged in any way. Currently, it becomes a directory service, focusing on telecommunications resources. qbsearch also lists sites for locating people and other type of resources.

1.3. Alternative tools

AfterVote (Now called Scour) http://http://www.scour.com/

After acquired by Internext media in 2008, AfterVote becomes Scour. Strictly speaking, Scour was a ranking facility for search results from major tools rather than a search engine. It lets users conduct a search using Google, Yahoo! and MSN, and then vote/comment on what has been retrieved.

ChaCha http://search.chacha.com

Labeled as People Powered Search, ChaCha has re-introduced the search intermediary dimension into the search arena. Free human search guides are available to help just a click away via live chat. Be aware that you have to visit search.chacha.com, the so called "Classic Chacha", to start using it while www.chacha.com only displays information about Chacha itself.

Hakia http://hakia.com

Still in Beta, Hakia is claimed to be a semantic, meaning-based search engine. The benefits Hakia enumerated on its website includes meaningful categorization of search results (i.e., galleries) and differentiation of homonyms. It also can take questions and sentences as queries in addition to the common phrases and keywords.

Quintura http://www.quintura.com/

Quintura presents its search results two ways in two parallel windows. One is the traditional list and the other is a tag cloud - a display that shows the relationships among various terms involved when one places the cursor on a term. This feature is what makes Quintura an alternative retrieval tool. Its version for kids, Quintura Kids, is built on the same concept.

Top 100 Alternative Search Engines

Charles Knight and his colleagues have started compiling a list of top 100 alternative search engines since January 2007 on a monthly basis. Here is a list for August 2008. Although the term "alternative" is defined broadly and loosely on this list, it does open a new window for viewing the recent developments in Internet searching.


2. Collective Internet Retrieval Tools

2.1. Collective listing, individual searches

iTools! http://www.iTools.com

iTools!, which stands for Internet Tools!, is a collection of Internet retrieval tools listed under several categories such as search tools, language tools, and research tools. It also includes a number of not-so-common tools (e.g., Link Popularity Check).

Zuula http://zuula.com/

Zuula has come into the search world since 2005. It serves as a gateway to searches in five areas: Web, images, news, blog and jobs. Several individual tools are chosen for each area while the searcher is given the option of changing preferences, including the order of selected search engines.

2.2. Simultaneous searches, integrated results

Beaucoup http://www.beaucoup.com

Beaucoup is a site established by one person, Teri Madden. The site claims to cover more than 2,500 retrieval tools available on the Internet. It can query 10 search sites at once. Recented, it also introduces a mechanism that allow the user to conduct searches in a single tool with the help of a drop-down menu. Sponsored results, sometimes very lengthy, are listed before any other results.

Dogpile http://www.dogpile.com

Dogpile sends queries to over 20 Internet retrieval tools such as Google and Yahoo!. Users can specify whether the search should be limited to images, audio or the like. Dogpile can serve as a good start point for Internet retrieval tasks. It is now part of InfoSpace.

Ixquick http://www.ixquick.com

Compared with other meta tools, Ixquick is new in Internet retrieval. However, it brought some features that other counterparts do not support. For example, it supports multilingual search in 18 languages. It also allows one to remove any results from a search. Choices of individual tools are available only after a search is done. In addition, ranking information from individual tools is provided next to each result.

KartOO http://www.kartoo.com/en_index.htm

Calling itself a visual metasearch engine, it presents retrieved results graphically. That is the most unique feature of KartOO. The relationship between one site to others can be visualized by moving the pointer over a result visual. KartOO also claims that it supports natural language searching. Legend of the visuals shown as results is provided in Help/Key Tips/Functions.

Mamma http://www.mamma.com

Mamma, claiming to be the mother of all search engines, is another meta search tool. It offers a variety of search options which include forming queries using syntax appropriate to specific source search tools the user chooses to probe. Its Power Search interface lists several options for meta searching. It acquired Copernic in December 2005.

MetaCrawler http://www.metacrawler.com

It searches for and combines results from selected retrieval tools. Result sets specify name, URL, the database in which the resource was found, and any annotations that could be found in the source database. Metacrawler was developed by Erik Selberg and Oren Etzioni of the University of Washington, and now it is also part of InfoSpace. MetaSpy displays real-time queries of MetaCrawler users.

Search.com (SavvySearch) http://www.search.com

Before becoming Search.com, SavvySearch covered more than 200 search sites while searching only up to 5 at a time. In addition, it listed major search categories from which users can choose and narrow down a search. Search.com, now as part of CNET, offers a few search options in its Preferences for general searching. Like many other retrieval tools, it sets up tabs (e.g., images, reference) for more specific searches..

Vivisimo http://vivisimo.com

Founded in June 2002 by Carnegie Mellon University research computer scientists, Vivisimo is particularly noted for its clustering technology. Search results from different retrieval sites are automatically grouped into clusters at its side bar. Choices of individual tools can be made from its Advanced Search form. In addition, ranking information from individual tools is provided along with each result. Vivisimo introduced another clustering tool called Clusty in September 2004 with a different interface and additional features (e.g., tabs for Web, News, Images). Clusty now replaces Vivisimo entirely for Web searching while the latter focuses on other business.


3. Retrieval Tools for Specific Types of Information

3.1. Snail mail addresses, phone numbers, etc.

InfoSpace (Now replaced by Superpages.com) http://www.superpages.com/

InfoSpace was a later comer compared with other major retrieval tools in this category. Yet it offered quality and comprehensive services for locating information about individuals and business until it automatically directs users to fee-based services. InfoSpace acquired quite a few Internet retrieval tools in the past. It was recently sold to Idearac Media and became Superpages.com.

PhoneNumber.com http://www.phonenumber.com/

PhoneNumber.com is a tool for an online directory information. In addition to providing traditional white and yellow page listings, it also provides reverse phone number and address searches. The same kind of services is also available for locating area, zip, and country codes. However, what was available for free in the past now becomes fee based at this site.

Switchboard http://www.switchboard.com

Switchboard initially functioned as a network version of white and yellow pages. Other services, such as zip and area code searching, are introduced later. Recently, it becomes a part of Idearac Media. By forging relationship with other companies, it provides additional services such as maps and directions.

3.2. Maps, Driving Directions & More

America's Byways http://www.byways.org

As part of the U.S. Department of Transportation, Federal Highway Administration, the National Scenic Byways Program created this site in an effort to help recognize, preserve and enhance selected roads throughout the United States based on one or more archeological, cultural, historic, natural, recreational and scenic qualities. Those roads are otherwise not commonly known to the public. Browsing by state using the interactive map seems a better retrieval approach if one does not know which byway to search for.

Google Earth http://earth.google.com/

According to Google, Google Earth combines satellite imagery, maps and the power of Google Search to put the world's geographic information at your fingertips. One needs to download and install the free Google Earth software before being able to use it.

Google Maps maps.google.com

Google Maps came out in 2005 with typical map related search services. In addition, it offers draggable maps, satellite images of the location one is searching for, and a few other nice features. For example, you can try My Maps to create and share personalized, annotated maps of your world. A related service of Google Maps is Google Mars, taking users beyond the planet we are living on. A good number of people have built their services on Google Maps. gCensus is one of such services, which presents US census data for an area you have clicked on Google Map.

MapQuest http://www.mapquest.com

It has been offering quality services of maps, driving directions, road planners, and more since mid 1990s. However, like its counterpart MapBlast which already disappeared from the Internet, MasQuest is clearly loosing its competitiveness in this rapidly evolving arena.

MSN Maps & Directions (Formerly MapBlast) http://mappoint.msn.com

As a poineer service in its kind, MapBlast let you type in a street address and city name, then presented you with driving directions and a detailed interactive map of the area in addition to a lot of other commercial information like shops and restaurants. You can pan or zoom the map to get a better look. Now it is owned by MSN. It seems that the map service offered by Live Search at http://maps.live.com will soon take the place of MSN Maps & Directions. Live Search Maps also has some nifty features (e.g., 3D display).

Rand McNally - Maps, Driving Directions & More http://www.randmcnally.com

Based on its quality printed products and services, Ran McNally offers comparable map and driving direction services on the net free of charge.

Yahoo! Maps http://maps.yahoo.com

Yahoo! has recently redesigned its map service. Besides offering standard maps and driving directions, Yahoo! Maps also provides live traffic information such as road work and traffic conditions. It also displays the geographical location one is currently at when it is launched.


4. Retrieval Tools for Images and Videos

4.1 Description-based Image Retrieval

AltaVista Image Search http://www.altavista.com/image/default

Introduced by AltaVista not long after the company went to the public. It exclusively covers image information.

Ditto http://www.ditto.com

It is a commercial tool for finding image information. A large number of commercial sites are sandwiched in the results.

Flickr http://www.flickr.com

As a Yahoo! company, Flickr claims that it is the best online photo management and sharing application in the world. Flickr is rapidly growing thanks to its Web 2.0 capabilities (e.g., tagging).

Google Image Search http://images.google.com

Google introduced image searching to its site later than other major players, claiming the most comprehensive image search on the Web. Due to Google's reputation in text retrieval, its image search service is rated high as well. In addition, a number of options for refining searches are available on the Advanced Image Search interface.

Picsearch http://www.picsearch.com

Picsearch is a search engine devoted to images. The options listed at its Advanced Search (e.g., images or animations, color or black & white) enable the user to fine-tune search results.

4.2 Description-based Video Retrieval

Blinkx http://blinkx.com/

Blinkx is one of the major players in retrieving videos. One can either search videos by keyword or simply browse video categories or thumbnail images.

YouTube http://www.youtube.com/

YouTube has become the best known video retrieval tool after Google acquired it. It provides a rich source of videos of all kinds. Moreover, it sets up a mechanism that enables users to share, evaluate videos of interest.

4.3 Description & Content-based Image & Video Retrieval

WebSEEk http://www.ctr.columbia.edu/webseek/

WebSEEK, developed at Columbia University, is a visual information system for locating images and videos on the World Wide Web using keywords as well as image properties (e.g., color, shape, and texture). It has often been hard for people to use this site in the past years because of its fluctuated availability.


5. Retrieval Tools for Blogs, News, or Subject-Specific Information

5.1. Blogs

Bloggernity http://www.bloggernity.com

Bloggernity is a blogger search directory that allows one to either search or browse blogs at the blog or category level. This site, on the other hand, appears very commercial.

5.2. News

SurfWax http://news.surfwax.com

Like the surf wax that helps surfers grip their surfboard, SurfWax attempts to help websurfers get the best grip on news information from the Web. It also claims to be able to gather news information from 4,000 sources. Its search and display mechanism is unique in that results are displayed while one is typing a query.

5.3. Subject-Specific

GEM http://www.thegateway.org

GEM (Gateway to Educational Materials), initially funded by the National Library of Education and the U.S. Department of Education, evolves into a site more than a center for educational resources. It also assumes many other functions as specified at its homepage. The quality and quantity of GEM resources, however, remain as good as before.

Healia http://www.healia.com/healia/

Healia came into existence not long ago. What distinguishes Healia from other search tools for health information is its ability to zero in retrieved results by pre-defined tabs (e.g., prevention, symptoms, and treatment).

Intute http://www.intute.ac.uk/

Intute, a UK-bases service, provides quality Web resources for education and research in many disciplines under the broad categories of arts & humanities, health & life sciences, science & technology, and social sciences. It uses a thesaurus for resources in the social sciences. It supports both browsing and searching.

PubMed http://www.ncbi.nih.gov/entrez/query.fcgi

It was developed by the National Center for Biotechnology Information (NCBI) at the National Library of Medicine (NLM) for providing access to citations from biomedical literature. The information retrieved from this site is peer reviewed and authoritative.


6. Retrieval Tools for Non-Web Materials

Lists USENET & Forums

Updated by Heting Chu on September 1, 2008