Find more facts and information on our topic page about
search engine
Search Engines
Search Engines
A search engine is an information retrieval system that allows someone to search the vast collection of resources on the Internet and the World Wide Web. All major search engines are similar in that keywords, phrases, or in some instances, questions, are entered in a search form. After clicking on the search command button, the database returns a collection of hyperlinks to resources that contain the search terms. These hyperlinks are listed in some sort of order, usually from most relevant to least relevant, or by how important the web pages are, depending on the search engine used. Search engines are composed of computer programs that create databases automatically. They should not be confused with human-built directories, such as Yahoo!, which depend on people for development and maintenance.
Search Engine Basics
Search engines have three components. The first part is a computer program called a spider or robot, which gathers information on the Internet. The spider retrieves hyperlinks attached to documents. It starts with an existing database and follows the existing hyperlinks to gather new and updated resources to add to the list. If a web page does not contain hyperlinks to other web pages, the search engine cannot find it. Other types of resources that most spiders are unable to locate include files that are not written in Hypertext Markup Language (HTML) , and from specialized databases that require the user to fill out a search form. Spiders automatically do this gathering of documents at intervals that differ from service to service.
Second, resources collected by the spider are loaded into a database that indexes them using a formula that is unique to each. The index contains a copy of every web page the spider finds. People can also submit web pages to this database in case the spider either fails to access it quickly enough, or if there are no links on the pages. While most search engines claim to index the entire World Wide Web, none actually do. Although spiders have many different ways of collecting information from web pages, the major search engines all claim to index the entire text of each web document in their databases. This is called full-text indexing . Some search engines may not index common words such as: and, a, I, to. These are called stop words.
The third part of the search engine is software that allows users to enter keywords in search forms using some type of search expression, with syntax that is supported by the search engine. The search results are then listed in order according to a ranking algorithm . Some search engines list results by relevancy, while others list them by how many web pages link to them, thereby showing the most important, or popular, web pages first, and others group results together by subject. Many search engines employ a combination of these.
Search Features
It is important to understand the different search features available before beginning to use a search engine as each engine has its own way of interpreting and manipulating search expressions. Because a search can retrieve many documents, it is common to have a number of hits, but only a few that are relevant to the query submitted. This is called low precision/high
recall . On the other hand, a searcher may be satisfied with having very precise search results, even if a very small set of hits is returned. This is defined as high precision/low recall . Ideally, the search engine would retrieve all of the relevant documents that are needed. This would be described as high precision/high recall . Search engines support many search features, though not all engines support each one. If they do support certain features, they may use different syntax in expressing them. Before using a search feature, the user should always check the search engine's help pages to understand how the feature is expressed, if it is supported at all. Some examples of search syntax and features used by search engines are: Boolean operators (and, or, not), implied Boolean operators (+ and -), phrase searching, natural language searching, proximity searching, truncation, and field searching .
Types of Search Engines
Search engines can be divided into three basic types: general or major search engines, meta-search engines, and specialty search engines. Each of the major search engines attempts to do the same thing—index as much of the web as possible—so they handle a huge amount of data. Due to this tremendous amount of information, it is common for documents of little useful content to be picked up, making the quality of the ranking scheme used very important. In most first-generation search engines, such as AltaVista and HotBot, results are ranked by relevancy. Relevancy is determined by algorithms that usually count how many times the keywords typed in the search form appear in the documents that exist in the database. Second-generation tools such as Vivisimo, Google, and Direct Hit, use ranking algorithms that use techniques such as grouping and sorting results, importance or popularity of web sites, and human judgment from prior searches. Meta-search engines are tools that search more than one search engine or directory at once, compiling the results and consolidating them into an overall list.
Examples of meta-search engines are Metacrawler, Vivisimo, and Search.com. One drawback of meta-search engines is that they do not include all of the search engines possible, and they are unpredictable in how they handle complex searches. They can be useful for obscure searches.
Specialty search engines, or specialized databases, are search tools that focus on particular subjects, or types of file format (e.g. images or music files). These databases can be time savers because their databases are much smaller and focused on a particular subject area, or type of resource. For example, if a certain legal opinion is needed, a searcher would achieve greater success with FindLaw <http://www.findlaw.com> rather than spending the time in a major search engine such as AltaVista looking through perhaps hundreds of results.
Difficulties and Benefits of Major Search Engines
Search engines send their spiders to crawl the web periodically, so there may be infrequent updates and new sites may not be immediately added. Specialty search engines may be better for very current, dynamically changing information, such as fast-breaking news stories. There is evidence that the major search engines realize this problem and are starting to team with specialty services that provide recent news. For example,
AltaVista uses the Moreover news service to provide users with news stories. Another difficulty is that according to a 1999 study by Steve Lawrence and C. Lee Giles, only 16 percent of the web is indexed. Besides content that cannot be gathered by search engine spiders, such as dynamically generated web pages, and pages that contain no hyperlinks, and certain file types, there is also evidence that commercial sites are more often indexed than non-commercial sites. This part of the web that is hidden from the major search engines is often referred to as the invisible web.
Another difficulty is that information found in major search engines has not been evaluated. The responsibility is placed upon the individual to evaluate what is found. These drawbacks should not detract from the benefits of these major search tools, however. Many general or major search engines,
realizing the added benefit of human-managed information, include directories such as the Open Directory Project, in conjunction with the computerized indexes. And some directories, such as Yahoo!, employ search engines to search the web when their directories fail to provide the resources needed by the searcher. The usefulness of being able to search for obscure topics, multi-faceted subjects, specific web pages and sites, in addition to information from specific dates, languages, news stories, images, and more, makes search engines necessary tools for the searcher to learn and use.
Popular Search Engines
Some of the most popular search engines include:
see also Information Access; Information Overload; Information Retrieval; World Wide Web.
Karen Hartman
Bibliography
Ackermann, Ernest, and Karen Hartman. Internet and Web Essentials: What You Need to Know. Wilsonville, OR: Franklin, Beedle, and Associates, 2001.
Cohen, Laura. "Searching the Web: The Human Element Emerges." Choice Supplement 37 (2000): 17-30.
King, David. "Specialized Search Engines: Alternatives to the Big Guys." Online 24, no. 3 (2000): 67-74.
Lawrence, Steve, and C. Lee Giles. "Accessibility and Distribution of Information on the Web." Nature 400, no. 6740 (1999): 107-109.
Snow, Bonnie. "The Internet's Hidden Content and How to Find It." Online 24, no. 3 (2000): 61-66.
Internet Resources
Lawrence, Steve, and C. Lee Giles. "Accessibility and Distribution of Information on the Web." <http://www.wwwmetrics.com/>
Sullivan, Danny. "How Search Engines Work." SearchEngineWatch.com. <http://searchenginewatch.com/webmasters/work.html>
——. "Search Engine Features for Searchers." SearchEngineWatch.com. <http://searchenginewatch.com/facts/ataglance.html>
Cite this article
Pick a style below, and copy the text for your bibliography.
|
META SEARCH ENGINES.
Magazine article from: Online; 5/1/1999; ; 700+ words
; ...search engine searches. By using a meta search engine to search several engines at once and...how many search engines the meta engine searches sounds like...choosing a meta search engine, look for...displays the engines it ...
|
|
Search Engines: Do They Answer Your Questions?(Brief Article)
Newspaper article from: The Information Advisor; 10/1/2000; 700+ words
; ...reported most often about search engines. What aspect of search engines is being discussed, and what...Announcements of New Search Engines These articles announce or describe some specific new search engine. Some of the specific names...
|
|
Search engines sort out the tangled Web.(Family Times)(`Webwise')
Newspaper article from: The Washington Times; 7/1/1997; ; 700+ words
; ...name (http://www.(search engine name).com). Each of these search engines provides different features...yellow pages to local area site searches, and each will produce a different response to an identical search request. For the purpose...category or hierarchical search ...
|
|
Search engines
Magazine article from: Management Services; 8/1/2001; ; 700+ words
; ...directory of web sites. Search engines index the words in documents...need to choose the right search engine for a particular job because...than others for different searches. There are thousands of search engines to choose from, a list can...
|
|
Using search engines and web directories.
Magazine article from: Journal of School Health; 10/1/1998; ; 700+ words
; ...or updates the search engine database. Each engine searches a different database...garnered by each search engine even when exact...searching varies. Most search engines index sites either...Keyword indexing searches for significant...
|
|
Search engines: Your Internet dashboard
Magazine article from: Credit Union Management; 7/1/1997; ; 700+ words
; ...That's where the "search engine" comes in. Search engines are Web sites that...submitted to some engines and not others...likely settle on one engine that seems to consistently...get to the various search engines by typing in their...
|
|
Switching your search engines.(on the net)
Magazine article from: Online; 5/1/2007; ; 700+ words
; ...needs, a simple search at any Web search...as that search engine answers all queries...between the search engines' underlying databases...standard for search engines first introduced...quickly transfer searches from one database...under "Transfer Search ...
|
|
Using search engines effectively
Magazine article from: Beyond Numbers; 4/1/2003; ; 700+ words
; ...material-too often a search will yield pages and...the most out of search engines, some of which are...t actually a search engine, just a list of links...upper case: Some search engines are case sensitive, which means that if your search term is all in upper...case. ...
|
|
Search engines for the World Wide Web: visual quickstart guide.
Magazine article from: Technical Communication; 2/1/2003; ; 700+ words
; ...50% of the time on search hits, Of course...discover how to do my searches differently. And...be that "not all search engines are created equal." Some engines are better at ferreting...others. And every engine is different in the...efficient way to do searches without spending ...
|
|
Search engines sometimes run off rails
Newspaper article from: The Milwaukee Journal Sentinel; 2/23/1998; ; 700+ words
; ...firefighter." Also, search engines don't understand abbreviations...other approach (Boolean searches), you narrow the search by using words and phrases...marks. For rules on these searches, check the Help pages of the individual search engine, sometimes under "advanced...
|
|
Search Engines
Book article from: Computer Sciences
Search Engines A search engine is an information...All major search engines are similar in that...and maintenance. Search Engine Basics Search engines have three components...pages, the search engine cannot find it. Other...
|
|
search engine
Book article from: A Dictionary of the Internet
search engine The Internet and that part of it known...information can be at a severe disadvantage. Search engines were developed in order to speed up searches within the Internet. The first search engine was known as ARCHIE . It was developed...
|
|
Search Engine Strategy
Encyclopedia entry from: Gale Encyclopedia of E-Commerce
SEARCH ENGINE STRATEGY Most people...Web by using search engines like Yahoo!, Alta...for this is that every search engine works differently...different types of search engines — those that...directory-based engines, and link-based engines...their site ...
|
|
Internet Search Engine
Encyclopedia entry from: The Gale Encyclopedia of Science
...Engine An Internet search engine is a service that searches the Internet for specific items, following search terms specified by...leading Internet search engines, in decreasing order...first when a user searches for the term “...x201D; Because search ...
|
|
search engine submission
Book article from: A Dictionary of the Internet
search engine submission The process...submitting a WEB SITE to a SEARCH ENGINE so that it can be indexed and...retrieved by users of that search engine. The process of submission to a search engine is usually very simple: a...submitted to a number of search ...
|