Internet Spider

Encyclopedia of Espionage, Intelligence, and Security

An Internet spider is a program designed to "crawl" over the World Wide Web, the portion of the Internet most familiar to general users, and retrieve the locations of Web pages. It is sometimes referred to as a Web crawler. Many search engines use crawlers to obtain links, which are filed away in an index. When a user asks for information on a particular subject, the search engine consults this index and returns the pages the spider found. Without spiders, the vast richness of the Web would be all but inaccessible to most users, rather as the Library of Congress would be if its books were not organized.
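The idea of an index that is consulted at query time can be illustrated with a minimal sketch. It assumes a simple "inverted index" that maps each word to the pages containing it. The page addresses, the sample text, and the function names build_index and search are hypothetical, chosen only for illustration, and real search engines are far more elaborate.

```python
import re
from collections import defaultdict

# Hypothetical pages a spider might have gathered (address -> text).
PAGES = {
    "http://example.com/spiders": "Web spiders crawl pages and follow links",
    "http://example.com/library": "A library organizes books in a catalogue",
    "http://example.com/engines": "Search engines index pages gathered by spiders",
}


def tokenize(text):
    """Split text into lowercase words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(pages):
    """Map every word to the set of page addresses that contain it."""
    index = defaultdict(set)
    for url, text in pages.items():
        for word in tokenize(text):
            index[word].add(url)
    return index


def search(index, query):
    """Return the addresses of pages containing every word in the query."""
    words = tokenize(query)
    if not words:
        return set()
    results = set(index.get(words[0], set()))
    for word in words[1:]:
        results &= index.get(word, set())
    return results


index = build_index(PAGES)
print(sorted(search(index, "spiders pages")))
```

Because the lookup touches only the index, a query never has to reread the pages themselves, which is what makes searching a very large collection practical.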

Some search engines are human-based, meaning that they rely on people to submit links and other information, which the search engine then categorizes, catalogues, and indexes. Most search engines today combine human and crawler input. Crawler-based engines send out spiders, computer programs that have sometimes been likened to viruses because they appear to move between, and insert themselves into, other areas of cyberspace. The comparison is loose: a spider runs on the search engine's own computers and simply requests pages from other sites, much as a browser does.

Spiders visit Web sites, record the information there, read the meta tags that describe a site's subjects, and follow the site's links to other pages. Because pages are so heavily interlinked, a spider can start at almost any point on the Web and keep moving. The data it gathers is returned to the search engine's central repository, where it is organized and stored. Periodically the crawler revisits sites to check for changes, but until it does, the search engine's index continues to reflect the earlier visit. This is why a search may turn up "dead" Web pages, ones that have since moved or been removed and can no longer be found.
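The crawling loop described here (visit a page, record it, read its meta tags, follow its links) can be sketched in a few lines. This is an illustrative sketch, not any particular engine's spider. To stay self-contained it reads from a small hypothetical in-memory "Web" (the PAGES dictionary) instead of fetching over the network, and the names PageParser and crawl are assumptions made for the example.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin

# Hypothetical stand-in for the Web: address -> HTML.
# A real spider would fetch each page over HTTP instead.
PAGES = {
    "http://example.com/": (
        '<html><head><meta name="keywords" content="spiders, search"></head>'
        '<body><a href="/a.html">A</a> <a href="/b.html">B</a></body></html>'
    ),
    "http://example.com/a.html": (
        '<html><body><a href="/">home</a> <a href="/gone.html">gone</a></body></html>'
    ),
    "http://example.com/b.html": (
        '<html><head><meta name="keywords" content="index"></head>'
        "<body>No links here.</body></html>"
    ),
}


class PageParser(HTMLParser):
    """Collects link targets and the keywords meta tag from one page."""

    def __init__(self):
        super().__init__()
        self.links = []
        self.keywords = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "a" and attrs.get("href"):
            self.links.append(attrs["href"])
        elif tag == "meta" and (attrs.get("name") or "").lower() == "keywords":
            content = attrs.get("content") or ""
            self.keywords = [k.strip() for k in content.split(",") if k.strip()]


def crawl(start_url, fetch):
    """Breadth-first crawl; fetch(url) returns HTML, or None if the page is gone."""
    seen = {start_url}
    queue = deque([start_url])
    records, dead = {}, []
    while queue:
        url = queue.popleft()
        html = fetch(url)
        if html is None:
            dead.append(url)  # a "dead" link: the page can no longer be found
            continue
        parser = PageParser()
        parser.feed(html)
        links = [urljoin(url, href) for href in parser.links]
        records[url] = {"keywords": parser.keywords, "links": links}
        for link in links:
            if link not in seen:  # never queue the same page twice
                seen.add(link)
                queue.append(link)
    return records, dead


records, dead = crawl("http://example.com/", PAGES.get)
print(sorted(records))
print(dead)
```

The seen set is what lets the spider keep moving through heavily interlinked pages without circling forever, and the dead list shows how a link recorded on one page can point to a page that no longer exists.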

No two search engines are exactly alike. One reason, among others, is that each uses a different algorithm to search its index. Algorithms can be tuned to weigh how often certain keywords appear, and even to counter keyword stuffing, or "spamdexing," the insertion of irrelevant search terms intended simply to draw traffic to a site.
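As a rough illustration of keyword-frequency scoring with a guard against stuffing, the sketch below scores a page by the share of its words that match a keyword, and discards pages where that share is implausibly high. The function name keyword_score, the sample texts, and the 20 percent cutoff are assumptions made for the example. Actual search engines use their own, far more sophisticated methods.

```python
import re
from collections import Counter


def keyword_score(text, keyword, max_density=0.2):
    """Score a page by keyword density, treating extreme density as stuffing.

    max_density is an arbitrary illustrative cutoff, not a real engine's value.
    """
    words = re.findall(r"[a-z0-9]+", text.lower())
    if not words:
        return 0.0
    density = Counter(words)[keyword.lower()] / len(words)
    if density > max_density:
        return 0.0  # likely keyword stuffing: give the page no credit
    return density


honest = "Spiders crawl the Web and follow links between pages"
stuffed = "spiders spiders spiders spiders cheap spiders deals"

print(keyword_score(honest, "spiders"))
print(keyword_score(stuffed, "spiders"))
```

Under this scheme the honest page receives a small positive score, while the stuffed page, in which the keyword makes up most of the text, scores zero.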

FURTHER READING:

BOOKS:

Cheong, Fah-Chun. Internet Agents: Spiders, Wanderers, Brokers, and 'Bots. Indianapolis, IN: New Riders, 1996.

Sherman, Chris, and Gary Price. The Invisible Web: Uncovering Information Sources Search Engines Can't See. Medford, NJ: Cyber Age Books, 2001.

Young, Gray. The Internet. New York: H. W. Wilson, 1998.

SEE ALSO

Computer Virus
Internet: Dynamic and Static Addresses
Internet Spam and Fraud
Internet Surveillance
Internet Tracking and Tracing


Copyright 2004, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.
