internet research search engines & subject directories
TRANSCRIPT
![Page 1: Internet Research Search Engines & Subject Directories](https://reader036.vdocuments.net/reader036/viewer/2022062407/56649d985503460f94a8337e/html5/thumbnails/1.jpg)
Internet Research
Search Engines & Subject Directories
![Page 2: Internet Research Search Engines & Subject Directories](https://reader036.vdocuments.net/reader036/viewer/2022062407/56649d985503460f94a8337e/html5/thumbnails/2.jpg)
Search engines
• Search engines are the means by which most people search the Web.
• Common examples are Google, Altavista, Direct Hit.
![Page 3: Internet Research Search Engines & Subject Directories](https://reader036.vdocuments.net/reader036/viewer/2022062407/56649d985503460f94a8337e/html5/thumbnails/3.jpg)
Yet they don’t search the Internet
• Yet a search engine does not actually search the Web during your search.
• A search engine searches itself.
• It’s a three-step process.
![Page 4: Internet Research Search Engines & Subject Directories](https://reader036.vdocuments.net/reader036/viewer/2022062407/56649d985503460f94a8337e/html5/thumbnails/4.jpg)
1) Bots index words
• Search engines continually send out hundreds of “robots” or “bots” (or “spiders” or “crawlers” )
• Bots visit web sites, read word by word, and then index those words.
![Page 5: Internet Research Search Engines & Subject Directories](https://reader036.vdocuments.net/reader036/viewer/2022062407/56649d985503460f94a8337e/html5/thumbnails/5.jpg)
2) A database is created
• A huge database of Web sites thus is gathered and indexed by word.
• These databases can be huge, with millions of links.
![Page 6: Internet Research Search Engines & Subject Directories](https://reader036.vdocuments.net/reader036/viewer/2022062407/56649d985503460f94a8337e/html5/thumbnails/6.jpg)
3) The Interface gives you access
• Using the keywords you give it, a search engine then searches its own current index.
![Page 7: Internet Research Search Engines & Subject Directories](https://reader036.vdocuments.net/reader036/viewer/2022062407/56649d985503460f94a8337e/html5/thumbnails/7.jpg)
Interfaces are based on rankings
• Search engines return results based on a ranking system.
• Ranking is the order that files are listed when they are retrieved.
![Page 8: Internet Research Search Engines & Subject Directories](https://reader036.vdocuments.net/reader036/viewer/2022062407/56649d985503460f94a8337e/html5/thumbnails/8.jpg)
The ranking system is secret• These systems are proprietary and
often “secret.” In general:• Altavista ranks web pages higher if
your search terms are found in the first few words of the page
• Google ranks by document “popularity” with other similar searches
• Direct Hit ranks by the length of time other users spent at the site
![Page 9: Internet Research Search Engines & Subject Directories](https://reader036.vdocuments.net/reader036/viewer/2022062407/56649d985503460f94a8337e/html5/thumbnails/9.jpg)
Not even half the Web
• With all of this software and sophistication, even the best search engines cover only 40-50% of the Web.
• And they miss much else on the Internet.
![Page 10: Internet Research Search Engines & Subject Directories](https://reader036.vdocuments.net/reader036/viewer/2022062407/56649d985503460f94a8337e/html5/thumbnails/10.jpg)
Bots hit and miss
Bots miss: XML pages, pdf files Dynamically created HTML pages Frames-based pages New pages or recent updated text Some say the Invisible Web is 500
times larger than Web
![Page 11: Internet Research Search Engines & Subject Directories](https://reader036.vdocuments.net/reader036/viewer/2022062407/56649d985503460f94a8337e/html5/thumbnails/11.jpg)
Subject Directories
• A subject directory is also a database of web sites and references.
• But a subject directory is organized not by keywords but by category or subject.
![Page 12: Internet Research Search Engines & Subject Directories](https://reader036.vdocuments.net/reader036/viewer/2022062407/56649d985503460f94a8337e/html5/thumbnails/12.jpg)
Yahoo!
• Yahoo! Is the most popular subject directory.
• www. about.com takes the idea a step further with subject guides for selected topics.
![Page 13: Internet Research Search Engines & Subject Directories](https://reader036.vdocuments.net/reader036/viewer/2022062407/56649d985503460f94a8337e/html5/thumbnails/13.jpg)
Subjects are organized by people.
• Information is selected, organized and cataloged by a person, not software.
• You can usually be more assured that the search results will make sense.
![Page 14: Internet Research Search Engines & Subject Directories](https://reader036.vdocuments.net/reader036/viewer/2022062407/56649d985503460f94a8337e/html5/thumbnails/14.jpg)
You get an index of sites.
• Subject directories will not often provide you with ranked web sites.
• Instead, you will get a broad index related to your topic, divided further by subheadings.
![Page 15: Internet Research Search Engines & Subject Directories](https://reader036.vdocuments.net/reader036/viewer/2022062407/56649d985503460f94a8337e/html5/thumbnails/15.jpg)
Use for early searching.• Use a subject directory
early in your search process to learn about your subject.
• You will get fewer links of higher quality.
• When you get more specific questions, you should use a search engine.