web location services

Post on 23-Feb-2016

34 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

DESCRIPTION

Web Location Services. Historical Background. In use up to 1996. An internet server from which hierarchically-organised text files could be retrieved from allover the world. Developed at the University of Minnesota, whose sports teams are called “The Golden Gophers”. - PowerPoint PPT Presentation

TRANSCRIPT

Historical Background

An internet server from which hierarchically-organised text files could be retrieved from allover the world.

Developed at the University of Minnesota, whose sports teams are called “The Golden Gophers”

HyperGopher could also display Gif and Jpeg graphic images.

Three important Gopher applications were Veronica, Archie and Jughead (named after comic characters!)

In use up to 1996

Lost its importance and relevance with the introduction of the first graphical browser, Mosaic

Historical Background

A Gopher Application software

Historical Background

Wide Area Information Servers Pronounced : “Ways”Co-existed with the Gopher technology

Files had to be stored in a predetermined manner in databases

The WAIS user had to connect to known databases in order to retrieve information or files Had the same fate as Gophers –

became superfluous with the introduction of browsers and search engines

Wide Area Information Servers

The primitive XWAIS

Historical Background

Wide Area Information Servers

An improved user interface

Databases

Historical Background

Wide Area Information Servers

Historical Background

The Windows version

A proper search engine

An example of a catalogueYahoo actually uses Google for its

search engine!

More of a complete front end for the internet than just a search engine!MSN is another such site

A search engine needs to employ a Web Crawler (a sub group of “software agents”) with an unusual degree of autonomy! These are referred to as

“Bots” (short for robots), and “Spiders”

Spiders or crawlers normally start out with a historical list of links,

such as server lists, and lists of the most popular or best sites, and

follow the links to these pages to find more links to add to the

database.

Yes unfortunately this theme enters the realm of Search Engines too!

There is a strong rumour that Search Engine companies sell their information about you. Just think – you have recently made a search for hotel quotes and suddenly you start receiving emails advertising hotels and holidays. Is it really a coincidence?

…allow the owner of the page to specify key words and concepts under which the page will be indexed.

Example: <html>

<head><title>My World</title></head><body><p align="center"><font size="7">Welcome to My World</font></p></body>

</html>

Example:Such an html code would give the following search result:

My WorldWelcome to My World

Example: Fixing this using Meta Tags:<html>

<head><title>My World</title><meta name="description" content="Everything you ever wanted to know about stamps from prices to history."><meta name="keywords" content="stamps, stamp collecting, stamp history, stamp prices, stamps for sale."></head>

</html>

Example: The search result this time will be:

My WorldEverything you wanted to know about stamps, from prices to history.

And: All the terms joined by “AND” must appear in the pages or documents. Some search engines substitute the operator “+” for the word AND OR: At least one of the terms joined by “OR” must appear in the pages or documents NOT: The term or terms followed by “NOT” must not appear in the pages or documents. Some search engines substitute the operator “-” for the word NOT

Followed by: One of the terms must be directly followed by the other

Near: One of the terms must be directly followed by the other.

Quotation marks: Words between quotation marks are treated as a phrase, and that phrase must be found within the document or file.

Search engines that return results from a particular website only Usually created by

programs that generate a Java Script code

One such program is SearchMaker Pro

seaRchmAker[16]=new Array("http://www.geocities.com/powerpoint95forschool/quizcorrect.htm","New Page 1","","Use your browser's Back button please! ");seaRchmAker[17]=new Array("http://www.geocities.com/powerpoint95forschool/quiz3wrong.htm","New Page 1","","Use your browser's Back button please! &nbsp; ");seaRchmAker[18]=new Array("http://www.geocities.com/powerpoint95forschool/shotGallery.htm","ScreenShots","","Move your mouse to the left-hand side of the screen to access your menu. &nbsp; &nbsp; ");seaRchmAker[19]=new Array………………

<HTML><HEAD><SCRIPT language=JavaScript><!--

// Script generated by Search Maker Pro:

seaRchmAker=new Array();seaRchmAker[1]=new Array………………

Webmaster utility sites like Bravenet.com and Cgispy.com offer their services to create a site search engine…for free.

If you know of a specialized search engine such as SearchNetworking that matches your subject (for example, Networking), you'll save time by using that search engine.

If there isn't a specialized search engine, try Yahoo. Sometimes you'll find a matching subject category or two and that's all you'll need.

If Yahoo doesn't turn up anything, try AltaVista, Google, Hotbot, Lycos, and perhaps other search engines for their results. Depending on how important the search is, you usually don't need to go below the first 20 entries on each.

For efficiency, consider using a ferret that will use a number of search engines simultaneously for you.

A good ferret program is Copernic 2001 downloadable from http://www.copernic.com

Another similar ferret software is WebFerret by ZDNet – downloadable for free.

If you feel it's necessary, also search the Usenet newsgroups. As you continue to search, keep rethinking your search arguments. What new approaches could you use? What are some related subjects to search for that might lead you to the one you really want?

At this point, if you haven't found what you need, consider using the subject directory approach to searching. Look at Yahoo or someone else's structured organization of subject categories and see if you can narrow down a category your term or phrase is likely to be in. If nothing else, this may give you ideas for new search phrases.

Hell-bent in indexing anything and everything!

Over 30 million pages indexed! (until July 96)

http://www.altavista.com

Claims to have indexed even more pages than Alta Vista

http://www.excite.com

Indexes the full text of less than a million pages (until May 96), but stores more URL’s

http://www.opentext.com

Claims that it can grow with the web and index it entirely no matter how big!

http://www.hotbot.com

Uses multiple spiders – up to 4 at a time!

At peak performance the system could crawl over 100 pages per second generating around 600 kilobytes of data per second!

http://www.google.com

It is simply an online game using Google!

The contestant types two words in Google and hopes to receive ONE result only.

When this happens it is called a Pure Whack. The contestant can then submit it to www.googlewhack.com where it will be posted.

One such Pure Whack is the result of the search for “ambidextrous scallywags”!

http://www.googlewhack.com

Which used to be available at the following address:

http://wwww.cs.colorado.edu/home/mcbryan/WWWW.html

For more information go to the following URL:

http://www.inf.utfsm.cl/~vparada/html/wwww.html#instr

A Search Engine proper is a database and the tools to

generate that database and search it.

A Catalogue is an organisational method and a

related database, plus tools for generating it.

top related