nrccl (university of oslo, faculty of law) hyperlinks and search engines jon bing nrccl, department...

7
NRCCL (University of Oslo, Faculty of Law) Hyperlinks and search engines Jon Bing NRCCL, Department of Private Law Master lecture 13 November 2007

Upload: annabella-hamilton

Post on 19-Jan-2016

213 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: NRCCL (University of Oslo, Faculty of Law) Hyperlinks and search engines Jon Bing NRCCL, Department of Private Law Master lecture 13 November 2007

NRCCL (University of Oslo, Faculty of Law)

Hyperlinks and search engines

Jon Bing NRCCL, Department of Private Law

Master lecture 13 November 2007

Page 2: NRCCL (University of Oslo, Faculty of Law) Hyperlinks and search engines Jon Bing NRCCL, Department of Private Law Master lecture 13 November 2007

NRCCL (University of Oslo, Faculty of Law)

Web page with hyperlink

Page 3: NRCCL (University of Oslo, Faculty of Law) Hyperlinks and search engines Jon Bing NRCCL, Department of Private Law Master lecture 13 November 2007

NRCCL (University of Oslo, Faculty of Law)

HTML rendering of hyperlink

<a href="http://www.lovdata.no/litt/index.html">

Books (from Lovdatas webpages)</a><br>

Page 4: NRCCL (University of Oslo, Faculty of Law) Hyperlinks and search engines Jon Bing NRCCL, Department of Private Law Master lecture 13 November 2007

NRCCL (University of Oslo, Faculty of Law)

Request triggered by hyperlink

`

User

Request/URL

Resolving URL

Internet

Content provider

File with media type

ISP

Page 5: NRCCL (University of Oslo, Faculty of Law) Hyperlinks and search engines Jon Bing NRCCL, Department of Private Law Master lecture 13 November 2007

NRCCL (University of Oslo, Faculty of Law)

• Initialising request• IP address resolved by name server• Request forwarded to resource• Requested resource copied and communicated with

a media type definition• If media type other than HTML, appropriate plug-in

looked for• If plug-in not found, user requested to select• If plug-in found, program loaded, resource displayed

or performed

Page 6: NRCCL (University of Oslo, Faculty of Law) Hyperlinks and search engines Jon Bing NRCCL, Department of Private Law Master lecture 13 November 2007

NRCCL (University of Oslo, Faculty of Law)

Search engine indexing and caching

Site 1

Site 2Extract Indexing of

extract

Add-on existing incex

Caching of extract

Search request

matchingIist of matching pages - KWIC

Idetifying sites

Hyperlink

`

Current index

Page 7: NRCCL (University of Oslo, Faculty of Law) Hyperlinks and search engines Jon Bing NRCCL, Department of Private Law Master lecture 13 November 2007

NRCCL (University of Oslo, Faculty of Law)

Search engine indexing

• Identifying new sites by following hyperlinks• Extracting part of the material from top• Copying extract to home site• Indexing of extract• Storing of extract

• Extract stored because re-indexed frequently, using new algorithms

• Displaying KWIC for relevance function