NRCCL (University of Oslo, Faculty of Law)
Hyperlinks and search engines
Jon Bing, NRCCL, Department of Private Law
Master lecture 13 November 2007
Web page with hyperlink
HTML rendering of hyperlink
<a href="http://www.lovdata.no/litt/index.html">
Books (from Lovdatas webpages)</a><br>
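To make the anatomy of the anchor element concrete, the following is a minimal sketch (not part of the lecture) that reads the snippet above with Python's standard library and separates the target URL from the visible link text. The class name `AnchorParser` is a hypothetical helper introduced only for illustration.

```python
from html.parser import HTMLParser
from urllib.parse import urlsplit


class AnchorParser(HTMLParser):
    """Hypothetical helper: collects (href, link text) pairs from <a> elements."""

    def __init__(self):
        super().__init__()
        self.anchors = []
        self._href = None   # href of the anchor currently open, if any
        self._text = []     # text fragments seen inside that anchor

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            # Collapse line breaks and runs of spaces in the link text.
            text = " ".join("".join(self._text).split())
            self.anchors.append((self._href, text))
            self._href = None


# The snippet from the slide, copied verbatim.
snippet = '''<a href="http://www.lovdata.no/litt/index.html">
Books (from Lovdatas webpages)</a><br>'''

parser = AnchorParser()
parser.feed(snippet)
parser.close()

href, text = parser.anchors[0]
target = urlsplit(href)  # splits the URL into scheme, host, path, ...
print(text, "->", target.hostname, target.path)
```

The point of the split is that the user only sees the link text, while the `href` attribute carries the address that the browser will request.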
Request triggered by hyperlink
[Diagram: User, ISP, Internet, Content provider. Labels: Request/URL; Resolving URL; File with media type]
• Initialising request
• IP address resolved by name server
• Request forwarded to resource
• Requested resource copied and communicated with a media type definition
• If media type other than HTML, appropriate plug-in looked for
• If plug-in not found, user requested to select
• If plug-in found, program loaded, resource displayed or performed
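The steps above can be sketched roughly as follows. This is an illustrative simplification under stated assumptions, not how any particular browser works: the plug-in table `PLUGINS` and the function names are hypothetical, and the name-server lookup is passed in as a parameter so the sketch can be tried without network access (the default, `socket.gethostbyname`, performs a real lookup).

```python
import socket
from urllib.parse import urlsplit

# Hypothetical plug-in registry (assumption for illustration): media type -> program.
PLUGINS = {
    "application/pdf": "pdf-viewer",
    "audio/mpeg": "audio-player",
}


def describe_request(url):
    """Initialising the request: split the URL into the parts the client needs."""
    parts = urlsplit(url)
    return {"scheme": parts.scheme, "host": parts.hostname, "path": parts.path}


def resolve(host, resolver=socket.gethostbyname):
    """Resolve the host name to an IP address via a name server lookup."""
    return resolver(host)


def handle_resource(media_type, plugins=PLUGINS):
    """Decide what the client does with a resource of the given media type."""
    # Drop parameters such as "; charset=utf-8" before comparing.
    base_type = media_type.split(";")[0].strip().lower()
    if base_type == "text/html":
        return "render as HTML"
    plugin = plugins.get(base_type)
    if plugin is None:
        return "ask user to select a program"
    return f"load {plugin} and display or perform resource"


request = describe_request("http://www.lovdata.no/litt/index.html")
print(request["host"], request["path"])
print(handle_resource("application/pdf"))
```

The media type travels with the copied resource, which is why the decision in `handle_resource` can be made on the client side without inspecting the file itself.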
Search engine indexing and caching
[Diagram: Site 1 and Site 2; identifying sites via hyperlink; extract; indexing of extract; add-on to existing index; caching of extract; current index; search request; matching; list of matching pages (KWIC)]
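The "identifying sites" step in the diagram can be illustrated with a small sketch of breadth-first link following. It is an assumption-laden toy, not a description of any real search engine: `discover` and `LinkCollector` are hypothetical names, and pages are supplied by a `fetch` function (here an in-memory dictionary with placeholder addresses) rather than fetched from the network.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Hypothetical helper: collects the href values of all <a> elements."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)


def discover(start_url, fetch):
    """Follow hyperlinks breadth-first; fetch(url) returns HTML text or None."""
    seen = {start_url}
    queue = deque([start_url])
    visited = []
    while queue:
        url = queue.popleft()
        html = fetch(url)
        if html is None:          # page not available, nothing to follow
            continue
        visited.append(url)
        collector = LinkCollector()
        collector.feed(html)
        for href in collector.links:
            target = urljoin(url, href)   # make relative links absolute
            if target not in seen:
                seen.add(target)
                queue.append(target)
    return visited


# Placeholder "web" with two sites that link to each other.
web = {
    "http://site1.example/": '<a href="http://site2.example/">two</a>',
    "http://site2.example/": '<a href="/page.html">p</a>'
                             '<a href="http://site1.example/">back</a>',
}
print(discover("http://site1.example/", web.get))
```

Site 2 is found only because Site 1 links to it, which is the role the hyperlink plays in the diagram.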
Search engine indexing
• Identifying new sites by following hyperlinks
• Extracting part of the material from top
• Copying extract to home site
• Indexing of extract
• Storing of extract
• Extract stored because re-indexed frequently, using new algorithms
• Displaying KWIC (keyword in context) for relevance function
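The extract, index, store and KWIC steps listed above can be sketched as follows. This is a minimal illustration under assumptions, not a real search engine: the extract length `EXTRACT_CHARS`, the function names and the sample pages are all hypothetical.

```python
import re
from collections import defaultdict

EXTRACT_CHARS = 200  # assumption: how much of the top of each page is copied


def tokenize(text):
    """Split text into lower-case words."""
    return re.findall(r"\w+", text.lower())


def build_index(pages, extract_chars=EXTRACT_CHARS):
    """pages maps URL -> full text. Returns (cache of extracts, inverted index)."""
    cache = {}
    index = defaultdict(set)
    for url, text in pages.items():
        extract = text[:extract_chars]   # extracting from the top of the page
        cache[url] = extract             # stored copy, available for re-indexing
        for word in tokenize(extract):
            index[word].add(url)         # indexing of the extract
    return cache, index


def kwic(extract, term, width=20):
    """Return keyword-in-context snippets for term within the stored extract."""
    pattern = r"\b" + re.escape(term) + r"\b"
    snippets = []
    for match in re.finditer(pattern, extract, flags=re.IGNORECASE):
        start = max(0, match.start() - width)
        end = min(len(extract), match.end() + width)
        snippets.append(extract[start:end])
    return snippets


def search(term, cache, index):
    """Match a one-word search request and return KWIC snippets per page."""
    urls = sorted(index.get(term.lower(), ()))
    return {url: kwic(cache[url], term) for url in urls}


# Placeholder pages, invented for the example.
pages = {
    "http://example.org/a": "Copyright law and hyperlinks on the web",
    "http://example.org/b": "Search engines cache extracts",
}
cache, index = build_index(pages)
result = search("cache", cache, index)
print(result)
```

Note that the KWIC display is produced from the stored extract, not from the live page, so showing it presupposes that a copy has been kept at the search engine's site.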