1 deep web searching carl heine, ph.d. illinois mathematics and science academy

Post on 04-Jan-2016

217 Views

Category:

Documents

2 Downloads

Preview:

Click to see full reader

TRANSCRIPT

1

Deep Web SearchingDeep Web Searching

Carl Heine, Ph.D.Carl Heine, Ph.D.

Illinois Mathematics and Science AcademyIllinois Mathematics and Science Academy

2

Problem: Translating a question into a query

Problem: Choosing the best database

Problem: Finding better keywords

Problem: Recognizing information that’s relevant

Problem: Verifying the credibility of information

Illinois Technology Standards for Teachers: 2, 6, 7, 8

3

What digital natives don’t do well

What they tend to do instead

Turn a question into a queryRush ahead toward an answer, either grabbing the whole question “as is” or missing an important part of it

Select the ‘right’ database

Enter words or phrases into Google

Recognize information when they find it

Rush past important information and clues, continue to browse

Find better keywords Stick with their original words and browse

Verify the credibility of informationAccept what they find at face value, hoping somewhere in the information there is an answer

4

Problem: Selecting the right Database

This failure occurs before submitting the first query.

Task: Predict where expert information may be found.

No search engine performs a live Internet search when you submit a query.

That which is stored in one search engine’s database is invisible to another search engine (also called Enterprise Data)

5

Research Findings (IMSA second semester sophomores)

90% search with Google more than half of the time.

32% use Google exclusively.

Problem: Selecting the right Database

6

Modes of Searching

Table of Contents

Index

Flipping through the pages

Subject Directory

Search Engine

Browsing (surfing)

7

Subject

Directory

8

Search Engine

9

Browsing

10

Information that is stored in a different database than where you are searching now.

Deep Web, Invisible Web

Information that is “buried” deeper in the abstracts than results from another database.

11

Retrieval Speed e.g., Results 1 - 10 of about 111,000,000 (0.11 seconds) 

HTTP Error 404

Evidence of Database Searching

12

Robotic programs known as Crawlers or Spiders…

…Retrieve information from web pages…

…That is compiled into a database…

…Called out by queries passed through a search engine…

…Displayed as abstracts (snippets)

Database Searching

13

Information retrieved (Google): Words in the <title> Characters in the URL Words in the body Links from this page to another page

Database Searching

Information available (Google): One or more Words in the <title> Characters in the URL One or more Words in the body (except stop words) Pages that link to this page

<html> <head> <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"> <meta name="keywords" content="Accessibility, Web accessibility, alternative browsers, alternative browsing, assistive technologies"> <meta name="description" content="Reference list of alternative browsers and assistive technologies for people with disabilities using the Web."> <title>Alternative Web Browsing</title> <link rel="stylesheet" type="text/css" href="http://www.w3.org/WAI/style/wairesource.css"> </head>

14

Database Searching

Information available (Google): One or more Words in the <title> Characters in the URL One or more Words in the body (except stop words) Pages that link to this page

15

Strategies for navigating the Deep Web

Making an educated prediction:

1. Where would an expert post the information?

2. Can Google google it?

16

Strategies for navigating the Deep Web

Where would an expert post the information? Who might know the answer to my question?

Where could I find that person?

17

Strategies for navigating the Deep Web

Where would an expert post the information?

An obvious choice? Keywords: <subject matter> + database Instead of database, try archive, repository, historical,

information, facts, almanac

18

Strategies for navigating the Deep WebWhat classical composer's music is heard

most often in movies?

An obvious choice?

Keywords: movie classical music database

OR information OR facts OR …?

19

Strategies for navigating the Deep Web

How many movies has John Williams, the classical guitarist, scored?

An obvious choice?

Keywords: ?? +database

OR information OR facts OR …?

20

Strategies for navigating the Deep Web

What airplane finished the Sport Qualifying race in second place at the Reno Air Races, September 11, 2002?

An obvious choice?

Keywords: ?? +database

OR information OR facts OR …?

21

Strategies for navigating the Deep Web

Can Google google it?

Types of information that Google is NOT LIKELY to access:

1. Information stored behind a SEARCH BOX

22

Strategies for navigating the Deep Web

Can Google google it?

Types of information that Google is NOT LIKELY to access:

2. Information stored behind a PASSWORD

Investors.com

23

Strategies for navigating the Deep Web

Can Google google it?

3. Other types of information that Google is NOT LIKELY to access:

24

Strategies for navigating the Deep Web

Page Not Found www.archive.org

25

How, When and Where can Deep Web How, When and Where can Deep Web searching skills be taught?searching skills be taught?

Performance skills require hands-on practice

Interactive tutorials (example: Can Google google it?) MicroModules (example: Invisible Web) Timed Search Challenges (example: Broadway shows)

When and Where can these skills be taught? In the context of research paper preparation Library and research orientation Computer lab and non-digital applications

26keywords: information fluency

On line Resources for Information FluencyOn line Resources for Information Fluency

27

Contact Us!Contact Us!URL http://21cif.imsa.edu

General info 21cif@imsa.edu Carl Heine heine@imsa.eduDennis O’Connor doconnor@imsa.edu

Thank you for participating!

top related