1 deep web searching carl heine, ph.d. illinois mathematics and science academy

27
1 Deep Web Searching Deep Web Searching Carl Heine, Ph.D. Carl Heine, Ph.D. Illinois Mathematics and Science Academy Illinois Mathematics and Science Academy

Upload: brandon-abel-bruce

Post on 04-Jan-2016

217 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: 1 Deep Web Searching Carl Heine, Ph.D. Illinois Mathematics and Science Academy

1

Deep Web SearchingDeep Web Searching

Carl Heine, Ph.D.Carl Heine, Ph.D.

Illinois Mathematics and Science AcademyIllinois Mathematics and Science Academy

Page 2: 1 Deep Web Searching Carl Heine, Ph.D. Illinois Mathematics and Science Academy

2

Problem: Translating a question into a query

Problem: Choosing the best database

Problem: Finding better keywords

Problem: Recognizing information that’s relevant

Problem: Verifying the credibility of information

Illinois Technology Standards for Teachers: 2, 6, 7, 8

Page 3: 1 Deep Web Searching Carl Heine, Ph.D. Illinois Mathematics and Science Academy

3

What digital natives don’t do well

What they tend to do instead

Turn a question into a queryRush ahead toward an answer, either grabbing the whole question “as is” or missing an important part of it

Select the ‘right’ database

Enter words or phrases into Google

Recognize information when they find it

Rush past important information and clues, continue to browse

Find better keywords Stick with their original words and browse

Verify the credibility of informationAccept what they find at face value, hoping somewhere in the information there is an answer

Page 4: 1 Deep Web Searching Carl Heine, Ph.D. Illinois Mathematics and Science Academy

4

Problem: Selecting the right Database

This failure occurs before submitting the first query.

Task: Predict where expert information may be found.

No search engine performs a live Internet search when you submit a query.

That which is stored in one search engine’s database is invisible to another search engine (also called Enterprise Data)

Page 5: 1 Deep Web Searching Carl Heine, Ph.D. Illinois Mathematics and Science Academy

5

Research Findings (IMSA second semester sophomores)

90% search with Google more than half of the time.

32% use Google exclusively.

Problem: Selecting the right Database

Page 6: 1 Deep Web Searching Carl Heine, Ph.D. Illinois Mathematics and Science Academy

6

Modes of Searching

Table of Contents

Index

Flipping through the pages

Subject Directory

Search Engine

Browsing (surfing)

Page 7: 1 Deep Web Searching Carl Heine, Ph.D. Illinois Mathematics and Science Academy

7

Subject

Directory

Page 8: 1 Deep Web Searching Carl Heine, Ph.D. Illinois Mathematics and Science Academy

8

Search Engine

Page 9: 1 Deep Web Searching Carl Heine, Ph.D. Illinois Mathematics and Science Academy

9

Browsing

Page 10: 1 Deep Web Searching Carl Heine, Ph.D. Illinois Mathematics and Science Academy

10

Information that is stored in a different database than where you are searching now.

Deep Web, Invisible Web

Information that is “buried” deeper in the abstracts than results from another database.

Page 11: 1 Deep Web Searching Carl Heine, Ph.D. Illinois Mathematics and Science Academy

11

Retrieval Speed e.g., Results 1 - 10 of about 111,000,000 (0.11 seconds) 

HTTP Error 404

Evidence of Database Searching

Page 12: 1 Deep Web Searching Carl Heine, Ph.D. Illinois Mathematics and Science Academy

12

Robotic programs known as Crawlers or Spiders…

…Retrieve information from web pages…

…That is compiled into a database…

…Called out by queries passed through a search engine…

…Displayed as abstracts (snippets)

Database Searching

Page 13: 1 Deep Web Searching Carl Heine, Ph.D. Illinois Mathematics and Science Academy

13

Information retrieved (Google): Words in the <title> Characters in the URL Words in the body Links from this page to another page

Database Searching

Information available (Google): One or more Words in the <title> Characters in the URL One or more Words in the body (except stop words) Pages that link to this page

<html> <head> <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"> <meta name="keywords" content="Accessibility, Web accessibility, alternative browsers, alternative browsing, assistive technologies"> <meta name="description" content="Reference list of alternative browsers and assistive technologies for people with disabilities using the Web."> <title>Alternative Web Browsing</title> <link rel="stylesheet" type="text/css" href="http://www.w3.org/WAI/style/wairesource.css"> </head>

Page 14: 1 Deep Web Searching Carl Heine, Ph.D. Illinois Mathematics and Science Academy

14

Database Searching

Information available (Google): One or more Words in the <title> Characters in the URL One or more Words in the body (except stop words) Pages that link to this page

Page 15: 1 Deep Web Searching Carl Heine, Ph.D. Illinois Mathematics and Science Academy

15

Strategies for navigating the Deep Web

Making an educated prediction:

1. Where would an expert post the information?

2. Can Google google it?

Page 16: 1 Deep Web Searching Carl Heine, Ph.D. Illinois Mathematics and Science Academy

16

Strategies for navigating the Deep Web

Where would an expert post the information? Who might know the answer to my question?

Where could I find that person?

Page 17: 1 Deep Web Searching Carl Heine, Ph.D. Illinois Mathematics and Science Academy

17

Strategies for navigating the Deep Web

Where would an expert post the information?

An obvious choice? Keywords: <subject matter> + database Instead of database, try archive, repository, historical,

information, facts, almanac

Page 18: 1 Deep Web Searching Carl Heine, Ph.D. Illinois Mathematics and Science Academy

18

Strategies for navigating the Deep WebWhat classical composer's music is heard

most often in movies?

An obvious choice?

Keywords: movie classical music database

OR information OR facts OR …?

Page 19: 1 Deep Web Searching Carl Heine, Ph.D. Illinois Mathematics and Science Academy

19

Strategies for navigating the Deep Web

How many movies has John Williams, the classical guitarist, scored?

An obvious choice?

Keywords: ?? +database

OR information OR facts OR …?

Page 20: 1 Deep Web Searching Carl Heine, Ph.D. Illinois Mathematics and Science Academy

20

Strategies for navigating the Deep Web

What airplane finished the Sport Qualifying race in second place at the Reno Air Races, September 11, 2002?

An obvious choice?

Keywords: ?? +database

OR information OR facts OR …?

Page 21: 1 Deep Web Searching Carl Heine, Ph.D. Illinois Mathematics and Science Academy

21

Strategies for navigating the Deep Web

Can Google google it?

Types of information that Google is NOT LIKELY to access:

1. Information stored behind a SEARCH BOX

Page 22: 1 Deep Web Searching Carl Heine, Ph.D. Illinois Mathematics and Science Academy

22

Strategies for navigating the Deep Web

Can Google google it?

Types of information that Google is NOT LIKELY to access:

2. Information stored behind a PASSWORD

Investors.com

Page 23: 1 Deep Web Searching Carl Heine, Ph.D. Illinois Mathematics and Science Academy

23

Strategies for navigating the Deep Web

Can Google google it?

3. Other types of information that Google is NOT LIKELY to access:

Page 24: 1 Deep Web Searching Carl Heine, Ph.D. Illinois Mathematics and Science Academy

24

Strategies for navigating the Deep Web

Page Not Found www.archive.org

Page 25: 1 Deep Web Searching Carl Heine, Ph.D. Illinois Mathematics and Science Academy

25

How, When and Where can Deep Web How, When and Where can Deep Web searching skills be taught?searching skills be taught?

Performance skills require hands-on practice

Interactive tutorials (example: Can Google google it?) MicroModules (example: Invisible Web) Timed Search Challenges (example: Broadway shows)

When and Where can these skills be taught? In the context of research paper preparation Library and research orientation Computer lab and non-digital applications

Page 26: 1 Deep Web Searching Carl Heine, Ph.D. Illinois Mathematics and Science Academy

26keywords: information fluency

On line Resources for Information FluencyOn line Resources for Information Fluency

Page 27: 1 Deep Web Searching Carl Heine, Ph.D. Illinois Mathematics and Science Academy

27

Contact Us!Contact Us!URL http://21cif.imsa.edu

General info [email protected] Carl Heine [email protected] O’Connor [email protected]

Thank you for participating!