a user evaluation of hierarchical phrase browsing

29
A User Evaluation of Hierarchical Phrase Browsing [kde2, dmn, kthomson, ihw]@cs.waikato.ac.nz New Zealand Digital Library Project Department of Computer Science University of Waikato New Zealand nzdl.org Katrina D. Edgar, David M. Nichols, Gordon W. Paynter, Kirsten Thomson and Ian H. Witten [email protected] INFOMINE Project University of California, Riverside USA infomine.ucr.edu

Upload: erin-love

Post on 31-Dec-2015

37 views

Category:

Documents


0 download

DESCRIPTION

A User Evaluation of Hierarchical Phrase Browsing. Katrina D. Edgar, David M. Nichols, Gordon W. Paynter, Kirsten Thomson and Ian H. Witten. [kde2, dmn, kthomson, ihw]@cs.waikato.ac.nz. [email protected]. New Zealand Digital Library Project - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: A User Evaluation of Hierarchical Phrase Browsing

A User Evaluation of Hierarchical Phrase Browsing

[kde2, dmn, kthomson, ihw]@cs.waikato.ac.nz

New Zealand Digital Library

Project

Department of Computer Science

University of Waikato

New Zealand

nzdl.org

Katrina D. Edgar, David M. Nichols, Gordon W. Paynter,

Kirsten Thomson and Ian H. Witten

[email protected]

INFOMINE Project

University of California, Riverside

USA

infomine.ucr.edu

Page 2: A User Evaluation of Hierarchical Phrase Browsing

ECDL 2003 2

Overview

Background: searching, browsing, …Inferring Hierarchical Phrase StructurePhind: an interface for phrase browsingEvaluating PhindUser StudyResultsConclusion

Page 3: A User Evaluation of Hierarchical Phrase Browsing

ECDL 2003 3

Access

SearchBrowsing– Subject– Metadata– Textual documents

• Concordance

Hierarchical Phrase Browsing

Page 4: A User Evaluation of Hierarchical Phrase Browsing

ECDL 2003 4

Identifying phrases

The basic insight of the phrase-finding method is that any phrase which appears more than once can be replaced by a grammatical rule that generates the phrase, and that this process can be continued recursively. The result is a hierarchical representation of the original sequence.

• Nevill-Manning et al, IJDL, (1999)

Page 5: A User Evaluation of Hierarchical Phrase Browsing

ECDL 2003 5

Extracting nice phrases

Extract text from HTML– Stopwords, punctuation delimiters

Create overlapping phrase hierarchy– Each phrase has a set of expansions which are the longer phrases

that contain it

– Only repeated phrases

– Maximal length condition• No unique expansion in either direction• Different LHS and RHS contexts

Turn phrase hierarchy into an interactive interface Phind

• Paynter et al, Proc. DL (2000)

Page 6: A User Evaluation of Hierarchical Phrase Browsing

Prune trivial expansions

Phrases that occur twice or more

Page 7: A User Evaluation of Hierarchical Phrase Browsing

ECDL 2003 7

Example

FAO on the Internet CD-ROM (1998)– Food and Agriculture Organization

187 MB of HTML30 mins to extract phrases28 MB of index files

Page 8: A User Evaluation of Hierarchical Phrase Browsing

ECDL 2003 8

Phind Interface

Java applet in Web pages– Just another means

of access

2 main panels

Page 9: A User Evaluation of Hierarchical Phrase Browsing
Page 10: A User Evaluation of Hierarchical Phrase Browsing
Page 11: A User Evaluation of Hierarchical Phrase Browsing
Page 12: A User Evaluation of Hierarchical Phrase Browsing
Page 13: A User Evaluation of Hierarchical Phrase Browsing
Page 14: A User Evaluation of Hierarchical Phrase Browsing
Page 15: A User Evaluation of Hierarchical Phrase Browsing

ECDL 2003 15

Previously we have claimed about Phind…

Good points– Automatically created– Cheap and scalable

Bad points– uncontrolled vocabulary (compared with thesaurus)

• Paynter et al, DL 2000

Only previous Phind evaluation in relation to a thesaurus – Paynter et al, Asian DL 2000

Page 16: A User Evaluation of Hierarchical Phrase Browsing

ECDL 2003 16

So …

It may be cheap, scalable and automatic…

… but is it any use?

What do people do when confronted by Phind?

Can they use it to find things?

Page 17: A User Evaluation of Hierarchical Phrase Browsing

ECDL 2003 17

User Study: participants

University of Waikato Usability Lab– http://www.cs.waikato.ac.nz/usability

12 participants– Students, 9 male– Backgrounds: Computing, management

Individual sessionsSession length : 1 hour

Page 18: A User Evaluation of Hierarchical Phrase Browsing

ECDL 2003 18

User Study: collection

Existing collection within GreenstoneWeb site of the Food and Agriculture Organization (FAO) of the United Nations, – CD-ROM version as distributed in 1998

21,700 Web pages – as well as around 13,700 associated files (image files, PDFs,

etc.), – a medium-sized collection of approximately 140 million words

of text

Page 19: A User Evaluation of Hierarchical Phrase Browsing

ECDL 2003 19

User Study: tasks

seven tasks that involve locating information, understanding content, and recognizing and using elements and functions prompted with help during their first task 1. exploratory questions– “find out more about national forest programmes in different

countries”

2. specific retrieval tasks– “where can golden apple snails be found?”– “what was the locust numbers situation during May in Kuwait?”

Page 20: A User Evaluation of Hierarchical Phrase Browsing

ECDL 2003 20

User Study: mechanics

Phind as a Java applet within Greenstone– In Internet Explorer on Windows 98

FAO collection on public web server – nzdl.org

Video recordingQuestionnaires– Before and after tasks– Summary questionnaire at end

Page 21: A User Evaluation of Hierarchical Phrase Browsing

ECDL 2003 21

Results: summary

Phind was– useful– liked– good at supporting exploratory tasks– bad at supporting specific tasks

Page 22: A User Evaluation of Hierarchical Phrase Browsing

ECDL 2003 22

Results: task performance

Specific retrieval tasks involving multiple concepts:– ‘what are the most widely planted pines for timber and pulp

production in the southern United States?’ – ‘What was the locust numbers situation during May in Kuwait?

12 attempts using Phind on these 2 tasks:– 4 gave up, 5 gave the wrong answer – 3 found the correct answer

12 attempts using keyword searching:– 11 correct, 1 wrong

Quotes:– “You should be able to put more than one word”– “Confusing when I was searching for two different topics.”

Page 23: A User Evaluation of Hierarchical Phrase Browsing

ECDL 2003 23

Results: interface

2 Windows:– Three participants minimized the document window instead of

closing it – which meant that when they clicked on a document link, Phind

opened the document in the hidden window

Navigation– 5 of the 12 participants did not use the ‘Previous’ or ‘Next’

buttons at all – Elements little used:

• ‘get more phrases’• ‘get more documents’

Page 24: A User Evaluation of Hierarchical Phrase Browsing

ECDL 2003 24

Results: questionnaires

Phind’s results (10/12) : – ‘clear and easy to understand’– ‘relevant and useful to the query’

‘elements or features that they most disliked about Phind’ – “not being able to go back”

• During task: “Is there a way to go back?” (2)

‘search method they preferred overall’ – 9 to 3 in favour of keyword searching

Page 25: A User Evaluation of Hierarchical Phrase Browsing

ECDL 2003 25

Results

75% of the users preferred the keyword searching over phrase browsing overall. Despite liking the Phind interface, the participants found many problems. – main functional problem was Phind's inability to perform multi-

word queries. – Phind's unfamiliarity: new interface has too many new

elements

Page 26: A User Evaluation of Hierarchical Phrase Browsing

ECDL 2003 26

Results: links

two previously-reported design issues • Blandford et al (JCDL’01)

– “working across boundaries” • in the different paradigms of browser-based keyword

searching vs. the Java-based Phind interface• inconsistent experiences with the opening of windows

leading to lost documents • lack of feedback during query evaluation• unfamiliar navigation tools• problems understanding the relationship between frames

and result sets.

– “blind alleys” • when Phind users attempted multi-term phrase queries

Page 27: A User Evaluation of Hierarchical Phrase Browsing

ECDL 2003 27

Technology

Java applet in Web pagesCould be run as a Server-side process– Reduce the dislocation between 2 interfaces

Selecting words from actual vocabulary– Remove zero-hit queries– Dynamic reactive Java-like interface?

Tension between different routes forward

Page 28: A User Evaluation of Hierarchical Phrase Browsing

ECDL 2003 28

Caveats

NumbersAuthenticity– Motivation and domain knowledge

Prior experience– Keyword searching on the web

Lack of integration– Normal work patterns– Search mode

Page 29: A User Evaluation of Hierarchical Phrase Browsing

ECDL 2003 29

Conclusion

Phind seems to be ok for explorationMulti-concept queries not goodNot integrated with other searching/browsing mechanismsSmall ‘features’ of Phind confound resultsPositive subjective feedback