Beyond Basic Faceted Search Ben-Yitzhak, et al. Fahimeh Fakour CS 572 Summer 2010


Page 1: Beyond Basic Faceted Search Ben-Yitzhak, et al

Beyond Basic Faceted Search
Ben-Yitzhak, et al.

Fahimeh Fakour
CS 572
Summer 2010


Introduction

1. Importance and significance
2. Background Information
3. Objective
4. Related work
5. Approach and Solutions
6. Enhancements
7. Contributions
8. Pros & Cons

7/7/2010 2Beyond Basic Faceted Search


1. Importance and Significance

• Too much info
• Transactions


1. Importance and Significance (cont)

• Categories, lists, and the human mind


2. Background Information

• Research done in IBM & Yahoo Research labs
• Facets, buckets, and categories
  – Navigate multiple paths for different ordering
• Free text queries
• List of matching documents with count
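To make "free text query plus a list of matching documents with count" concrete, here is a minimal sketch of basic faceted search. The catalog, field names, and the `faceted_search` helper are all hypothetical; matching is plain whole-word containment, not a real ranking function.

```python
from collections import Counter

# Hypothetical toy catalog: each document has free text plus flat facet values.
DOCS = [
    {"id": 1, "text": "red running shorts", "facets": {"color": "red", "brand": "Acme"}},
    {"id": 2, "text": "blue running shoes", "facets": {"color": "blue", "brand": "Acme"}},
    {"id": 3, "text": "red rain jacket", "facets": {"color": "red", "brand": "Zenith"}},
]


def faceted_search(query, docs):
    """Return matching doc ids plus, per facet, a count of each value among the matches."""
    terms = query.lower().split()
    matches = [d for d in docs if all(t in d["text"].lower().split() for t in terms)]
    counts = {}
    for doc in matches:
        for facet, value in doc["facets"].items():
            counts.setdefault(facet, Counter())[value] += 1
    return [d["id"] for d in matches], counts


ids, counts = faceted_search("running", DOCS)
print(ids)
print({facet: dict(c) for facet, c in counts.items()})
```

The counts are computed only over the documents that matched the query, which is what lets a user narrow the result set facet by facet.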


3. Objective

• Extend traditional facets
  – Beyond numbers (figure labels: Numbers, Words)

• Search & Index correlated documents

• Similarity to OLAP: multi-dimensional data


4. Related Work

• Multifaceted search
  – Lexical subsumption
  – Synsets and hypernyms
  – RawSugar social tagging
• Online Analytical Processing (OLAP)
  – Multi-dimensional data
  – Aggregation of data: Cube
    • N-dimensional “group by”

Exciting new technique
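The OLAP cube as an N-dimensional "group by" can be sketched in a few lines. This is an illustration only: the rows, the dimension names, and the `cube` helper are hypothetical, and the measure is simply summed.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical fact rows: (color, size, units_sold).
ROWS = [
    ("red", "S", 3),
    ("red", "M", 5),
    ("blue", "M", 2),
]
DIMS = ("color", "size")


def cube(rows, dims):
    """Sum the measure for every subset of dimensions (the cells of a data cube)."""
    result = {}
    for r in range(len(dims) + 1):
        for subset in combinations(range(len(dims)), r):
            totals = defaultdict(int)
            for row in rows:
                key = tuple(row[i] for i in subset)
                totals[key] += row[-1]  # last column is the measure
            result[tuple(dims[i] for i in subset)] = dict(totals)
    return result


c = cube(ROWS, DIMS)
print(c[()])                 # grand total over all rows
print(c[("color",)])         # group by color
print(c[("color", "size")])  # group by color and size
```

Each key of the result names one "group by" over a subset of the dimensions, so two dimensions give four groupings in total.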


5. Approach & Solutions

5.1 Technologies: Lucene & Solr
5.2 Data Model
5.3 Facet hierarchy: Forest
5.4 Creating the facet paths
5.5 Running the facet query
5.6 Example


5.1. Technologies: Lucene & Solr

• Posting element: docID, offset, payload
  – Payload: byte array of additional info (runtime accessible)
• Matching document processing
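A posting element with a payload can be modeled as below. This is a sketch of the idea only and not Lucene's API: the `Posting` class, the price payload, and its 4-byte layout are assumptions chosen for illustration.

```python
import struct
from dataclasses import dataclass


@dataclass(frozen=True)
class Posting:
    doc_id: int
    offset: int     # position of the term occurrence in the document
    payload: bytes  # opaque bytes, decoded only when the match is processed


def encode_price(cents: int) -> bytes:
    # Hypothetical payload layout: one unsigned 32-bit big-endian integer.
    return struct.pack(">I", cents)


def decode_price(payload: bytes) -> int:
    return struct.unpack(">I", payload)[0]


# Posting list for one term; the payload rides along with each occurrence.
postings = [Posting(1, 0, encode_price(1999)), Posting(7, 4, encode_price(2499))]

# Matching document processing: read the payload at query time.
total = sum(decode_price(p.payload) for p in postings)
print(total)
```

Because the payload is just bytes, the index does not need to know what it means; only the code that processes matching documents decodes it.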


5.2. Data Model

• Taxonomy: hierarchical relationships among facets
  – Predefined taxonomy
  – Acquired/Learned through documents
• Facet-path forest
  – Tree: top-level facet
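A facet-path forest, with one tree per top-level facet, can be sketched with nested dictionaries. The paths and the `build_forest` helper are hypothetical; a real taxonomy could be predefined or collected from the documents as they are indexed.

```python
def build_forest(paths):
    """Nest facet paths into dicts; each top-level key is the root of one tree."""
    forest = {}
    for path in paths:
        node = forest
        for label in path:
            node = node.setdefault(label, {})
    return forest


# Hypothetical facet paths, as might be predefined or collected from documents.
PATHS = [
    ("clothing", "children's"),
    ("clothing", "men's", "shorts"),
    ("price", "under $20"),
]

forest = build_forest(PATHS)
print(sorted(forest))              # top-level facets, one tree each
print(sorted(forest["clothing"]))  # children of the "clothing" root
```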


5.3. Facet hierarchy: Forest


5.4. Creating the facet paths

• Add a posting element for the document for each prefix of path Pi
• Add the path to the taxonomy index
• Encode all k paths related to this document
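The three indexing steps can be sketched as follows. This is a toy in-memory model under stated assumptions, not the paper's or Lucene's implementation: the `FacetIndex` class is hypothetical, ordinals are assigned in first-seen order, and the "encoding" of a document's k paths is simply a list of ordinals. The `Facet$...` term naming follows the deck's own example.

```python
from collections import defaultdict


class FacetIndex:
    """Hypothetical in-memory stand-in for a search index plus a taxonomy index."""

    def __init__(self):
        self.postings = defaultdict(list)  # facet term -> doc ids
        self.taxonomy = {}                 # path tuple -> ordinal
        self.doc_ordinals = {}             # doc id -> ordinals of its k paths

    def _ordinal(self, path):
        # Assign the next free ordinal the first time a path is seen.
        return self.taxonomy.setdefault(path, len(self.taxonomy))

    def add(self, doc_id, paths):
        ordinals = []
        for path in paths:
            # One posting per prefix, so a document filed under a child
            # is also found under every ancestor.
            for end in range(1, len(path) + 1):
                prefix = tuple(path[:end])
                self._ordinal(prefix)
                term = "Facet$" + "$".join(prefix)
                if doc_id not in self.postings[term]:
                    self.postings[term].append(doc_id)
            ordinals.append(self.taxonomy[tuple(path)])
        self.doc_ordinals[doc_id] = ordinals


index = FacetIndex()
index.add("doc1", [("clothing", "children's")])
index.add("doc2", [("clothing",)])
print(dict(index.postings))
print(index.doc_ordinals)
```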


5.5. Running the facet query

• Terms:
  – Faceted query: query string + taxonomy subtrees
  – Faceted result set: ranked list of documents matching query + counters
• Lucene: use the Taxonomy Index function to determine ordinal number of paths
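A faceted query over a taxonomy subtree might be evaluated roughly as below. Everything here is assumed for illustration: the `TAXONOMY` and `DOC_ORDINALS` tables, the `facet_counts` helper, and the ranked list, which stands in for whatever the text query returned.

```python
from collections import Counter

# Hypothetical inputs: a taxonomy (path -> ordinal) and, per document,
# the ordinals of every facet path prefix it belongs to.
TAXONOMY = {
    ("clothing",): 0,
    ("clothing", "children's"): 1,
    ("clothing", "men's"): 2,
    ("price",): 3,
}
DOC_ORDINALS = {"doc1": [0, 1], "doc2": [0, 2], "doc3": [3]}


def facet_counts(ranked_docs, root):
    """Count matching documents under each path of the subtree rooted at `root`."""
    wanted = {o: p for p, o in TAXONOMY.items() if p[: len(root)] == root}
    counters = Counter()
    for doc in ranked_docs:
        for ordinal in DOC_ORDINALS[doc]:
            if ordinal in wanted:
                counters[wanted[ordinal]] += 1
    return counters


ranked = ["doc2", "doc1"]  # stand-in for the ranked result of the text query
counts = facet_counts(ranked, ("clothing",))
print(dict(counts))
```

The result set is the ranked list together with the counters; working with ordinals keeps the per-document loop to integer lookups rather than string comparisons.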


5.6. Example

Facet$clothing: doc1, doc2
Facet$clothing$children’s: doc1


6. Enhancements


6.1. Business Intelligence

• Qualitative rather than quantitative
  – Best sellers rather than number of books published by author
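The difference between counting and a richer aggregate can be shown with a small sketch. The sales figures and the `aggregate` helper are made up for illustration; the point is only that the two orderings of authors differ.

```python
from collections import defaultdict

# Hypothetical matching documents: (author, copies_sold).
MATCHES = [
    ("Author A", 120),
    ("Author A", 30),
    ("Author B", 500),
    ("Author C", 10),
    ("Author C", 15),
    ("Author C", 5),
]


def aggregate(matches):
    """Return per-author document counts and per-author total copies sold."""
    counts, sales = defaultdict(int), defaultdict(int)
    for author, sold in matches:
        counts[author] += 1
        sales[author] += sold
    return dict(counts), dict(sales)


counts, sales = aggregate(MATCHES)
by_count = sorted(counts, key=counts.get, reverse=True)  # most books first
by_sales = sorted(sales, key=sales.get, reverse=True)    # best sellers first
print(by_count)
print(by_sales)
```

Here the author with the most books is not the best seller, which is the kind of distinction a plain document count cannot express.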


6.2. Dynamic Facets: Welcome to the real world

• Not always independent data

• Example:
  – Running shorts
    • Different sizes per color
    • Location & price


6.2. Dynamic Facets: Solution

• Use tree over the data
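Why a tree helps with correlated facet values can be seen in a small sketch. The product record and both matching functions are hypothetical; they only contrast treating color and size as independent with nesting sizes under each color.

```python
# Hypothetical product: sizes hang under each color, so the pairing is kept.
SHORTS = {
    "id": "shorts-1",
    "variants": {
        "red": {"S": {"price": 19}, "M": {"price": 19}},
        "blue": {"L": {"price": 24}},
    },
}


def flat_match(doc, color, size):
    """Independent facets: color and size are tested separately."""
    colors = set(doc["variants"])
    sizes = {s for by_size in doc["variants"].values() for s in by_size}
    return color in colors and size in sizes


def tree_match(doc, color, size):
    """Tree over the data: size is looked up under the chosen color."""
    return size in doc["variants"].get(color, {})


print(flat_match(SHORTS, "red", "L"))  # flat facets report a match
print(tree_match(SHORTS, "red", "L"))  # the tree does not: no red shorts in L
```

With flat facets the product appears to come in red and in L, so the query matches even though that combination does not exist; the tree keeps the pairing and rejects it.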


6.2. Dynamic Facets: Solution (cont)


7. Contributions

• “Rich” aggregation: qualitative
• Engineering details
• Correlation in facet values


8.1. Pros

• Detailed description of engineering aspects & design decisions
• Use of implemented technologies
• Clearly defines the scope of the paper
• Gives foundation/background information
• Compatible with real life data


8.2. Cons

• Experiments and testing: no qualitative measurement
  – Effectiveness of “qualitative” facets
• Does not explain the relevance of some of the previous work
• Criteria for display/grouping?
  – Key use cases & known user access patterns not explained
• Building the taxonomy: depth/breadth?


Thank You


References


Ben-Yitzhak, et al. “Beyond Basic Faceted Search”. Proceedings of the International Conference on Web Search and Web Data Mining. pp. 33-44, 2008. <http://nadav.harel.org.il/papers/p33-ben-yitzhak.pdf>

“Faceted Search with Solr”. Lucid Imagination. July 1, 2010. <http://www.lucidimagination.com/Community/Hear-from-the-Experts/Articles/Faceted-Search-Solr>

“Faceted classification”. Wikipedia. July 7, 2010. <http://en.wikipedia.org/wiki/Faceted_classification>

Lemieux, Earley, and Associates. “Designing for Faceted Search”. User Interface Engineering. July 6, 2010. <http://www.uie.com/articles/faceted_search/> (Originally in KM World, March 2009)

Mattman, Chris. “Query Models” (presentation slides for class)