query classification using asymmetrical learning zheng zhu birkbeck college, university of london

15
Query Classification Using Asymmetrical Learning Zheng Zhu Birkbeck College, University of London

Upload: isaiah-ruiz

Post on 28-Mar-2015

213 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Query Classification Using Asymmetrical Learning Zheng Zhu Birkbeck College, University of London

Query Classification Using Asymmetrical Learning

Zheng ZhuBirkbeck College, University

of London

Page 2: Query Classification Using Asymmetrical Learning Zheng Zhu Birkbeck College, University of London

Content

• Query Classification• Our Approach• Experiment Results• Conclusion

Page 3: Query Classification Using Asymmetrical Learning Zheng Zhu Birkbeck College, University of London

Query Classification

• The task is to assign a query to one or more predefined categories, based on its topics. (from wikipedia)

• Applications: Paid Placement Advertisement, Federated Search.

• Challenge: Query is short, noisy.

Page 4: Query Classification Using Asymmetrical Learning Zheng Zhu Birkbeck College, University of London

Query Classification

• To handle those challenges, (Pseudo) Relevance Feedback is used to enrich the queries.

• But it involves sophisticated searching and ranking function.

• The motivation is to study the performance of query classification in the absence of PRF.

Page 5: Query Classification Using Asymmetrical Learning Zheng Zhu Birkbeck College, University of London

Query Classification

• Another approach is to enrich the queries with co-occurrence terms from query logs. For example, the query “machine learning” is strongly correlated to “machine learning algorithm” and “machine learning research” in query logs.

Page 6: Query Classification Using Asymmetrical Learning Zheng Zhu Birkbeck College, University of London

Our Approach

• Enrichment Strategy: 1. Zero enrichment.2. Pseudo Relevance Feedback.3. Related Suggestions From Yahoo.

Page 7: Query Classification Using Asymmetrical Learning Zheng Zhu Birkbeck College, University of London

Our Approach

• Vector Space Model: A document is represented as a vector. Each dimension corresponds to a separate term. If a term occurs in the document, its value in the vector is non-zero.

Page 8: Query Classification Using Asymmetrical Learning Zheng Zhu Birkbeck College, University of London

Our Approach

• N-gram model in word level and character level.

• Linear SVM.• Ensemble Linear SVM

(Symmetrical case), base classifier trained from snippets, titles, urls respectively.

Page 9: Query Classification Using Asymmetrical Learning Zheng Zhu Birkbeck College, University of London

Our Approach

• Multi-label, Multi-class problem: decompose it to Binary class problem.

• Evaluation Criteria: Micro-Precision, Micro-Recall and Micro-F1.

Page 10: Query Classification Using Asymmetrical Learning Zheng Zhu Birkbeck College, University of London

Results

Page 11: Query Classification Using Asymmetrical Learning Zheng Zhu Birkbeck College, University of London

Results – Symmetrical Case

Page 12: Query Classification Using Asymmetrical Learning Zheng Zhu Birkbeck College, University of London

Result –Symmetrical Case

Page 13: Query Classification Using Asymmetrical Learning Zheng Zhu Birkbeck College, University of London

Result – Asymmetrical Case

Page 14: Query Classification Using Asymmetrical Learning Zheng Zhu Birkbeck College, University of London

Conclusion

• Pseudo-Relevance Feedback yields better performance, however it is a post-search strategy.

• Yahoo suggested keyword achieve worse result.

• Training with PRF, testing with suggested keywords is in the middle, but it doesn’t require the searching and ranking.

Page 15: Query Classification Using Asymmetrical Learning Zheng Zhu Birkbeck College, University of London

• Thanks