feedback - 서울시립대 데이터마이닝...

12
Feedback 지능정보시스템연구실 컴퓨터과학과 천상진([email protected])

Upload: others

Post on 11-Aug-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Feedback - 서울시립대 데이터마이닝 연구실datamining.uos.ac.kr/.../uploads/2019/02/07-feedback.pdf · 2019-04-30 · Intro relevance feedback pseudofeedback use explicit

Feedback

지능정보시스템연구실

컴퓨터과학과천상진([email protected])

Page 2: Feedback - 서울시립대 데이터마이닝 연구실datamining.uos.ac.kr/.../uploads/2019/02/07-feedback.pdf · 2019-04-30 · Intro relevance feedback pseudofeedback use explicit

Feedback takes the results of a user’s actions or

previous search results to improve retrieval results

Page 3: Feedback - 서울시립대 데이터마이닝 연구실datamining.uos.ac.kr/.../uploads/2019/02/07-feedback.pdf · 2019-04-30 · Intro relevance feedback pseudofeedback use explicit

Intro

✓ relevance feedback

✓ pseudo feedback

use explicit relevance judgement by user effort

simply assume the top k documents are relevant

✓ implicit feedbackuse user’s clickthrough data

“feedback in a TR system is based on learning from previous queries

to improve retrieval accuracy in the future queries.”

Page 4: Feedback - 서울시립대 데이터마이닝 연구실datamining.uos.ac.kr/.../uploads/2019/02/07-feedback.pdf · 2019-04-30 · Intro relevance feedback pseudofeedback use explicit

7.1 Feedback in the Vector Space Model

→ Rocchio Feedback Method

we want to place the query vector in a better position

in the high-dimensional term space, plotting it closer to relevant documents.

Page 5: Feedback - 서울시립대 데이터마이닝 연구실datamining.uos.ac.kr/.../uploads/2019/02/07-feedback.pdf · 2019-04-30 · Intro relevance feedback pseudofeedback use explicit

7.1 Feedback in the Vector Space Model

the set of relevant feedback documents

the original query vector

the modified vector

the set of non-relevant feedback documents

weight of terms from negative doc.

centroid

weight of terms from positive doc.

weight of terms in original query

Page 6: Feedback - 서울시립대 데이터마이닝 연구실datamining.uos.ac.kr/.../uploads/2019/02/07-feedback.pdf · 2019-04-30 · Intro relevance feedback pseudofeedback use explicit

7.1 Example

Page 7: Feedback - 서울시립대 데이터마이닝 연구실datamining.uos.ac.kr/.../uploads/2019/02/07-feedback.pdf · 2019-04-30 · Intro relevance feedback pseudofeedback use explicit

7.1 Example

✓ compute the centroid of positive and negative feedback documents

✓ modify the original query to create the expanded query

Page 8: Feedback - 서울시립대 데이터마이닝 연구실datamining.uos.ac.kr/.../uploads/2019/02/07-feedback.pdf · 2019-04-30 · Intro relevance feedback pseudofeedback use explicit

7.2 Feedback in Language Model

Kullback-Leibler divergence retrieval model

Page 9: Feedback - 서울시립대 데이터마이닝 연구실datamining.uos.ac.kr/.../uploads/2019/02/07-feedback.pdf · 2019-04-30 · Intro relevance feedback pseudofeedback use explicit
Page 10: Feedback - 서울시립대 데이터마이닝 연구실datamining.uos.ac.kr/.../uploads/2019/02/07-feedback.pdf · 2019-04-30 · Intro relevance feedback pseudofeedback use explicit

𝜆 ∈ [0, 1]

Page 11: Feedback - 서울시립대 데이터마이닝 연구실datamining.uos.ac.kr/.../uploads/2019/02/07-feedback.pdf · 2019-04-30 · Intro relevance feedback pseudofeedback use explicit

Query: “airport security”Mixture model approach

Page 12: Feedback - 서울시립대 데이터마이닝 연구실datamining.uos.ac.kr/.../uploads/2019/02/07-feedback.pdf · 2019-04-30 · Intro relevance feedback pseudofeedback use explicit

7.2 Feedback in Language Model

→ choose 𝜃𝐹 to maximize the log likelihood of the feedback documents

EM algorithm for estimating its parameters (Chapter 17…)

In practice, it isn’t feasible to try all values of 𝜃, so we use