putting qa back in mrqausing tf-idf drqa (chen et al., acl'17) creating a shorter support...

50
Putting QA back in MRQA Antoine Bordes with Angela Fan, Yacine Jernite, Ethan Perez, David Grangier, Jason Weston, Michael Auli, Chloe Braud, Claire Gardent, Emily Dinan, Stephen Roller, Kurt Schuster MRQA Workshop -- 11/4/19 -- Hong Kong

Upload: others

Post on 28-May-2020

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

Putting QA back in MRQA

Antoine Bordes with Angela Fan, Yacine Jernite, Ethan Perez, David Grangier, Jason Weston, Michael Auli, Chloe Braud, Claire Gardent, Emily Dinan, Stephen Roller, Kurt Schuster

MRQA Workshop -- 11/4/19 -- Hong Kong

Page 2: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

2

Friday June 17, 2016

Page 3: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

3

Page 4: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

4

No BERT!No Transformers even!

No PyTorch

No SQuAD!!!

Page 5: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

5

SQuAD had appeared on Arxiv the day before the workshop!

Page 6: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

6

Since then?

Page 7: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

7

https://paperswithcode.com/sota/question-answering-on-squad11

Human performance (82.3)

June 17, 2016

Page 8: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

8

QA is now only a sub-benchmark?

At least, SQuAD &Machine Reading are.

October 24, 2019

Page 9: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

Standard post-SQuaD Datasets

Although a fever (pyrexia) could be considered any body temperature above the normal temperature of 98.6 degrees Fahrenheit (98.6 F or 37 C), medically, a person is not considered to have a significant fever until the temperature is above 100.4 F (38.0 C).

What temperature is a fever?QUESTION

CONTEXT

Page 10: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

Standard post-SQuaD Datasets

QUESTION

ANSWER100.4

CONTEXT

Although a fever (pyrexia) could be considered any body temperature above the normal temperature of 98.6 degrees Fahrenheit (98.6 F or 37 C), medically, a person is not considered to have a significant fever until the temperature is above 100.4 F (38.0 C).

What temperature is a fever?

Page 11: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

What causes a fever?

Why do you need to bring your temperature down?

Why are fevers dangerous?

How about answering --

Page 12: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

12

It's time to get to the core of what QA is!

Open domainDeveloped and argumentative

Machine Reading is one of the keys

Page 13: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

Long-form QA

Page 14: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

Long Form (open-domain) Question Answering

Why do you need to bring your temperature down?

Up to a point, having a fever is a good thing when you're fighting an infection as in the case of sepsis (infection in the blood). Many pathogens don't fare well in even a degree or two of average raised temperature, while your body is much more resilient. It's still a pretty serious condition on its own, and sepsis is frequently fatal regardless of the not only the body's attempts to fight it, but with medical intervention.

The problems in general however, start when the fever is too high, or just high for too long. Your body will release something called chaperone molecules that help your proteins fold correctly, but there will still be errors and it's more energetically expensive. This chaperone molecules also have limits, and past a certain point your body fails on a number of levels.

For one, a lot of what your cells do is interact with, transport, [....]

QUESTION

ANSWER

Page 15: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

Why is moving to long-form QA difficult?

• Generation of fluent outputs + Their Evaluation

• Retrieval of relevant sources

• Information synthesis

• Lack of Data!

Page 16: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

Dataset and Pretrained Models:

http://www.github.com/facebookresearch/ELI5

https://facebookresearch.github.io/ELI5/

Page 17: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

Explain Like I'm Five Dataset (Fan et al. ACL'19)

Page 18: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

Open Domain

QUESTION

INFORMATION RETRIEVAL WEB DOCUMENTS

Common Crawl

Page 19: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

ELI5

MS Marco v2

TriviaQA

Narrative QA

CoQA

SQuAD 2.0

Hotpot QA

0 35 70 105 140

Answer Length

Longer Questions

Page 20: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

Questions about Diverse Topics

Chemistry

BiologyPhysics

Culture

History

Mathematics

Technology

Economics

Engineering

Religion

Literature

Government

Psychology

Page 21: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

More Why? Questions

ELI5

MS Marco v2

TriviaQA

Narrative QA

CoQA

SQuAD 2.0

Hotpot QA

0 10 20 30 40 50

Percentage of WHY Questions

Page 22: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

More How? Questions

ELI5

MS Marco v2

TriviaQA

Narrative QA

CoQA

SQuAD 2.0

Hotpot QA

0 6 12 18 24 30

Percentage of HOW Questions

Page 23: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

QUESTION

WEB DOCUMENTS

ANSWER

Explain Like I'm Five Dataset 270,000 EXAMPLES

100 per question42 words

200,000 words

130 words

Page 24: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

Models

Page 25: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

Modeling Challenges

WEB DOCUMENTS • Find Relevant Information in a Multi-Document Setting

Page 26: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

Modeling Challenges

QUESTION

• Find Relevant Information in a Multi-Document Setting

• Query-driven Information Reasoning

Page 27: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

Modeling Challenges

ANSWER

• Find Relevant Information in a Multi-Document Setting

• Query-driven Information Reasoning

• Writing a Long Text Answer

Page 28: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

Dealing with Long Web Input

WEB DOCUMENTS

Over 200,000 words long

SUPPORT DOCUMENT

Around 1,000 words long

Page 29: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

29

DR QA

Using TF-IDF

DrQA (Chen et al., ACL'17)

Page 30: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

Creating a Shorter Support Document with TF-IDF

850 words avg

SUPPORT DOCUMENT

Page 31: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

31

Albert Einstein, a German theoretical physicist, published the theory of relativity.

He won the physics Nobel Prize.

The theory of relativity is one of the two pillars of modern physics.

GRAPH CONSTRUCTION STEPS

Albert Einstein

the theory of relativity

wonthe Physics Nobel Prize

one of the two pillars of modern

physicsis

published

MERGE OPERATION: theory of relativity EXISTS AS A NODEADDED TO GRAPH

COREFERENCE: he and Albert Einstein

MERGE OPERATION: Albert EinsteinEXISTS AS A NODE

Puppies are very cute.

QUERY: Can someone finally explain the theory of general relativity?

1 2

3 4

FILTER OPERATION: low TF-IDF overlap with queryNOT ADDED TO GRAPH

NODE WEIGHT + 1

NODE WEIGHT + 1

DOCUMENT SENTENCES with GRAPH OPERATIONS

CONSTRUCTED GRAPH

Creating a Shorter Support Document with Local KB (Fan et et al.

SUPPORT DOCUMENT

Page 32: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

32

Avg Input Size

Graph

Web Search

Triples Only

200k

100k

Leng

th

- Large compression effect, reducing input size by an order of magnitude. - Linearized output of the KB can be processed with (almost) standard Seq2Seq

Creating a Shorter Support Document with Local KB (Fan et et al.

Missing Answer Tokens

% M

issin

g To

kens

40

20

60

TF-IDF Extract Top 1

Top 5Top 10

Top 25Top 50

All 100

Triples Only

Graph Built from N Hits

(lower is better)

(All 100)

Graph, Len 850

(All 100)

- Despite compression, the graph contains most of the relevant tokens

Page 33: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

Modeling Approaches

QUESTION ANSWERSUPPORT

DOCUMENT

• Extractive

• Abstractive

Page 34: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

Extractive Models

Find sentences of the support document to copy as the

QUESTIONSUPPORT

DOCUMENTEXTRACTIVE

ANSWER

Page 35: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

Abstractive Models

QUESTIONSUPPORT

DOCUMENT

ANSWER

Generate each word of the

Page 36: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

Abstractive Models

SEQUENCE TO SEQUENCE MULTITASK

training time: train on many tasksto add the benefit of language

Page 37: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

training time: train on many tasksSEQUENCE TO

SEQUENCE LANGUAGE MODELING

Page 38: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

training time: train on many tasksSEQUENCE TO

SEQUENCE LANGUAGE MODELING

Page 39: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

MASKED LANGUAGE MODELING

masked

training time: train on many tasksSEQUENCE TO

SEQUENCE LANGUAGE MODELING

Page 40: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

Abstractive Models

SEQUENCE TO SEQUENCE MULTITASK

inference time: standard question answering

QUESTIONSUPPORT

DOCUMENT

ANSWER

Page 41: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

Evaluation

Page 42: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

Automatic Evaluation

0

7

14

21

28

35

카테고리 축Extractive TFIDF Local KBRegion 1 Region 1 Region 1

3028.9

20.6

ROUGE

Multitask Abstractive

Page 43: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

43

that phenomena in small enough patches of the spacetime obey laws compatible with special relativity principle of equivalence: the gravitational forces are locally indistinguishable from fictitious forces

more attention

at the experiments whichdemonstrated that in histheory of relativity, spaceand time are one

Albert Einstein

General Relativity

developed

had

many discoveries

looked

is

Einstein’s theory

isa theory of gravitation

a difficult theory to comprehend like most of other scientific theories

is

assumes

was

the first major new theory of gravity since Isaac Newton’s

color denotes attention Question: Can someone finally explain the theory of general relativity?

Generated Answer: It’s a bit complicated to explain. General relativity is a theory of Albert Einstein. Einstein developed this theory. Relativity is measure of how fast something moves relative to the reference frame. In theory, the speed of light is constant. If you are traveling at a constant speed, and then you move at a speed relative to an observer, there is a constant rate for that object to move. General relativity is observed in the experiments of Albert Einstein. It is about space, and space, and gravity.

Interpretation

Page 44: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

Human Evaluation

MODEL 1 ANSWER

MODEL 2 ANSWER

MEASUREHUMAN

PREFERENCE

SAME QUESTION

Page 45: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

LM

Human

s2s Extractive Multi-task

98.4* 93.1* 85.5* 88.8*

Human Evaluation

far from human performance

Still a long way to go but we are putting the pieces together!

Prediction: Long-form, open-domain and extractive QA will merge

Page 46: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

Knowledge beyond QA

Page 47: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

47

Wizard of Wikipedia (Dinan et al. ICLR'19)

Page 48: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

48

Retrieval-based model (~extractive)

Page 49: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge

49

Generative model

Page 50: Putting QA back in MRQAUsing TF-IDF DrQA (Chen et al., ACL'17) Creating a Shorter Support Document with TF-IDF 850 words avg SUPPORT DOCUMENT. 31 Albert Einstein, a German ... Knowledge