robust shallow semantic parsing of text › ~dipanjan › ttic_presentation.pdf · semantic parsing...

217
Dipanjan Das Carnegie Mellon University Toyota Technological Institute at Chicago February 14, 2012 Robust Shallow Semantic Parsing of Text

Upload: others

Post on 04-Jul-2020

9 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Dipanjan DasCarnegie Mellon University

Toyota Technological Institute at Chicago

February 14, 2012

Robust Shallow Semantic Parsing of Text

Page 2: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Natural Language Understanding

2

Page 3: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Natural Language Understanding

I want to go to Chicago on Sunday

3

Page 4: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

I want to go to on Sunday

Natural Language Understanding

P V ADP V ADP N ADP N

Shallow Syntax:Part-of-Speech Tagging

4

Chicago

Page 5: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Natural Language Understanding

Deeper Syntax:Dependency parsing

I want to go to on Sunday

P V ADP V ADP N ADP N

5

Chicago

Page 6: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Natural Language Understanding

Shallow Semantics: Frames and Roles

I want to go to on Sunday

P V ADP V ADP N ADP N

6

Chicago

Page 7: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Natural Language Understanding

Shallow Semantics: Frames and Roles

I want to go to on Sunday

P V ADP V ADP N ADP N

Encodes an eventor

scenario

7

Chicago

Page 8: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Natural Language Understanding

Shallow Semantics: Frames and Roles

I want to go to on Sunday

P V ADP V ADP N ADP N

8

Chicago

Page 9: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Natural Language Understanding

Shallow Semantics: Frames and Roles

I want to go to on Sunday

P V ADP V ADP N ADP N

participant orrole

for the frame

9

Chicago

Page 10: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Natural Language Understanding

Shallow Semantics: Frames and Roles

I want to go to on Sunday

Experiencer

10

Chicago

Page 11: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Natural Language Understanding

Shallow Semantics: Frames and Roles

Focus of this talk!(Das, Schneider, Chen and Smith, NAACL 2010;

Das and Smith, ACL 2011)

11

I want to go to on Sunday

Experiencer

Chicago

Page 12: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

1. Why semantic analysis?

3. Semi-supervised learning for robustness

2. Statistical models for structure prediction

12

Page 13: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

1. Why semantic analysis?

3. Semi-supervised learning for robustness

2. Statistical models for structure prediction

ApplicationsMotivation Choice offormalism

13

Page 14: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

1. Why semantic analysis?

3. Semi-supervised learning for robustness

2. Statistical models for structure prediction

Argument identificationFrame identification

14

Page 15: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

1. Why semantic analysis?

3. Semi-supervised learning for robustness

2. Statistical models for structure prediction

Argument identificationFrame identification

(use of latent variables) (dual decomposition)

15

Page 16: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

1. Why semantic analysis?

3. Semi-supervised learning for robustness

2. Statistical models for structure prediction

Novel graph-based learning algorithms16

Page 17: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

1. Why semantic analysis?

Motivation ApplicationsChoice offormalism

17

Page 18: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Bengal ’s massive stock of food was reduced to nothing

18

Page 19: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Bengal ’s massive stock of food was reduced to nothingN X A N ADP N V V ADP N

19

Page 20: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Bengal ’s massive stock of food was reduced to nothingN X A N ADP N V V ADP N

Large body of research on syntax, includingDas and Petrov, ACL 2011

Cohen, Das and Smith, EMNLP 2011Martins, Das, Smith and Xing, EMNLP 2008

20

Page 21: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Bengal ’s massive stock of food was reduced to nothingN X A N ADP N V V ADP N

storeor

financial entity?

21

Page 22: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Bengal ’s massive stock of food was reduced to nothingN X A N ADP N V V ADP N

Store of what?Of what size?Whose store?

22

Page 23: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Bengal ’s massive stock of food was reduced to nothingN X A N ADP N V V ADP N

Store of what?Of what size?Whose store?

23

Page 24: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Bengal ’s massive stock of food was reduced to nothingN X A N ADP N V V ADP N

What was reduced?

24

Page 25: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Bengal ’s massive stock of food was reduced to nothingN X A N ADP N V V ADP N

What was reduced?

To what?

25

Page 26: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Bengal ’s massive stock of food was reduced to nothingN X A N ADP N V V ADP N

What was reduced?

To what?

26

Page 27: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Bengal ’s massive stock of food was reduced to nothing

27

Page 28: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Bengal ’s massive stock of food was reduced to nothing

AUSE_CHANGE_OF_POSITION_ON_A_SCALE

28

Page 29: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Origins: (Computational) Linguistics

29

Page 30: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Origins: (Computational) Linguistics

Case Grammar(“The Case for Case”, Fillmore, 1968)

30

Page 31: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

I gave some money to him

Case Grammar(“The Case for Case”, Fillmore, 1968)

Origins: (Computational) Linguistics

31

Page 32: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

(cases are words/phrases required by a predicate)

I gave some money to him

Agent Object Beneficiary

Case Grammar(“The Case for Case”, Fillmore, 1968)

Origins: (Computational) Linguistics

32

Page 33: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

(cases are words/phrases required by a predicate)

I gave some money to him

Agent Object Beneficiary

Semantic valency of a predicate

Correlation with syntax(e.g. subject and object)

Obligatory cases / optional cases

Case Grammar(“The Case for Case”, Fillmore, 1968)

Origins: (Computational) Linguistics

33

Page 34: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Slide idea taken from Brendan O’Connor

Case Grammar(“The Case for Case”,

Fillmore, 1968)

34

Page 35: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Frames(“A Framework

for Representing Knowledge”, Minsky, 1975)

Slide idea taken from Brendan O’Connor

Case Grammar(“The Case for Case”,

Fillmore, 1968)

35

Page 36: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Frame Semantics(“Frame Semantics”,

Fillmore, 1982)

Frames(“A Framework

for Representing Knowledge”, Minsky, 1975)

Slide idea taken from Brendan O’Connor

Case Grammar(“The Case for Case”,

Fillmore, 1968)

36

Page 37: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Frame Semantics(“Frame Semantics”,

Fillmore, 1982)

Relates the meaning of a word with world knowledge

(e.g. gave evokes a IVING frame;

it has several participating roles;the frame is evoked by other words,

such as bequeath, contribute, donate)

Frames(“A Framework

for Representing Knowledge”, Minsky, 1975)

Slide idea taken from Brendan O’Connor

Case Grammar(“The Case for Case”,

Fillmore, 1968)

37

Page 38: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Frame Semantics(“Frame Semantics”,

Fillmore, 1982)

Frames(“A Framework

for Representing Knowledge”, Minsky, 1975)

Slide idea taken from Brendan O’Connor

Case Grammar(“The Case for Case”,

Fillmore, 1968)

Relates the meaning of a word with world knowledge

(e.g. gave evokes a IVING frame;

it has several participating roles;the frame is evoked by other words,

such as bequeath, contribute, donate)

38

Page 39: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Case Grammar(“The Case for Case”,

Fillmore, 1968)

Frame Semantics(“Frame Semantics”,

Fillmore, 1982)

FrameNetPropBankVerbNet

NomBankOntoNotes

Frames(“A Framework

for Representing Knowledge”, Minsky, 1975)

Datasets

Slide idea taken from Brendan O’Connor

39

Page 40: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Case Grammar(“The Case for Case”,

Fillmore, 1968)

Frame Semantics(“Frame Semantics”,

Fillmore, 1982)

FrameNetPropBankVerbNet

NomBankOntoNotes

Frames(“A Framework

for Representing Knowledge”, Minsky, 1975)

Datasets

Data-Driven ShallowSemantic Parsing

Slide idea taken from Brendan O’Connor

40

Page 41: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Case Grammar(“The Case for Case”,

Fillmore, 1968)

Frame Semantics(“Frame Semantics”,

Fillmore, 1982)

FrameNetPropBankVerbNet

NomBankOntoNotes

Frames(“A Framework

for Representing Knowledge”, Minsky, 1975)

Scripts(“Scripts, Plans,

Goals and Understanding”, Schank and Abelson, 1977)

MUCACE

GENIA

Datasets

Data-Driven ShallowSemantic Parsing

Information Extraction(template filling)

Slide idea taken from Brendan O’Connor

41

Page 42: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Case Grammar(“The Case for Case”,

Fillmore, 1968)

Frame Semantics(“Frame Semantics”,

Fillmore, 1982)

FrameNetPropBankVerbNet

NomBankOntoNotes

Frames(“A Framework

for Representing Knowledge”, Minsky, 1975)

Scripts(“Scripts, Plans,

Goals and Understanding”, Schank and Abelson, 1977)

MUCACE

GENIA

Datasets

Data-Driven ShallowSemantic Parsing

Information Extraction(template filling)

Slide idea taken from Brendan O’Connor

42

Page 43: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Case Grammar(“The Case for Case”,

Fillmore, 1968)

Frame Semantics(“Frame Semantics”,

Fillmore, 1982)

FrameNetPropBankVerbNet

NomBankOntoNotes

Frames(“A Framework

for Representing Knowledge”, Minsky, 1975)

Scripts(“Scripts, Plans,

Goals and Understanding”, Schank and Abelson, 1977)

MUCACE

GENIA

Datasets

Data-Driven ShallowSemantic Parsing

Information Extraction(template filling)

Slide idea taken from Brendan O’Connor

43

Page 44: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Case Grammar(“The Case for Case”,

Fillmore, 1968)

Frame Semantics(“Frame Semantics”,

Fillmore, 1982)

FrameNetPropBankVerbNet

NomBankOntoNotes

Frames(“A Framework

for Representing Knowledge”, Minsky, 1975)

Scripts(“Scripts, Plans,

Goals and Understanding”, Schank and Abelson, 1977)

MUCACE

GENIA

Datasets

Data-Driven ShallowSemantic Parsing

Information Extraction(template filling)

Slide idea taken from Brendan O’Connor

structurally similar!

44

Page 45: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Why this Linguistic Formalism?

45

Page 46: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

shallow

deep

46

Why this Linguistic Formalism?

Page 47: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

shallow

deep

PropBank-style Semantic Role Labeling(Pradhan et al., 2008; Punyakanok et al., 2008)

47

Why this Linguistic Formalism?

Page 48: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

shallow

deep

Bengal ’s massive stock of food was reduced to nothingN X A N ADP N V V ADP N

PropBank-style Semantic Role Labeling(Pradhan et al., 2008; Punyakanok et al., 2008)

48

Why this Linguistic Formalism?

Page 49: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

shallow

deep

Bengal ’s massive stock of food was reduced to nothingN X A N ADP N V V ADP N

A1 A4

PropBank-style Semantic Role Labeling(Pradhan et al., 2008; Punyakanok et al., 2008)

49

Why this Linguistic Formalism?

Page 50: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

shallow

deep

Bengal ’s massive stock of food was reduced to nothingN X A N ADP N V V ADP N

A1 A4

symbolic set of semantic roles(six total)

Verb-specific meaning for these labels

Conflates the meaning of different roles due to oversimplification

(Yi et al., 2007)

PropBank-style Semantic Role Labeling(Pradhan et al., 2008; Punyakanok et al., 2008)

50

Why this Linguistic Formalism?

Page 51: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

shallow

deepSemantic Parsing into Logical Forms

(Ge and Mooney, 2005; Zettlemoyer and Collins., 2005)

PropBank-style Semantic Role Labeling(Pradhan et al., 2008; Punyakanok et al., 2008)

51

Why this Linguistic Formalism?

Page 52: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

shallow

deep

What states border the state that borders the most states

Semantic Parsing into Logical Forms(Ge and Mooney, 2005; Zettlemoyer and Collins., 2005)

PropBank-style Semantic Role Labeling(Pradhan et al., 2008; Punyakanok et al., 2008)

52

Why this Linguistic Formalism?

Page 53: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

shallow

deep

What states border the state that borders the most states

Semantic Parsing into Logical Forms(Ge and Mooney, 2005; Zettlemoyer and Collins., 2005)

PropBank-style Semantic Role Labeling(Pradhan et al., 2008; Punyakanok et al., 2008)

53

Why this Linguistic Formalism?

Page 54: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

shallow

deep

What states border the state that borders the most states

Semantic Parsing into Logical Forms(Ge and Mooney, 2005; Zettlemoyer and Collins., 2005)

Trained on very restricted domains

Poor lexical coverage

PropBank-style Semantic Role Labeling(Pradhan et al., 2008; Punyakanok et al., 2008)

54

Why this Linguistic Formalism?

Page 55: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

shallow

deep

PropBank-style Semantic Role Labeling(Pradhan et al., 2008; Punyakanok et al., 2008)

Semantic Parsing into Logical Forms(Ge and Mooney, 2005; Zettlemoyer and Collins., 2005)

Frame-Semantic Parsing(Gildea and Jurafsky, 2002, Johansson and Nugues, 2007; this work)

55

Why this Linguistic Formalism?

Page 56: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

shallow

deep

PropBank-style Semantic Role Labeling(Pradhan et al., 2008; Punyakanok et al., 2008)

Semantic Parsing into Logical Forms(Ge and Mooney, 2005; Zettlemoyer and Collins., 2005)

56

Frame-Semantic Parsing(Gildea and Jurafsky, 2002, Johansson and Nugues, 2007; this work)

Does not model quantification or negation

unlike logical forms

Why this Linguistic Formalism?

Page 57: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

shallow

deep

PropBank-style Semantic Role Labeling(Pradhan et al., 2008; Punyakanok et al., 2008)

Semantic Parsing into Logical Forms(Ge and Mooney, 2005; Zettlemoyer and Collins., 2005)

Deeper than PropBank-style semantic role labeling

57

Frame-Semantic Parsing(Gildea and Jurafsky, 2002, Johansson and Nugues, 2007; this work)

Why this Linguistic Formalism?

Page 58: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

shallow

deep

PropBank-style Semantic Role Labeling(Pradhan et al., 2008; Punyakanok et al., 2008)

Semantic Parsing into Logical Forms(Ge and Mooney, 2005; Zettlemoyer and Collins., 2005)

Models all types of part-of-speech categories

58

Frame-Semantic Parsing(Gildea and Jurafsky, 2002, Johansson and Nugues, 2007; this work)

Why this Linguistic Formalism?

Page 59: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

shallow

deep

PropBank-style Semantic Role Labeling(Pradhan et al., 2008; Punyakanok et al., 2008)

Semantic Parsing into Logical Forms(Ge and Mooney, 2005; Zettlemoyer and Collins., 2005)

Larger lexical coverage than logical form parsers

59

Frame-Semantic Parsing(Gildea and Jurafsky, 2002, Johansson and Nugues, 2007; this work)

Why this Linguistic Formalism?

Page 60: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

shallow

deep

PropBank-style Semantic Role Labeling(Pradhan et al., 2008; Punyakanok et al., 2008)

Semantic Parsing into Logical Forms(Ge and Mooney, 2005; Zettlemoyer and Collins., 2005)

Lexicon actively increasing in size

60

Frame-Semantic Parsing(Gildea and Jurafsky, 2002, Johansson and Nugues, 2007; this work)

Why this Linguistic Formalism?

Page 61: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

1. Why semantic analysis?

ApplicationsMotivation ApplicationsChoice offormalism

61

Page 62: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Question Answering

Bengal ’s massive stock of food was reduced to nothing

Whose stock of food was diminished ?

Possible Applications

62

Page 63: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Question Answering

Bengal ’s massive stock of food was reduced to nothing

Whose stock of food was diminished ?

Possible Applications

63

Page 64: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Question Answering

Bengal ’s massive stock of food was reduced to nothing

Whose stock of food was diminished ?

Possible Applications

64

Page 65: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Question Answering

Bengal ’s massive stock of food was reduced to nothing

Whose stock of food was diminished ?

Possible Applications

65

Page 66: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Question Answering

Bengal ’s massive stock of food was reduced to nothing

Whose stock of food was diminished ?reserve

Possible Applications

66

Page 67: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Question Answering

Bengal ’s massive stock of food was reduced to nothing

Whose stock of food was diminished ?reserve

Bilotti et al. (2007)

Possible Applications

67

Page 68: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Information Extraction

In 1997, France's stock of unirradiated civil plutonium increased to 72 tons.

Saudi Arabia has 267 billion barrels in reserves of oil.

Does Egypt have stockpiles of biological weapons?

Bengal’s massive stock of food was reduced to nothing.

Possible Applications

68

Page 69: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Information Extraction

In 1997, France's stock of unirradiated civil plutonium increased to 72 tons.

Saudi Arabia has 267 billion barrels in reserves of oil.

Does Egypt have stockpiles of biological weapons?

Bengal’s massive stock of food was reduced to nothing.

Possible Applications

69

Page 70: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Information Extraction

In 1997, France's stock of unirradiated civil plutonium increased to 72 tons.

Saudi Arabia has 267 billion barrels in reserves of oil.

Does Egypt have stockpiles of biological weapons?

Bengal’s massive stock of food was reduced to nothing.

Possessor Desc Resource

Bengal Massive food

France - unirradiated civil plutonium

Saudi Arabia 267 billion barrels oil

Egypt - Biological weapons

Possible Applications

70

Page 71: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Multilingual Applications

Bengal ’s massive stock of food was reduced to nothing

Possible Applications

71

Page 72: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Multilingual Applications

Bengal ’s massive stock of food was reduced to nothing

������ ��� ��� � ����� ��� �� ��� ���

Possible Applications

72

Page 73: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Multilingual Applications

Bengal ’s massive stock of food was reduced to nothing

������ ��� ��� � ����� ��� �� ��� ���

Possible Applications

73

Page 74: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Multilingual KBs

Multilingual Applications

Bengal ’s massive stock of food was reduced to nothing

������ ��� ��� � ����� ��� �� ��� ���

Cross lingual IR

Machine translation

Possible Applications

74

Page 75: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

2. Statistical models for structure prediction

(Das, Schneider, Chen and Smith, NAACL 2010)

75

Page 76: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Structure of Lexicon and Data

76

Page 77: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Structure of Lexicon and Data

PLACING

Agent

Cause

Goal

Theme

Area

Time

77

Page 78: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Structure of Lexicon and Data

PLACING

Agent

Cause

Goal

Theme

Area

Time

frame

78

Page 79: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Structure of Lexicon and Data

PLACING

Agent

Cause

Goal

Theme

Area

Time

frame

roles

79

Page 80: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Structure of Lexicon and Data

PLACING

Agent

Cause

Goal

Theme

Area

Time

framecore roles

non-core roles

80

Page 81: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Structure of Lexicon and Data

PLACING

Agent

Cause

Goal

Theme

Area

Time

framecore roles

non-core roles

excludes relationship

81

Page 82: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Structure of Lexicon and Data

PLACING

Agent

Cause

Goal

Theme

Area

Time

framecore roles

non-core roles

archive.V, arrange.V, bag.V, bestow.V bin.V

predicates

excludes relationship

82

Page 83: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Structure of Lexicon and Data

PLACING

Agent

Cause

Goal

Theme

Area

Time

DISPERSAL

Agent

Cause

Individuals

Distance

Time

TRANSITIVE_ACTION

Agent

Cause

Patient

Event

Place

Time

INSTALLING

Agent

Component

Fixed_location

Area

Time

STORING

Agent

Location

Theme

Area

Time

STORE

Possessor

Resource

Supply

Descriptor

83

Page 84: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Structure of Lexicon and Data

PLACING

Agent

Cause

Goal

Theme

Area

Time

DISPERSAL

Agent

Cause

Individuals

Distance

Time

TRANSITIVE_ACTION

Agent

Cause

Patient

Event

Place

Time

INSTALLING

Agent

Component

Fixed_location

Area

Time

STORING

Agent

Location

Theme

Area

Time

STORE

Possessor

Resource

Supply

Descriptor

84

Page 85: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Structure of Lexicon and Data

PLACING

Agent

Cause

Goal

Theme

Area

Time

DISPERSAL

Agent

Cause

Individuals

Distance

Time

TRANSITIVE_ACTION

Agent

Cause

Patient

Event

Place

Time

INSTALLING

Agent

Component

Fixed_location

Area

Time

STORING

Agent

Location

Theme

Area

Time

STORE

Possessor

Resource

Supply

Descriptor

inheritance

used by

85

Page 86: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Structure of Lexicon and Data

Benchmark Dataset(SemEval 2007)

665 frames720 role labels

8.4K unique predicate types

Training set:2.2K sentences

11.2K predicate tokens

Test set:120 sentences

1. 1K predicate tokens86

Page 87: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Structure of Lexicon and Data

For comparison with past state of the art

(very small dataset)

Benchmark Dataset(SemEval 2007)

665 frames720 role labels

8.4K unique predicate types

Training set:2.2K sentences

11.2K predicate tokens

Test set:120 sentences

1. 1K predicate tokens87

Page 88: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Structure of Lexicon and Data

New Data(FrameNet 1.5, 2010)

877 frames1068 role labels

9.3K unique predicate types

Training set:3.3K sentences

19.6K predicate tokens

Test set:2420 sentences

4.5K predicate tokens

Benchmark Dataset(SemEval 2007)

665 frames720 role labels

8.4K unique predicate types

Training set:2.2K sentences

11.2K predicate tokens

Test set:120 sentences

1. 1K predicate tokens88

Page 89: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Structure of Lexicon and Data

New Data(FrameNet 1.5, 2010)

877 frames1068 role labels

9.3K unique predicate types

Training set:3.3K sentences

19.6K predicate tokens

Test set:2420 sentences

4.5K predicate tokens

Benchmark Dataset(SemEval 2007)

665 frames720 role labels

8.4K unique predicate types

Training set:2.2K sentences

11.2K predicate tokens

Test set:120 sentences

1. 1K predicate tokens89

Page 90: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

2. Statistical models for structure prediction

Argument identificationFrame identification

(use of latent variables) (dual decomposition)

90

Page 91: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

2. Statistical models for structure prediction

Argument identificationFrame identification

(use of latent variables) (dual decomposition)

91

Page 92: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Frame Identification

Bengal ’s massive stock of food was reduced to nothingN X A N ADP N V V ADP N

92

Page 93: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Frame Identification

Bengal ’s massive stock of food was reduced to nothingN X A N ADP N V V ADP N

ambiguous

93

Page 94: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Frame Identification

Bengal ’s massive stock of food was reduced to nothingN X A N ADP N V V ADP N

Find the best among all the frames

94

Page 95: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Frame Identification

Bengal ’s massive stock of food was reduced to nothingN X A N ADP N V V ADP N

95

Page 96: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Frame Identification

Bengal ’s massive stock of food was reduced to nothingN X A N ADP N V V ADP N

96

Page 97: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Frame Identification

Direct modeling using logistic regression

97

Page 98: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Two problems:1. Unable to model unknown

predicates at test time

Frame Identification

Direct modeling using logistic regression

98

Page 99: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Frame Identification

Direct modeling using logistic regression

Two problems:1. Unable to model unknown

predicates at test time

2. Number of features:

99

Page 100: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Frame Identification

Direct modeling using logistic regression

Two problems:1. Unable to model unknown

predicates at test time

2. Number of features: ≈ 50 million

100

Page 101: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Frame Identification

Logistic regression with a latent variable

101

Page 102: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Frame Identification

Logistic regression with a latent variable

102

Page 103: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Frame Identification

Logistic regression with a latent variable

103

Predicates evoking a frame in supervised data, e.g.cargo.N, inventory.N, reserve.N, stockpile.N, store.N, supply.N

evoke STORE

Page 104: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Frame Identification

Logistic regression with a latent variable

104

Does not look at the predicate’s surface form

Page 105: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Frame Identification

105

Page 106: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Frame Identification

106

TORE

stock.N

stockpile.N

Bengal ’s massive stock of food was reduced to nothingN X A N ADP N V V ADP N

Page 107: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Frame Identification

107

TORE

stock.N

stockpile.N

Bengal ’s massive stock of food was reduced to nothingN X A N ADP N V V ADP N

LexSem = {synonym}

Page 108: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Frame Identification

108

TORE

stock.N

stockpile.N

Bengal ’s massive stock of food was reduced to nothingN X A N ADP N V V ADP N

LexSem = {synonym}

If STORE

stockpile.N

synonym LexSem

Page 109: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Frame Identification

109

TORE

stock.N

stockpile.N

Bengal ’s massive stock of food was reduced to nothingN X A N ADP N V V ADP N

LexSem = {synonym}

If STORE

stockpile.N

synonym LexSem

comes from WordNet!

Page 110: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Frame Identification

110

Number of features:

Page 111: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Frame Identification

111

Number of features:

≈ 500 K

(1% of the features we had before)

Page 112: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Frame Identification

112

Number of features:

≈ 500 K

(1% of the features we had before)

Aside:

Probabilistic modeling of language meaning using syntax, lexical semantics and latent structure:

Paraphrase identification (Das and Smith, ACL 2009)

Page 113: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Frame Identification

Training:

Maximum conditional log-likelihood(batch training with L-BFGS)

113

Page 114: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Frame Identification

Fast inference:

If predicate is unseen,

otherwise,

114

Page 115: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Frame Identification

Results

Benchmark

60.464.0

58

63

68

73

78

83

88

93

UTD LTH ThisWork

F-Measure

New Data

115

Page 116: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Frame Identification

Results

Benchmark

60.464.0

58

63

68

73

78

83

88

93

UTD LTH ThisWork

F-Measure

New Data

Bejan and Hathaway, (2007)

116

Page 117: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Frame Identification

Results

Benchmark

60.464.0

58

63

68

73

78

83

88

93

UTD LTH ThisWork

F-Measure

New Data

Johansson and Nugues, (2007)

117

Page 118: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Frame Identification

Results

Benchmark

60.464.0

68.3

58

63

68

73

78

83

88

93

UTD LTH ThisWork

F-Measure

New Data

118

Page 119: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Frame Identification

Results

Benchmark

60.464.0

68.3

58

63

68

73

78

83

88

93

UTD LTH ThisWork

F-Measure

auto predicates

New Data

119

Page 120: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Frame Identification

Results

Benchmark

60.464.0

68.3

74.2

58

63

68

73

78

83

88

93

UTD LTH ThisWork

ThisWork

F-Measure

auto predicates givenpredicates

New Data

120

Page 121: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Frame Identification

Results

Benchmark New Data

60.464.0

68.3

74.2

58

63

68

73

78

83

88

93

UTD LTH ThisWork

ThisWork

F-Measure

auto predicates givenpredicates

90.5

58

63

68

73

78

83

88

93

ThisWork

F-Measure

givenpredicates

121

Page 122: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Frame Identification

Results

Benchmark New Data

60.464.0

68.3

74.2

58

63

68

73

78

83

88

93

UTD LTH ThisWork

ThisWork

F-Measure

auto predicates givenpredicates

90.5

80.0

58

63

68

73

78

83

88

93

ThisWork

ThisWork

F-Measure

givenpredicates

no hidden variable

122

Page 123: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

2. Statistical models for structure prediction

Argument identificationFrame identification

(use of latent variables) (dual decomposition)

123

Page 124: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

Bengal ’s massive stock of food was reduced to nothing

TORE

124

Page 125: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

Bengal ’s massive stock of food was reduced to nothing

TORE

125

Page 126: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

stock

TORE

Bengal ’s

Bengal

massive stock

of food

food

massive

Bengal ’s massive stock of food

massive stock of food

126

Page 127: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

stock

TORE

Bengal ’s

Bengal

massive stock

of food

food

massive

Bengal ’s massive stock of food

massive stock of food

Ideal mapping!

127

Page 128: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

stock

TORE

Bengal ’s

Bengal

massive stock

of food

food

massive

Bengal ’s massive stock of food

massive stock of food

128

Page 129: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

stock

TORE

Bengal ’s

Bengal

massive stock

of food

food

massive

Bengal ’s massive stock of food

massive stock of food

Violates overlap constraints

129

Page 130: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

Other types of structural constraints

PLACING

Agent

Cause

Goal

Theme

Area

Time

Mutual exclusion constraint

archive.V, arrange.V, bag.V, bestow.V bin.V

130

Page 131: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

Other types of structural constraints

PLACING

Agent

Cause

Goal

Theme

Area

Time

Mutual exclusion constraint

archive.V, arrange.V, bag.V, bestow.V bin.V

If an agent places something, there cannot be a cause role in the sentence

131

Page 132: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

Other types of structural constraints

PLACING

Agent

Cause

Goal

Theme

Area

Time

Mutual exclusion constraint

archive.V, arrange.V, bag.V, bestow.V bin.V

132

AgentThe waiter placed food on the table.

In Kabul, hauling water put food on the table.Cause

Page 133: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

Other types of structural constraints

SIMILARITY

Dimension

Differentiating_fact

Entity_1

Entity_2

Degree

Requires constraint

difference.N, resemble.V,

unliike.A, vary.V

133

Page 134: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

Other types of structural constraints

A mulberry resembles a loganberry.

second entity

first entity

SIMILARITY

Dimension

Differentiating_fact

Entity_1

Entity_2

Degree

Requires constraint

difference.N, resemble.V,

unliike.A, vary.V

134

Page 135: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

Other types of structural constraints

A mulberry resembles.

SIMILARITY

Dimension

Differentiating_fact

Entity_1

Entity_2

Degree

Requires constraint

difference.N, resemble.V,

unliike.A, vary.V

135

!

Page 136: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

stock

TORE

Bengal ’s

Bengal

massive stock

of food

food

massive

Bengal ’s massive stock of food

massive stock of food

A constrained optimization

problem

136

Page 137: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

stock

TORE

Bengal ’s

Bengal

massive stock

of food

food

massive

Bengal ’s massive stock of food

massive stock of food

137

Page 138: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

stock

TORE

Bengal ’s

Bengal

massive stock

of food

food

massive

Bengal ’s massive stock of food

massive stock of food

138

Page 139: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

A constrained optimization problem

139

Page 140: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

A constrained optimization problem

a binary variable for each role, span tuple

140

Page 141: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

A constrained optimization problem

a binary vector for all role, span tuples

141

Page 142: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

A constrained optimization problem

142

Page 143: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

A constrained optimization problem

143

Page 144: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

A constrained optimization problem

Uniqueness144

Page 145: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

A constrained optimization problem

Prevents overlap 145

Page 146: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

A constrained optimization problem

more structural constraints

146

Page 147: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

A constrained optimization problem

An integer linear program (ILP)

more structural constraints

147

Page 148: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

A constrained optimization problem

An integer linear program (ILP)

more structural constraints

148

Punyakanok, Roth and Yih(2008)

Page 149: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

A constrained optimization problem

Often, very slow solutions

An integer linear program (ILP)

more structural constraints

149

Page 150: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

A constrained optimization problemAn integer linear program (ILP)

more structural constraints

Fast ILP solvers proprietary

150

Page 151: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

An alternate approach

Dual Decomposition with Alternating Direction Method of Multipliers

developed with colleagues at CMU(Das, Martins and Smith, forthcoming)

151

Page 152: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

An alternate approach

basic part:

entire space: all tuples

152

Page 153: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

An alternate approach

basic part:

entire space: all tuples

Break down the problem into many small components

e.g. -- find the best span for a role

-- for a sentence position, find the best tuple

-- for a pair of mutually exclusive roles,find the best tuple

153

Page 154: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

An alternate approach

basic part:

entire space: all tuples

Break down the problem into many small components

e.g. -- find the best span for a role

-- for a sentence position, find the best tuple

-- for a pair of mutually exclusive roles,find the best tuple

impose agreement between components

154

Page 155: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

An alternate approach

For each component, a binary vector:

155

Page 156: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

An alternate approach

For each component, a binary vector:

Total score assigned to :

156

Page 157: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

An alternate approach

For each component, a binary vector:

Total score assigned to :

uses

157

Page 158: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

An alternate approach

158

Page 159: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

An alternate approach

witness vector for consensus

159

Page 160: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

An alternate approach

Primal:

160

Page 161: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

An alternate approach

Primal':

Integer constraints relaxed

161

Page 162: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

An alternate approach

Primal':

An augmented Lagrangian function

162

Page 163: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

An alternate approachAn augmented Lagrangian function

Saddle point can be found using several decoupled

worker problems

163

Page 164: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

An alternate approachAn augmented Lagrangian function

Three types of iterative updates:

1. Lagrange multiplier updates ( )2. Consensus variable updates ( )

3. updates

164

Page 165: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

An alternate approachAn augmented Lagrangian function

Three types of iterative updates:

1. Lagrange multiplier updates ( )2. Consensus variable updates ( )

3. updatesAt decoupled workers

165

Page 166: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

An alternate approachAn augmented Lagrangian function

Three types of iterative updates:

1. Lagrange multiplier updates ( )2. Consensus variable updates ( )

3. updatesAt decoupled workers

e.g. for each role, we have a worker that imposes a XOR/uniqueness constraint

166

Page 167: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

An alternate approachAn augmented Lagrangian function

Three types of iterative updates:

1. Lagrange multiplier updates ( )2. Consensus variable updates ( )

3. updatesAt decoupled workers

e.g. for each role, we have a worker that imposes a XOR/uniqueness constraint

Projection onto a simplexa simple sort operation

167

Page 168: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

An alternate approachAn augmented Lagrangian function

Three types of iterative updates:

1. Lagrange multiplier updates ( )2. Consensus variable updates ( )

3. updatesAt decoupled workers

e.g. for each role, we have a worker that imposes a XOR/uniqueness constraint

Projection onto a simplexa simple sort operation

Challenge:define fast, simple workers

168

Page 169: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

An alternate approach

Advantages:

1) Significant speedup2) No proprietary solver necessary

169

Page 170: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

An alternate approach

Advantages:

43.12

4.780

10

20

30

40

50

CPLEX (ILP) Dual Decomposition

Time to decode the test set in seconds

1) Significant speedup2) No proprietary solver necessary

170

Page 171: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

An alternate approach

Advantages:

43.12

4.780

10

20

30

40

50

CPLEX (ILP) Dual Decomposition

Time to decode the test set in seconds

1) Significant speedup2) No proprietary solver necessary

171

Certificate of Optimality in

>99% of examples

Page 172: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

stock

TORE

Bengal ’s

Bengal

massive stock

of food

food

massive

Bengal ’s massive stock of food

massive stock of food

172

Page 173: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

stock

TORE

Bengal ’s

Learning?

173

Page 174: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

stock

TORE

Bengal ’s

Learning?

Maximum conditional log-likelihoodof local role span pairs

(batch training using L-BFGS)

174

Page 175: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

Results

New Data

Benefit of joint inference175

8283.8

76.4 76.2

79.1 79.8

70

75

80

85

Local DualDecomposition

Precision

Recall

F-Measure

Page 176: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

Results

New Data

Benefit of joint inference

8283.8

76.4 76.2

79.1 79.8

70

75

80

85

Local DualDecomposition

Precision

Recall

F-Measure

501 linguistic violations

176

Page 177: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Argument Identification

Results

New Data

Benefit of joint inference

501 linguistic violations No violations

177

8283.8

76.4 76.2

79.1 79.8

70

75

80

85

Local DualDecomposition

Precision

Recall

F-Measure

Page 178: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Full Parsing

Final Results

Benchmark New Data

37.9

45.6

35

40

45

50

55

60

65

70

UTD LTH ThisWork

F-Measure

auto predicates

178

Page 179: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Full Parsing

Final Results

Benchmark New Data

37.9

45.6

50.2

35

40

45

50

55

60

65

70

UTD LTH ThisWork

F-Measure

auto predicates

179

Page 180: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Full Parsing

Final Results

Benchmark New Data

37.9

45.6

50.253.6

35

40

45

50

55

60

65

70

UTD LTH ThisWork

ThisWork

F-Measure

auto predicates givenpredicates

180

Page 181: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Full Parsing

Final Results

Benchmark New Data

37.9

45.6

50.253.6

35

40

45

50

55

60

65

70

UTD LTH ThisWork

ThisWork

F-Measure

auto predicates givenpredicates

68.5

35

40

45

50

55

60

65

70

This Work

givenpredicates

F-Measure

181

Page 182: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

1. Why semantic analysis?

3. Semi-supervised learning for robustness

2. Statistical models for structure prediction

Novel graph-based learning algorithms

182

Page 183: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

1. Why semantic analysis?

3. Semi-supervised learning for robustness

Novel graph-based learning algorithms

(Das and Smith, ACL 2011)

2. Statistical models for structure prediction

183

Page 184: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

90.5

405060708090

All Predicates

F-Measure

Results on Unknown Predicates

46.6

40

50

60

70

80

90

Unknown Predicates

F-Measure

Frame Identification

184

Page 185: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

90.5

405060708090

All Predicates

F-Measure

Results on Unknown Predicates

46.6

40

50

60

70

80

90

Unknown Predicates

F-Measure

Frame Identification

68.5

25303540455055606570

All Predicates

F-Measure

30.2

25303540455055606570

Unknown Predicates

F-Measure

Full Parsing

185

Page 186: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Handling Unknown Predicates

Knowledge of only 9,263 predicates in supervised data

186

Page 187: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Handling Unknown Predicates

Knowledge of only 9,263 predicates in supervised data

187

However, English has lot more potential predicates(~65,000 in newswire English)

Page 188: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Handling Unknown Predicates

Knowledge of only 9,263 predicates in supervised data

However, English has lot more potential predicates(~65,000 in newswire English)

Lexicon expansion using graph-based semi-supervised learning

188

Page 189: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Build a graph over potential predicates as vertices• compute similarity matrix using co-occurrence statistics

Label distribution at each vertexdistribution over frames that the predicate can evoke

How can label propagation help?

189

Page 190: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Build a graph over potential predicates as vertices• compute similarity matrix using co-occurrence statistics

Label distribution at each vertexdistribution over frames that the predicate can evoke

How can label propagation help?

Idea very similar to Das and Petrov (ACL 2011): unsupervised lexicon expansion for POS tagging

190

Page 191: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Example Graph

Seed predicates

191

Page 192: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Example Graph

Seed predicatesUnseen predicates

192

Page 193: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Example Graph

Graph Propagation193

Page 194: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Example Graph

Graph Propagation194

Page 195: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Example Graph

Graph Propagation

Continues till convergence...

195

Page 196: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Brief Overview:Graph-Based Learning

with Labeled and Unlabeled Data

196

Page 197: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

197

Page 198: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

0.9

0.01

0.8

0.9

0.1

= symmetric weight matrix

0.05

198

Page 199: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

labeled datapointsunlabeled datapoints

0.9

0.01

0.8

0.9

0.1

= symmetric weight matrix

0.05

199

Page 200: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

0.9

0.01

0.8

0.9

0.1

= symmetric weight matrix

0.05

200

Page 201: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

0.9

0.01

0.8

0.9

0.1

= symmetric weight matrix

0.05

supervised label distributions

201

Page 202: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

0.9

0.01

0.8

0.9

0.1

= symmetric weight matrix

0.05

supervised label distributions

distributions to be found

202

Page 203: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Label Propagation

Minimize:

Das and Smith, forthcoming 203

Page 204: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Label Propagation

Minimize:

204

Brings the distribution of observed and induced distributions over labeled vertices closer

Page 205: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Label Propagation

Minimize:

205

brings the distributions of similarvertices closer

Page 206: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Label Propagation

Minimize:

206

induces sparse distributionsin each vertex

Page 207: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Constrained Inference

If predicate is seen,

otherwise,

else if predicate is in graph,

207

Page 208: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Constrained Inference

If predicate is seen,

otherwise,

else if predicate is in graph,

Six times faster inference on unknown predicates!

208

Page 209: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Results on Unknown Predicates

46.642.7

65.3

40455055606570

Supervised Self-Training Graph-Based

F-Measure

Frame Identification

209

Page 210: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Results on Unknown Predicates

46.642.7

65.3

40455055606570

Supervised Self-Training Graph-Based

F-Measure

Frame Identification

Full Parsing

30.226.6

46.7

25

30

35

40

45

50

Supervised Self-Training Graph-Based

F-Measure

210

Page 211: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Conclusions

211

Page 212: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Parsing using the theory of frame semantics– richer output than popular SRL systems

(Kingsbury and Palmer, 2002)

– domain general in comparison to deep semantic parsers

Significantly better performance on benchmark datasets than previous work– less independence assumptions

– only two statistical models

– semi-supervised extensions

Conclusions

212

Page 213: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Train parsers in other languages– Spanish, German, Portuguese, Japanese, Chinese, Swedish

Use presented techniques for deeper semantic analysis tasks–Especially semi-supervised learning

Use parser for NLP applications

–(right now being used to bootstrap more annotations)

Future Work

213

Page 214: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Case Grammar(“The Case for Case”,

Fillmore, 1968)

Frame Semantics(“Frame Semantics”,

Fillmore, 1982)

FrameNetPropBankVerbNet

NomBankOntoNotes

Frames(“A Framework

for Representing Knowledge”, Minsky, 1975)

Scripts(“Scripts, Plans,

Goals and Understanding”, Schank and Abelson, 1977)

MUCACE

GENIA

Data-Driven ShallowSemantic Parsing

Information Extraction(template filling)

Slide idea taken from Brendan O’Connor

More annotationsLarger lexicons

214

Page 215: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

Last Word

Parser available at:

http://www.ark.cs.cmu.edu/SEMAFOR

(200 downloads in the past 6 months)

215

Page 216: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

216

Page 217: Robust Shallow Semantic Parsing of Text › ~dipanjan › ttic_presentation.pdf · Semantic Parsing Information Extraction (template filling) Slide idea taken from Brendan O’Connor

UDGMENT_DIRECT_ADDRESS

217