modeling and learning semantic co-compositionality through prototype...

64
Modeling and Learning Semantic Co-Compositionality through Prototype Projections and Neural Networks Masashi Tsubaki , Kevin Duh, Masashi Shimbo, Yuji Matsumoto Nara Institute of Science and T echnology (NAIST), Japan

Upload: others

Post on 17-Mar-2020

13 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Modeling and Learning Semantic Co-Compositionality through Prototype Projections and Neural Networks

Masashi Tsubaki, Kevin Duh, Masashi Shimbo, Yuji Matsumoto

Nara Institute of Science and Technology (NAIST), Japan

Page 2: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Unsupervised word vector re-training algorithm

considering compositionality

New model of compositionality

in word vector space

Masashi Tsubaki 1

Contributions

Two contributions in our work

Page 3: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Unsupervised word vector re-training algorithm

considering compositionality

New model of compositionality

in word vector space

Masashi Tsubaki 1

Contributions

Two contributions in our work

Page 4: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 2

Introduction

From word to phrase representation with matrix-vector operation [Mitchell and Lapata 08], [Baroni and Zamparell 10], [Socher+ 12], [Van de Cruys+ 13]

run

company

run company = operate

marathon

run marathon = race

Modeling of compositionality in word vector space

Page 5: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 2

run

company

run company = operate

marathon

run marathon = race

From word to phrase representation with matrix-vector operation [Mitchell and Lapata 08], [Baroni and Zamparell 10], [Socher+ 12], [Van de Cruys+ 13]

Introduction

Modeling of compositionality in word vector space

Page 6: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 2

run

company

run company = operate

marathon

run marathon = race

From word to phrase representation with matrix-vector operation [Mitchell and Lapata 08], [Baroni and Zamparell 10], [Socher+ 12], [Van de Cruys+ 13]

Introduction

Modeling of compositionality in word vector space

Page 7: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 2

run

company

run company = operate

marathon

run marathon = race

New model inspired by Co-Compositionality

From word to phrase representation with matrix-vector operation [Mitchell and Lapata 08], [Baroni and Zamparell 10], [Socher+ 12], [Van de Cruys+ 13]

Introduction

Modeling of compositionality in word vector space

Page 8: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 3

Verb and object are allowed to modify each other’s meanings and generate the overall semantics

Co-compositionality  

Our Model

Main Idea : Co-Compositionality [Pustejovsky 1995]

Page 9: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 3

f( run , company ) = operate

f( run , marathon ) = race

Verb and object are allowed to modify each other’s meanings and generate the overall semantics

Co-compositionality  

Our Model

Main Idea : Co-Compositionality [Pustejovsky 1995]

Page 10: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 3

f( runcompany , company ) = operate

f( runmarathon , marathon ) = race

Verb and object are allowed to modify each other’s meanings and generate the overall semantics

Co-compositionality  

Our Model

Main Idea : Co-Compositionality [Pustejovsky 1995]

Page 11: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 3

f( runcompany , companyrun ) = operate

f( runmarathon , marathonrun ) = race

Verb and object are allowed to modify each other’s meanings and generate the overall semantics

Co-compositionality  

Our Model

Main Idea : Co-Compositionality [Pustejovsky 1995]

Page 12: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 3

f( runcompany , companyrun ) = operate

f( runmarathon , marathonrun ) = race

Verb and object are allowed to modify each other’s meanings and generate the overall semantics

Co-compositionality  

Our Model

Main Idea : Co-Compositionality [Pustejovsky 1995]

Page 13: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki

Verb and object are allowed to modify each other’s meanings and generate the overall semantics

3

How do we implement co-compositionality in vector space ?

Question

f( runcompany , companyrun ) = operate

f( runmarathon , marathonrun ) = race

Co-compositionality  

Our Model

Main Idea : Co-Compositionality [Pustejovsky 1995]

Page 14: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 4

Our Model

Matrix-vector operation as an implementation for Co-Compositionality

Prototype Projection

Prototype Projection for Co-Compositionality

Page 15: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki

run company

5

Our Model

Co-Compositionality with Prototype Projections

Page 16: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki

start

build

buy

run

VerbOf

・・・

company

5

Prototype verbs of “company”

Our Model

Co-Compositionality with Prototype Projections

Page 17: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki

× start

build

buy

run

VerbOf

・・・

≈ ・・・

company

5

Prototype verbs of “company”

Our Model

Co-Compositionality with Prototype Projections

Page 18: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki

×

V

start

build

buy

run

VerbOf

・・・

≈ ・・・

company

5

Latent subspace formed by prototype verb vectors of company

Prototype verbs of “company”

Our Model

Co-Compositionality with Prototype Projections

Page 19: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki

×

V

start

build

buy

run

VerbOf

runcompany

・・・

≈ ・・・

company Pcompany=VTV

5

(Orthogonal projection matrix to V)

Prototype verbs of “company”

Latent subspace formed by prototype verb vectors of company

Our Model

Co-Compositionality with Prototype Projections

Page 20: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki

×

V

start

build

buy

run

VerbOf

runcompany

・・・

≈ ・・・

company Pcompany=VTV

Prototype Projection

5

(Orthogonal projection matrix to V)

Prototype verbs of “company”

Latent subspace formed by prototype verb vectors of company

Our Model

Co-Compositionality with Prototype Projections

Page 21: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki

×

V

start

build

buy

run

VerbOf

runcompany

ObjectOf

・・・

≈ ・・・

firm

bank

hotel

・・・

company Pcompany=VTV

Prototype Projection

5

(Orthogonal projection matrix to V)

Prototype verbs of “company”

Prototype objects of “run”

Our Model

Co-Compositionality with Prototype Projections

Page 22: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki

×

V

start

build

buy

run

VerbOf

runcompany

ObjectOf

・・・ O

≈ ・・・

× firm

bank

hotel

・・・

≈ ・・・

company Pcompany=VTV

Prototype Projection

5

(Orthogonal projection matrix to V)

Prototype verbs of “company”

Prototype objects of “run”

Our Model

Co-Compositionality with Prototype Projections

Page 23: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki

×

V

start

build

buy

run

VerbOf

runcompany

ObjectOf

・・・ O

Prun=OTO

≈ ・・・

× firm

bank

hotel

・・・

≈ ・・・

company

companyrun

Pcompany=VTV Prototype Projection

5

(Orthogonal projection matrix to V)

Prototype verbs of “company”

Prototype objects of “run”

Our Model

Co-Compositionality with Prototype Projections

Page 24: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki

×

V

start

build

buy

operate =

run

VerbOf

runcompany

ObjectOf

・・・ O

Prun=OTO

≈ ・・・

× firm

bank

hotel

・・・

≈ ・・・

company

companyrun

+  

Pcompany=VTV Prototype Projection

5

Prototype verbs of “company”

Prototype objects of “run”

(Orthogonal projection matrix to V)

Our Model

Co-Compositionality with Prototype Projections

Page 25: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 6

start finish

buy

build

race

operate

run

Tease out the proper semantics from aggregate representation by projection to latent space

Assume various senses of “run” are aggregated in one vector

Prototype verbs of “marathon”

Prototype verbs of “company”

Our Model

Intuitive image of prototype projection

Page 26: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 7

Experiment

Evaluation dataset [Grefenstette and Sadrzadeh 11]

200 subject-verb-object triples judged by 25 participants

Subj-Verb-Obj Landmark verb Similarity of human judgment

People-run-company operate 7

People-run-company move 2

Evaluation : Verb disambiguation in subject-verb-object triples

Page 27: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 7

Evaluation dataset [Grefenstette and Sadrzadeh 11]

200 subject-verb-object triples judged by 25 participants

Subj-Verb-Obj Landmark verb Similarity of human judgment

People-run-company operate 7

People-run-company move 2

Final co-compositional vector for subject-verb-object

subj + cocompositioned(verb,obj)

Experiment

Evaluation : Verb disambiguation in subject-verb-object triples

Page 28: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki

Models are evaluated by Spearman’s rank correlation between vectors’ computed similarity and human judgment

7

Evaluation dataset [Grefenstette and Sadrzadeh 11]

200 subject-verb-object triples judged by 25 participants

Subj-Verb-Obj Landmark verb Similarity of human judgment

People-run-company operate 7

People-run-company move 2

Final co-compositional vector for subject-verb-object

subj + cocompositioned(verb,obj)

Experiment

Evaluation : Verb disambiguation in subject-verb-object triples

Page 29: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Experiment

Masashi Tsubaki 8

×

V

start

build

buy

run

VerbOf ObjectOf

・・・ O

≈ ・・・

× firm

bank

hotel

・・・

≈ ・・・

company

Extracted 20 prototype words from ukWaC corpus

Implementation details

Page 30: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 8

high frequency

×

V

start

build

buy

run

VerbOf ObjectOf

・・・ O

≈ ・・・

× firm

bank

hotel

・・・

≈ ・・・

company

Extracted 20 prototype words from ukWaC corpus

Experiment

Implementation details

Page 31: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 8

high frequency both high frequency and high similarity

×

V

start

build

buy

run

VerbOf ObjectOf

・・・ O

≈ ・・・

× firm

bank

hotel

・・・

≈ ・・・

company

Extracted 20 prototype words from ukWaC corpus

Experiment

Implementation details

Page 32: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 8

high frequency both high frequency and high similarity 80% of the top singular values

×

V

start

build

buy

run

VerbOf ObjectOf

・・・ O

≈ ・・・

× firm

bank

hotel

・・・

≈ ・・・

company

Extracted 20 prototype words from ukWaC corpus

Experiment

Implementation details

Page 33: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 8

high frequency both high frequency and high similarity 80% of the top singular values

Word representation [Blacoe and Lapata 12] ①Distributional vector (2000 dim) ②Neural vector (50 dim)

×

V

start

build

buy

run

VerbOf ObjectOf

・・・ O

≈ ・・・

× firm

bank

hotel

・・・

≈ ・・・

company

Extracted 20 prototype words from ukWaC corpus

Experiment

Implementation details

Page 34: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Experiment

Masashi Tsubaki 9

Add [Mitchell and Lapata 08]

Multiply [Mitchell and Lapata 08]

Grefenstette and Sadrzadeh 11

Mathematical model based on abstract categorical framework

Van de Cruys+13 Multi-way interaction model

based on non-negative matrix factorization

sbj × verb×obj

sbj + verb+obj

Baselines : Models compared to ours

Page 35: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Achieves high performance (ρ = 0.41)

0.31  0.35  

0.21  

0.37  0.41  

0  

0.05  

0.1  

0.15  

0.2  

0.25  

0.3  

0.35  

0.4  

0.45  

1   2   3   4   5  

Cor

rela

tion ρ

Masashi Tsubaki 10

Add Multiply Grefenstette and Sadrzadeh 11

Van de Cruys+ 13

Our Model

Result and Discussion

Correlation with human judgment (Distributional vector)

Page 36: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 11

State of the art performance (ρ= 0.44)

0.31   0.3  

0.21  

0.37  

0.44  

0  

0.05  

0.1  

0.15  

0.2  

0.25  

0.3  

0.35  

0.4  

0.45  

0.5  

1   2   3   4   5  Add Multiply Grefenstette and Sadrzadeh 11

Van de Cruys+ 13

Our Model

Result and Discussion

Cor

rela

tion ρ

Correlation with human judgment (Neural vector)

Page 37: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 11

State of the art performance (ρ= 0.44)

0.31   0.3  

0.21  

0.37  

0.44  

0  

0.05  

0.1  

0.15  

0.2  

0.25  

0.3  

0.35  

0.4  

0.45  

0.5  

1   2   3   4   5  Add Multiply Grefenstette and Sadrzadeh 11

Van de Cruys+ 13

Our Model

Result and Discussion

Co-Compositionality is useful for word sense disambiguation Prototype projection is effective implementation for Co-Compositionality

Cor

rela

tion ρ

Correlation with human judgment (Neural vector)

Page 38: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Unsupervised word vector re-training algorithm

considering compositionality

New model of compositionality

in word vector space

Masashi Tsubaki 12

Introduction

Two contributions in our work

Page 39: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Unsupervised word vector re-training algorithm

considering compositionality

New model of compositionality

in word vector space

Masashi Tsubaki 13

Introduction

Two contributions in our work

Page 40: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Our Model

Masashi Tsubaki 14

Re-training word representation with decomposition of phrase vector

Compositional Neural Language Model

Page 41: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 14

v orun company

z  =  v  +  o

s=uTz

Re-training word representation with decomposition of phrase vector

u

①Compute the score s of correct phrase

Our Model

Compositional Neural Language Model

Page 42: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 14

vc eat company

zc  =  vc  +  o

Re-training word representation with decomposition of phrase vector

sc=uTz

u

o

①Compute the score s of correct phrase ②Compute the score sc of corrupted incorrect phrase

Our Model

Compositional Neural Language Model

Page 43: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 14

Re-training word representation with decomposition of phrase vector

vc vc eat company

zc  =  vc  +  o

sc=uTz

u

o

J =max 0,1− s+ sc( )Correct score > Incorrect score

①Compute the score s of correct phrase ②Compute the score sc of corrupted incorrect phrase

Our Model

Compositional Neural Language Model

Page 44: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 14

Re-training word representation with decomposition of phrase vector

vc eat company

zc  =  vc  +  o

sc=uTz

u

o

J =max 0,1− s+ sc( )Correct score > Incorrect score

①Compute the score s of correct phrase ②Compute the score sc of corrupted incorrect phrase     ③Minimize cost function by SGD, u → unew, z → znew

Our Model

Compositional Neural Language Model

Page 45: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 14

znew

Re-training word representation with decomposition of phrase vector

s=uTz

unew

J =max 0,1− s+ sc( )Correct score > Incorrect score

①Compute the score s of correct phrase ②Compute the score sc of corrupted incorrect phrase     ③Minimize cost function by SGD, u → unew, z → znew

Our Model

Compositional Neural Language Model

Page 46: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 14

vnew run company

znew

Re-training word representation with decomposition of phrase vector

s=uTz

unew

o

J =max 0,1− s+ sc( )Correct score > Incorrect score

①Compute the score s of correct phrase ②Compute the score sc of corrupted incorrect phrase     ③Minimize cost function by SGD, u → unew, z → znew ④New verb vector is vnew = znew - o

Our Model

Compositional Neural Language Model

Page 47: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 14

vnew run company

J =max 0,1− s+ sc( )Correct score > Incorrect score

znew

Re-training word representation with decomposition of phrase vector

New word representations considering compositionality

s=uTz

unew

o

①Compute the score s of correct phrase ②Compute the score sc of corrupted incorrect phrase     ③Minimize cost function by SGD, u → unew, z → znew ④New verb vector is vnew = znew - o

Our Model

Compositional Neural Language Model

Page 48: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 15

x y

z  =  x  +  y

s=uTz

u

Our Model

Compositional Neural Language Model

Page 49: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 15

x y

z  =  x  +  y

s=uTz

u Co-Compositionality with Prototype Projection +

Our Model

Compositional Neural Language Model

Page 50: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 15

x y

z  =  x  +  y

s=uTz

u Co-Compositionality with Prototype Projection +

v orun company

Pverb Pobj

Our Model

Compositional Neural Language Model

Page 51: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Our Model

Masashi Tsubaki 15

Compositional Neural Language Model with Prototype Projection

x y

z  =  x  +  y

s=uTz

u Co-Compositionality with Prototype Projection +

v orun company

Pverb Pobj

Co-Compositional Neural Language Model

Page 52: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 15

Compositional Neural Language Model with Prototype Projection

x y

z  =  x  +  y

s=uTz

u

v orun company

Pverb Pobj

①Prototype projection for both verb and object ②Optimize parameters with same method as Compositional NLM ③Minimize

minv

xnew −Pobjv2+λ v 2( )

Our Model

Co-Compositional Neural Language Model

Page 53: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 15

Compositional Neural Language Model with Prototype Projection

xnew y

s=uTz

znew

unew

v orun company

Pverb Pobj

①Prototype projection for both verb and object ②Optimize parameters with same method as Compositional NLM ③Minimize

minv

xnew −Pobjv2+λ v 2( )

Our Model

Co-Compositional Neural Language Model

Page 54: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 15

Compositional Neural Language Model with Prototype Projection

xnew y

s=uTz

znew

unew

vnew orun company

Pverb Pobj

①Prototype projection for both verb and object ②Optimize parameters with same method as Compositional NLM ③Minimize

minv

xnew −Pobjv2+λ v 2( )

Our Model

Co-Compositional Neural Language Model

Page 55: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 15

Compositional Neural Language Model with Prototype Projection

xnew y

s=uTz

vnew orun company

Pverb Pobj

znew

unew

New word representations considering co-compositionality

①Prototype projection for both verb and object ②Optimize parameters with same method as Compositional NLM ③Minimize

minv

xnew −Pobjv2+λ v 2( )

Our Model

Co-Compositional Neural Language Model

Page 56: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Experiment

Masashi Tsubaki 16

Original neural vector [Blacoe and Lapata 12]

Re-trained neural vector with our learning models

vs.

Evaluation : Verb disambiguation [Grefenstette and Sadrzadeh 11]

Page 57: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki

Original neural vector [Blacoe and Lapata 12]

Re-trained neural vector with our learning models

Training data Extracted 5000 Verb-Obj pairs from ukWaC corpus

Hyper-parameters Learning rate:0.01, Regularization:10^4 20 iterations (One iteration is one run through the training data)

vs.

16

Experiment

Evaluation : Verb disambiguation [Grefenstette and Sadrzadeh 11]

Page 58: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Result and Discussion

Masashi Tsubaki 17

Higher performance with re-trained word representation

Cor

rela

tion ρ

0.31  

0.44  

0.38  

0.47  

0  

0.05  

0.1  

0.15  

0.2  

0.25  

0.3  

0.35  

0.4  

0.45  

0.5  

1   2  Co-Compositional NLM Compositional NLM

Original

Re-trained

New state of the art performance (ρ= 0.47)

Correlation with human judgment (Re-trained neural vector)

Page 59: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Unsupervised word vector re-training algorithm

considering compositionality

New model of compositionality

in word vector space

Masashi Tsubaki 18

Summary

Co-Compositionality with Prototype Projection

Compositional & Co-Compositional Neural Language Models Achieve state of the art on verb disambiguation task

Conclusion

Page 60: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 19

Examples Appendix

Page 61: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 20

Results of the different compositionality models Appendix

Page 62: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 21

The number of prototype words Appendix

Page 63: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 22

Variations in model configuration Appendix

Page 64: Modeling and Learning Semantic Co-Compositionality through Prototype …kevinduh/papers/tsubaki13cocomp... · 2013-11-06 · Modeling and Learning Semantic Co-Compositionality! through

Masashi Tsubaki 23

Composition operator and parameter Appendix