application of future sentence reference extraction in...

24
*Yoko Nakajima Michal Ptaszynski Hirotoshi Honma Fumito Masui Application of Future Sentence Reference Extraction in Support of Future Event Prediction International Workshop on Language Sense on Computer in IJCAI2016 1

Upload: others

Post on 18-Jul-2020

16 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Application of Future Sentence Reference Extraction in ...arakilab.media.eng.hokudai.ac.jp/~ptaszynski/data/ijcai16-WS12_fina… · Evaluation • 270 sentences from Mainichi Newspaper

*Yoko NakajimaMichal PtaszynskiHirotoshi Honma

Fumito Masui

Application of Future Sentence Reference Extraction in Support of Future Event Prediction

International Workshop on Language Sense on Computer in IJCAI2016

1

Page 2: Application of Future Sentence Reference Extraction in ...arakilab.media.eng.hokudai.ac.jp/~ptaszynski/data/ijcai16-WS12_fina… · Evaluation • 270 sentences from Mainichi Newspaper

2

• options: 1.Strategy option ‘A’ when both events ‘X’ and ‘Y’ happen. 2.Strategy option ‘B’ when the event ‘X’ happens and the

event ‘Y’ does not happen. 3.Strategy option ‘C’ when the event ‘Y’ happens and the

event ‘X’ does not happen.4.Strategy option ‘A’ when both events ‘X’ and ‘Y’do not

happen.

Background

Economictrend Political policy

Internationalsituations

Newsvarious mentionsMention from others

• Future events : - Event ‘X’- Event ‘Y’

2

Page 3: Application of Future Sentence Reference Extraction in ...arakilab.media.eng.hokudai.ac.jp/~ptaszynski/data/ijcai16-WS12_fina… · Evaluation • 270 sentences from Mainichi Newspaper

3

• Typical options: 1.Strategy option ‘A’ when both events ‘X’ and ‘Y’ happen. 2.Strategy option ‘B’ when the event ‘X’ happens and the

event ‘Y’ does not happen. 3.Strategy option ‘C’ when the event ‘Y’ happens and the

event ‘X’ does not happen.4.Strategy option ‘A’ when both events ‘X’ and ‘Y’do not

happen.

Background

• Future events : - Event ‘X’- Event ‘Y’

supporting

3

Page 4: Application of Future Sentence Reference Extraction in ...arakilab.media.eng.hokudai.ac.jp/~ptaszynski/data/ijcai16-WS12_fina… · Evaluation • 270 sentences from Mainichi Newspaper

4

knowledge including

Future reference sentences

Background

• options: 1.Strategy option ‘A’ when both events ‘X’ and ‘Y’ happen. 2.Strategy option ‘B’ when the event ‘X’ happens and the

event ‘Y’ does not happen. 3.Strategy option ‘C’ when the event ‘Y’ happens and the

event ‘X’ does not happen.4.Strategy option ‘A’ when both events ‘X’ and ‘Y’do not

happen.

• past stories • current facts • experiences • survey results • research

✦ decision

Support of Future Event Prediction

4

Page 5: Application of Future Sentence Reference Extraction in ...arakilab.media.eng.hokudai.ac.jp/~ptaszynski/data/ijcai16-WS12_fina… · Evaluation • 270 sentences from Mainichi Newspaper

5

Relevant study:Future events extraction

• Extract the future events- method with explicit future tense expressions (for

English) [1]  next year, in 20xx, later,from 20xx to 20xx, .. etc.

“X may happen in 20xx” “X is scheduled to happen next week” “X will happen until 20xx”

“may happen” “is scheduled to happen” “ will happen”

- method with future word expressions (for English) [2]

[1] Jatowt et al. Extracting Collective Expectations about the Future from Large Text Collection,2009[2] Kanazawa et al. Extracting Explicit and Implicit future-related Information from the Web.(O),2010

5

Page 6: Application of Future Sentence Reference Extraction in ...arakilab.media.eng.hokudai.ac.jp/~ptaszynski/data/ijcai16-WS12_fina… · Evaluation • 270 sentences from Mainichi Newspaper

6

main topics

1. Automatic Extraction of future reference sentences based on mtorphosemantic.

2. Future prediction Support experiment

6

Page 7: Application of Future Sentence Reference Extraction in ...arakilab.media.eng.hokudai.ac.jp/~ptaszynski/data/ijcai16-WS12_fina… · Evaluation • 270 sentences from Mainichi Newspaper

7

Morphosemantic Structure :MS

Morphosemantic Patterns :MoPs

training

classify

Classification Method

labelingTest data with MS

Future ReferenceSentences

OtherSentences

Natural Language Sentences (future / no future

each 130 sentences)

Classifier(Future/ No future) generate

Classifier(Future/ No future)

Trading phase Test phase

7

Page 8: Application of Future Sentence Reference Extraction in ...arakilab.media.eng.hokudai.ac.jp/~ptaszynski/data/ijcai16-WS12_fina… · Evaluation • 270 sentences from Mainichi Newspaper

8

propose method for classification of future reference sentences

Morphology + Semantic

“Taro will give Hanako some flower.”

“n pp n pp n pp v av av s” with morpheme

“Taro ha Hanako ni hana wo okuru darou”

n:nounpp:a postpositional particlev: verbav:auxiliary verbs:symbol

Morphosemantic

J Japanese:

English:E

Morphology:

Morphology

8

Page 9: Application of Future Sentence Reference Extraction in ...arakilab.media.eng.hokudai.ac.jp/~ptaszynski/data/ijcai16-WS12_fina… · Evaluation • 270 sentences from Mainichi Newspaper

9

propose method for classification of future reference sentences

“Taro will give Hanako some flower.”“Taro ha Hanako ni hana wo okuru darou”

“[Agent][Goal][Object][State-change]” with semantic role labels

based on predicate argument structure considered dependency between words

J

E

Hana woHanako niTaro ha

okuru darou

[Agent] [Goal] [Object]

[State_change]

argumentssemantic

role labels

predicate argument

Semantic

9

Page 10: Application of Future Sentence Reference Extraction in ...arakilab.media.eng.hokudai.ac.jp/~ptaszynski/data/ijcai16-WS12_fina… · Evaluation • 270 sentences from Mainichi Newspaper

10

Natural Language Sentences

E : “Taro will give Hanako flowers”.JR: “Taro wa/ Hanako ni/ hana wo/ okuru darou”.

“[Agent][Goal][Object][State_change]”

[Agent]*[Object] [Agent]*[State_change] *[Point]*[State_change] [Agent]*[Object][State_change]

MoPs:

MS:

• combine elements (1 ~ 6 ) of MS and “*” • keep order elements of MS

JR: Japanese RomanizedE:English

Generating MoPs from a sentence

training

Classifier

10

Page 11: Application of Future Sentence Reference Extraction in ...arakilab.media.eng.hokudai.ac.jp/~ptaszynski/data/ijcai16-WS12_fina… · Evaluation • 270 sentences from Mainichi Newspaper

11

classify

Classification Method

Test data with MS

Future ReferenceSentences

OtherSentences

Classifier

Classifying phase

<with MS> [Agent][Goal][Object][State_change] [Time][Agent][Object][No_state_change][State_change]] [Object][Time][No_state_change] [Place][Adverb][State_change] [Organization][Noun][Place][Noun] …

<FRS> JR: “John wa Mary ni tegami wo okuru darou”. E: “John will send Mary a letter”.

<Other sentences> JR: “ “big apple” wa new york shi no nikkune-mu dearu”. E: “The big apple is New York City’s nickname”.

11

Page 12: Application of Future Sentence Reference Extraction in ...arakilab.media.eng.hokudai.ac.jp/~ptaszynski/data/ijcai16-WS12_fina… · Evaluation • 270 sentences from Mainichi Newspaper

12

Evaluation

• 270 sentences from Mainichi Newspaper (1996) 100 sentences and 170 sentences annotated by three people.• domain: Economics and International• Keyword: energy

Test data setting:

break-even point0.76

Threshold values for classifying FRS

MoPsClassifire: calculated with length awarded for Morphosemantic pattens

F Nf

F-measure

12

Page 13: Application of Future Sentence Reference Extraction in ...arakilab.media.eng.hokudai.ac.jp/~ptaszynski/data/ijcai16-WS12_fina… · Evaluation • 270 sentences from Mainichi Newspaper

13

Future Prediction Support Experiment

Experiment setup - 30 participants(male:22, female:8 [19~25 years old(28people), 45~50 years old(2people) ])- 7 future event questions toward two years selected from FPCT[*1] in 2009- 17~30 FPSS for each the question, extracted automatically from Mainichi Newspaper in 2009.- Compared with correct rate with the result of FPCT

Future reference sentences

Questions from

FPCT

Futureevent

happen/not

[*1]: Future Prediction Competence Test

This Experimenttypical human processing

Future Event

Prediction Prediction

Futureevent

happen/not

13

Page 14: Application of Future Sentence Reference Extraction in ...arakilab.media.eng.hokudai.ac.jp/~ptaszynski/data/ijcai16-WS12_fina… · Evaluation • 270 sentences from Mainichi Newspaper

14

The Future Prediction Competence Test

Future Predict Competence Test:   - by Language Responsibility Assurance Association[3] from 2006. - prediction questions which future trend (events) toward one or two years. - will correct the test 2years later and certify as appropriate grade. Purpose:  - to support people of increased public responsibility for examples, managers, politicians etc. - people responsible of making decisions influencing civic life. Format: - 6 fields (politics, economics, international events, science and technology, society, leisure) - answer at least 15 questions choosing from 30 questions. - multiple-choice questions,answer predicting specific numbers and reasons. - 8points/a question (correct:6points,when requiring reasons: 2points) Allow something: - browse any and all materials, and are free to seek the opinions of others. - one year.

[3] The Language Responsibility Assurance Association: http://homepage3.nifty.com/genseki/

14

Page 15: Application of Future Sentence Reference Extraction in ...arakilab.media.eng.hokudai.ac.jp/~ptaszynski/data/ijcai16-WS12_fina… · Evaluation • 270 sentences from Mainichi Newspaper

15

Question1:Predict whether the following bills approve it at the end of June, 2010, and they are passed.

(1) Local franchise grant to a permanent residence foreigner: (a) be enacted (b) not be enacted

answer:【 】 ✴ Specify which sentence (number ID) from the prepared Future Prediction Support

Sentences was most useful in making the above decision:【 】

(2) Civil law revision in acknowledgment of a separate surname for a married couple: (a) be enacted (b) not be enacted

answer:【 】 ✴ Specify which sentence (number ID) from the prepared Future Prediction Support

Sentences was most useful in making the above decision:【 】

Question form (1)

15

Page 16: Application of Future Sentence Reference Extraction in ...arakilab.media.eng.hokudai.ac.jp/~ptaszynski/data/ijcai16-WS12_fina… · Evaluation • 270 sentences from Mainichi Newspaper

16

(A) The US forces will be still present and further reinforced comparing to October 2009. (B) The US forces will be still present on similar level comparing to October 2009. (C) The US forces will be still present but in decreased number comparing to October 2009. (D) The US forces will be completely withdrawn. Answer: [ 1st candidate: / 3rd candidate: / 2nd candidate: ]

✴ Specify which sentence (number ID) from the prepared Future Prediction Support Sentences was most useful in making the above decision:

1st candidate: [ ] 2nd candidate: [ ] 3rd candidate: [ ]

Question form (2)

Question3: Predict the stationing status of US forces in Afghanistan at the end of June 2011.

16

Page 17: Application of Future Sentence Reference Extraction in ...arakilab.media.eng.hokudai.ac.jp/~ptaszynski/data/ijcai16-WS12_fina… · Evaluation • 270 sentences from Mainichi Newspaper

17

Some examples show research participants FPSSs for question3

No. FPSS

1 Other newspapers are also carrying the Mainichi Newspaper’s three-part feature on trilateral

coordination between Japan, Korea, and the US on North Korean nuclear arms, cooperation between Japan and Korea on reconstruction aid to Afghanistan, and the establishment of regular meetings or “shuttle diplomacy” between the respective leaders of these countries.

2 Additionally, it revealed their intention to finish the Iraq War through the gradual withdrawal of US

combat troops stationed there, and put full force into the War on Terror in Afghanistan.

3 Substantial negotiations toward realizing the campaign pledge to reduce the number of stationed US

forces “within 16 months of inauguration” have begun, aiming for an early formulation of a comprehensive plan that includes sending more U.S. troops to Afghanistan, a key battleground in the War on Terror.

4 Ahmad Saif (29), an engineer in Baghdad, rejoiced that President Obama had reemphasized the need

to focus on the War on Terror in Afghanistan, increasing the likelihood of an early withdrawal of U.S. troops from Iraq.

5 At a cabinet-level meeting between Finance and Foreign Ministers of each country, in addition to

steps to be taken on the deterioration of public order in Afghanistan caused by formerly dominant Taliban forces, the agenda featured discussion of water resource development policies in response to the ongoing drought, and negotiations over assistance measures.

17

Page 18: Application of Future Sentence Reference Extraction in ...arakilab.media.eng.hokudai.ac.jp/~ptaszynski/data/ijcai16-WS12_fina… · Evaluation • 270 sentences from Mainichi Newspaper

18

corr

ect

accu

racy

rat

e

0

22.5

45

67.5

90

average highest lowest

6.7

61.1

33.4

14.3

85.7

42.9

Supporting Future Trend PredictionThe Future Prediction Competence Test

Experiment Result (1)

comparison the correct accuracy rate between this experiment and FPCT

18

Page 19: Application of Future Sentence Reference Extraction in ...arakilab.media.eng.hokudai.ac.jp/~ptaszynski/data/ijcai16-WS12_fina… · Evaluation • 270 sentences from Mainichi Newspaper

19

Experiment Result (2)

comparison pass the grade between this experiment and FPCT

The

pass

ing

rate

0

15

30

45

60

on this Experiment on the FPCT

18.2

13.3

20.0

18.220.0

1st grade 2nd grade 3rd grade

53.3%

36.4%

<passing line>1st grade: over 50%2nd grade: over 40%3rd grade: over 30%

19

Page 20: Application of Future Sentence Reference Extraction in ...arakilab.media.eng.hokudai.ac.jp/~ptaszynski/data/ijcai16-WS12_fina… · Evaluation • 270 sentences from Mainichi Newspaper

0

2.5

5

7.5

10

FPSS Number1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

Number of Correct Number of Incorrect

20

number of referred each FPSS when question3 was answerd.

Experiment Result (3)

20

Page 21: Application of Future Sentence Reference Extraction in ...arakilab.media.eng.hokudai.ac.jp/~ptaszynski/data/ijcai16-WS12_fina… · Evaluation • 270 sentences from Mainichi Newspaper

21

Conclusion

To verify practical effectiveness of our method for predicting future events, we performed the experiment in which 30 participants answered the questions excerpted from FPCT questions in 2009 by only reading FPSS. We also compared the results to the result of original FPCT.

• An average of 10 % improvement over the results of the original FPCT. • Background knowledge may influence it. • The cost such reference information and time for answering can be reduced. • Some sentences are helpful for predicting, some are not.

21

Page 22: Application of Future Sentence Reference Extraction in ...arakilab.media.eng.hokudai.ac.jp/~ptaszynski/data/ijcai16-WS12_fina… · Evaluation • 270 sentences from Mainichi Newspaper

22

Future Works

• Extract FPSS with other corpora (other newspaper, column is wrote by expert, …)

• Apply to real-world problems (company management, economic trend, …)

• Analyze useful Future Predicting Supporting Sentences. ‣sofisticate FPSS.

• Review about the experiment setting. ‣no background knowledge effects.

22

Page 23: Application of Future Sentence Reference Extraction in ...arakilab.media.eng.hokudai.ac.jp/~ptaszynski/data/ijcai16-WS12_fina… · Evaluation • 270 sentences from Mainichi Newspaper

23

Application of Future Sentence Reference Extraction in Support of Future Event Prediction

*Yoko NakajimaMichal PtaszynskiHirotoshi Honma

Fumito Masui

International Workshop on Language Sense on Computer in IJCAI2016

23

Page 24: Application of Future Sentence Reference Extraction in ...arakilab.media.eng.hokudai.ac.jp/~ptaszynski/data/ijcai16-WS12_fina… · Evaluation • 270 sentences from Mainichi Newspaper

24

Preliminary investigation on temporal expressions

• 270 sentences randomly collect from newspapers

Future Expression

number of types

wordsfrequency: =1

frequency: >2

temporal 82

in the days ahead, in Y years, next month, within the next Y, after Y years, at an early date,…

80% 20%

word 256

aim to(mezasu), plan to (hoshin, mitooshi)increase(fuyasu), bring ~ to (tonyusuru), a possibility(kanouseiga are), …Fr

equency(times)

0

2

4

6

8

10

12

14

No. of Future Expression Lexicon1 100 200 300 33945.3% 54.7%

24