modeling empathy and distress in reaction to news stories€¦ · empathy tips for engineers. emnlp...

39
EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 1 Sven Buechel 2* Modeling Empathy and Distress in Reaction to News Stories Anneke Buffone 1* Barry Slaff 1 Lyle Ungar 1 João Sedoc 1 World Well-Being Project University of Pennsylvania http://www.wwbp.org 1 JULIE Lab Friedrich-Schiller-Universität Jena http://www.julielab.de 2 * equal contribution

Upload: others

Post on 14-Aug-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 1

Sven Buechel 2*

Modeling Empathy and Distress in Reaction to News Stories

Anneke Buffone 1*

Barry Slaff 1 Lyle Ungar 1 João Sedoc 1

World Well-Being ProjectUniversity of Pennsylvaniahttp://www.wwbp.org

1 JULIE LabFriedrich-Schiller-Universität Jenahttp://www.julielab.de

2

* equal contribution

Page 2: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 2

https://xkcd.com/660/ CC-BY-NC 2.5

Page 3: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 3

https://xkcd.com/660/ CC-BY-NC 2.5

EMPATHY TIPS FOR ENGINEERS

Page 4: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 4

Why Empathy?

• Crucial for how we experience the world and communicate within it

• Great potential for social media analysis

• Scarce work on empathy modeling in written language

Page 5: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 5

Why Empathy?

• Crucial for how we experience the world

and communicate within it

• Great potential for social media analysis

• Scarce work on empathy modeling in written language

Page 6: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 6

Shortcomings of Prior Work

• No publicly available gold standardØ First publicly available gold standard (CC-BY)

• 3rd person ground truthØAnnotations by the experiencer

(new annotation methodology)

• Disconnected from psychological theory and researchØ Distinguish two different types of empathy

(in line with psych. research)

Page 7: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 7

Contribution

• No publicly available gold standard

Ø First publicly available gold standard (CC-BY)

• 3rd person ground truth

ØAnnotations by the experiencer(new annotation methodology)

• Disconnected from psychological theory and research

Ø Distinguish two different types of empathy(in line with psych. research)

Page 8: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 8

Contribution

• No publicly available gold standardØ First publicly available gold standard (CC-BY)

• 3rd person ground truthØAnnotations by the experiencer

(new annotation methodology)

• Disconnected from psychological theory and researchØ Distinguish two different types of empathy

(in line with psych. research)

Page 9: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 9

Contribution

• No publicly available gold standard

Ø First publicly available gold standard (CC-BY)

• 3rd person ground truth

ØAnnotations by the experiencer(new annotation methodology)

• Disconnected from psychological theory and research

Ø Distinguish two different types of empathy(in line with psych. research)

Page 10: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 10

Contribution

• No publicly available gold standardØ First publicly available gold standard (CC-BY)

• 3rd person ground truthØAnnotations by the experiencer

(new annotation methodology)

• Disconnected from psychological theory and researchØDistinguish two different types of empathy

(in line with psych. research)

Page 11: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 11

Empathic Concern vs. Personal Distress

• Consensus in psych. that there are multiple forms of empathy

• We follow most popular distinction by Batson et al. (1987)– Empathic Concern

positive, other-focused

– Personal Distressnegative, self-focused

• Post-hoc analysis shows both are distinct (r=.45)

Page 12: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 12

Empathic Concern vs. Personal Distress

• Consensus in psych. that there are multiple forms of empathy

• We follow most popular distinction by Batson et al. (1987)– Empathic Concern

positive, other-focused

– Personal Distressnegative, self-focused

• Post-hoc analysis shows that both are distinct (r=.45)

Page 13: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 13

Problem Definition

Message Empathy DistressI‘m sorry to hear about Dakota‘s parents. [...] 4.8 3.1

• Given a natural language utterance ...

• ... predict empathy and distress of the writer from [1, 7]

Page 14: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 14

Annotation MethodologyTraditional

Page 15: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 15

Annotation MethodologyTraditional

r1 r2 r3

rfinal

Page 16: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 16

Annotation MethodologyTraditional

r1 r2 r3

rfinalExperiencer

• Annotators “guess“ experiencer‘s feelings

• Models biasedtowards annotators

Page 17: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 17

Annotation Methodology

Traditional

Proposed

r1 r2 r3

Stimulus Experiencer

rfinalExperiencer

• Annotation as psych.

experiment

• Experiencer produces

message and ratings

• Reliable through multi-

item scales

• Annotators “guess“

experiencer‘s feelings

• Models biased

towards annotators

Page 18: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 18

Annotation MethodologyTraditional

Proposed

r1 r2 r3

Stimulus Experiencer

rfinal

r

Experiencer

• Annotation as psych. experiment

• Experiencer produces message and rating

• Reliable through multi-item scales

• Annotators “guess“ experiencer‘s feelings

• Models biasedtowards annotators

Page 19: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 19

Annotation Methodology

Traditional

Proposed

r1 r2 r3

Stimulus Experiencer

rfinal

r

Experiencer

• Annotation as psych.

experiment

• Experiencer produces

message and rating

• Reliable through multi-

item scales

• Annotators “guess“

experiencer‘s feelings

• Models biased

towards annotators

Page 20: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 20

Annotation MethodologyTraditional

Proposed

r1 r2 r3

Stimulus Experiencer

rfinal

r1 r2 r3 rfinal

Experiencer

• Annotation as psych. experiment

• Experiencer produces message and rating

• Reliable through multi-item scales

• Annotators “guess“ experiencer‘s feelings

• Models biasedtowards annotators

Page 21: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 21

Use of Multi-Item Scales

Page 22: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 22

Use of Multi-Item ScalesM

eanEm

path

icCo

ncer

n

Page 23: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 23

Corpus Creation Process

crowdworkers

Qualtricsonline survey

final corpusstimulus

upload manual review

take

1680 (M, E, D)-triples418 online news articles

- read articles- rate Empathy/Distress- write Message(300 to 800 chars)

Page 24: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 24

Corpus Creation Process

crowdworkers

Qualtricsonline survey

final corpusstimulus

upload manual review

take

1680 (M, E, D)-triples418 online news articles

- read articles- rate Empathy/Distress- write Message(300 to 800 chars)

Split-half reliability around r=.9 for both empathy and distress J

Page 25: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 25

Baseline Models for Empathy and Distress

• Models– Ridge Regression– Feed-Forward Net– Convolutional Neural Net (1 conv layer, filter sizes 1-3, 100 channels each)– RNN-type architectures did not work because of long sequences

• Features: FastText embeddings pre-trained on Common Crawl

• 10-fold cross-validation

• Correlation values around r = .4

• Ridge is viable but CNN significantly outperforms the others

Page 26: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 26

Baseline Models for Empathy and Distress

• Models– Ridge Regression– Feed-Forward Net– Convolutional Neural Net (1 conv layer, filter sizes 1-3, 100 channels each)– RNN-type architectures did not work because of long sequences

• Features: FastText embeddings pre-trained on Common Crawl

• 10-fold cross-validation

• Correlation values around r = .4

• Ridge is viable but CNN significantly outperforms the others

Page 27: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 27

Baseline Models for Empathy and Distress

• Models– Ridge Regression– Feed-Forward Net– Convolutional Neural Net (1 conv layer, filter sizes 1-3, 100 channels each)– RNN-type architectures did not work because of long sequences

• Features: FastText embeddings pre-trained on Common Crawl

• 10-fold cross-validation

• Correlation values around r = .4

• Ridge is viable but CNN significantly outperforms the others

Page 28: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 28

Conclusion

• Social media offers great opportunity to study empathy

• Modeling empathy received little attention for written language

• We presented the first publicly available gold standardhttps://github.com/wwbp/empathic_reactions

• We distinguish Empathic Concern and Personal Distress

• New annotation methodology collects reliable ratingsfrom the experiencer using multi-item scales

Page 29: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 29

Sven Buechel *

Modeling Empathy and Distress in Reaction to News Stories

Anneke Buffone *

Barry Slaff Lyle Ungar João Sedoc

https://xkcd.com/660/ CC-BY-NC 2.5

EMPATHY TIPS FOR ENGINEERS

Page 30: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 30

Backup Slides

Page 31: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 31

Previous Work

spoken written

What we talk about todayXiao et al. (2012)Gibson et al. (2015) Khanpour et al. (2017)

Page 32: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 32

Exemplary Entries

Empathy Distress Message

4.8 3.1

I‘m sorry to hear about Dakota‘s parents. No one wantsthat to happen and it‘s unfortunate that her parentscouldn‘t work it out. I hope they are able to still remaincivil around the kids and family. [...]

4.0 5.5

Here‘s an article about [a] crazed person who murderedtwo unfortunate women overseas. Life is crazy. [...] Itfeels like there‘s on place safe in this world to be a woman sometimes.

1.0 1.3

I just read an article about some chowder-head who useda hammer and a pick ax to destroy Donald Trump‘s staron the Hollywood walk of fame. [...] Lol, can you believethis garbage? Who has such a hollow and pathetic lifethat they don‘t have anything better to do with their time than commit petty vandalism because they dislike somepolitician? [...]

Page 33: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 33

Split-Half Reliability (SHR)

• Based on Pearson correlation r

• Flexible (works with crowdsourcing and best-worst scaling)

• Most popular in psychology

• Increasingly popular within CL (Mohammad et al.)

i1 i2 i3 i4 i5 i6d1d2d3d4d5d6

Page 34: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 34

Split-Half Reliability (SHR)

i1 i4 i5d1d2d3d4d5d6

i2 i3 i6d1d2d3d4d5d6

• Based on Pearson correlation r

• Flexible (works with crowdsourcing and best-worst scaling)

• Most popular in psychology

• Increasingly popular within CL (Mohammad et al.)

Page 35: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 35

Split-Half Reliability (SHR)

d1d2d3d4d5d6

d1d2d3d4d5d6

• Based on Pearson correlation r

• Flexible (works with crowdsourcing and best-worst scaling)

• Most popular in psychology

• Increasingly popular within CL (Mohammad et al.)

Page 36: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 36

Bivariate Rating Distribution

• Good coverage of the full range of rating scales

• Empathy and distress are distinct (moderate correlation of r=.45)

Page 37: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 37

Models

• Ridge Regression

• Feed-Forward Net– two layers, 256 and 128 units

• Convolutional Neural Net– one conv layer– filter sizes 1, 2, 3– 100 output channels– average pooling– dense layer (128)

Page 38: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 38

Experimental Setup

• Features: FastText embeddings pre-trained on Common Crawl

• Train distinct models for empathy and distress

• Exclude 10% of data for dev experiments

• 10-fold CV on remaining data

• Evaluate with Pearson correlation r

Page 39: Modeling Empathy and Distress in Reaction to News Stories€¦ · EMPATHY TIPS FOR ENGINEERS. EMNLP 2018 Brussels, Belgium, November 4, 2018 Sven Buechel, Anneke Buffone, Barry Slaff,

EMNLP 2018 Brussels, Belgium, November 4, 2018

Sven Buechel, Anneke Buffone, Barry Slaff, Lyle Ungar, and João Sedoc Modeling Empathy and Distress in Reaction to News Stories 39

Results

• Ridge regression is viable option, outperforms FFN

• CNN significantly outperforms Ridge and FFN(* two-tailed paired t-test; p < .05)

design complexity. Distinct models were trainedfor empathy and distress prediction.

First, ten percent of our newly created goldstandard were randomly sampled to be used indevelopment experiments. Then, the main ex-periment was conducted using 10-fold cross-validation (CV), providing each model with iden-tical train-test splits to increase reliability. The devset was excluded for the CV experiment.

Model performance is measured in terms ofPearson correlation r between predicted valuesand the human gold ratings. Thus, we phrase theprediction of empathy and distress as regressionproblems.

The input to our models is based on wordembeddings, namely the publicly available Fast-Text embeddings which were trained on CommonCrawl (⇡600B tokens) (Bojanowski et al., 2017;Mikolov et al., 2018).

Ridge. Our first approach is Ridge regression,an `2-regularized version of linear regression. Thecentroid of the word embeddings of the words in amessage is used as features (embedding centroid).The regularization coefficient ↵ is automaticallychosen from {1, .5, .1, ..., .0001} during training.

FFN. Our second approach is a Feed-ForwardNet with two hidden layers (256 and 128 units, re-spectively) with ReLU activation. Again, the em-bedding centroid is used as features.

CNN. The last approach is a ConvolutionalNeural Net.5 We use a single convolutional layerwith filter sizes 1 to 3, each with 100 output chan-nels, followed by an average pooling layer and adense layer of 128 units. ReLUs were used for theconvolutional and again for the dense layer.

Both deep learning models were trained usingthe Adam optimizer (Kingma and Ba, 2015) witha fixed learning rate of 10�3 and a batch size of32. We trained for a maximum of 200 epochs yetapplied early stopping if the performance on thevalidation set did not improve for 20 consecutiveepochs. We applied dropout with probabilities of.2, .5 and .5 on input, dense and pooling layers,respectively. Moreover `2 regularization of .001was applied to the weights of conv and dense lay-ers. Word embeddings were not updated.

The results are provided in Table 2. As can beseen, all of our models achieve satisfying perfor-mance figures ranging between r=.379 and .444,

5 Recurrent models did not perform well during develop-ment due to high sequence length.

Empathy Distress Mean

Ridge .385 .410 .398FFN .379 .401 .390CNN .404* .444* .424*

Table 2: Model performance for predicting empathyand distress in Pearson’s r; with row-wise mean; bestresult per column in bold, significant (p < .05) im-provement over other models marked with ‘*’.

given the assumed difficulty of the task (see Sec-tion 3). On average over the two target vari-ables, the CNN performs best, followed by Ridgeand the FFN. While the CNN significantly outper-forms the other models in every case, the differ-ences between Ridge and the FFN are not statis-tically significant for either empathy or distress.6

The improvements of the CNN over the other twoapproaches are much more pronounced for dis-tress than for empathy. Since only the CNN isable to capture semantic effects from composi-tion and word order, our data suggest that thesephenomena are more important for predicting dis-tress, whereas lexical features alone already per-form quite well for empathy.

Discussion. In comparison to closely relatedtasks such as emotion prediction (Mohammad andBravo-Marquez, 2017a) our performance figuresfor empathy and distress prediction are generallylower. However, given the small amount of previ-ous work for the problem at hand, we argue thatour results are actually quite strong. This becomesobvious, again, in comparison with emotion anal-ysis where early work achieved correlation valuesaround r=.3 at most (Strapparava and Mihalcea,2007). Yet state-of-the-art performance literallydoubled over the last decade (Beck, 2017), in partdue to much larger training sets.

Comparison to the limited body of previouswork in text-based empathy prediction is diffi-cult for a number of reasons, e.g., differences indomain, evaluation metric, as well as methodol-ogy and linguistic level of annotation. Khanpouret al. (2017) annotate and model empathy in onlinehealth communities on the sentence-level, whereasthe instances in our corpus are much longer andcomprise multiple sentences. In contrast to ourwork, they treat empathy prediction as a classifi-cation problem. Their best performing model, aCNN-LSTM, achieves an F-score of .78. Gibson

6We use a two-tailed t-test for paired samples based onthe results of the individual CV runs; p < .05.