cs 124/linguist 180 from languages to information · cs 124/linguist 180 from languages to...

138
CS 124/LINGUIST 180 From Languages to Information Detecting Social and Affective Meaning Dan Jurafsky Stanford University

Upload: others

Post on 11-Aug-2020

8 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

CS 124/LINGUIST 180From Languages to Information

Detecting(Social(and(Affective(Meaning

Dan(JurafskyStanford(University

Page 2: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Affective(meaning

• Drawing(on(literatures(in• affective(computing((Picard(95)• linguistic(subjectivity((Wiebe and(colleagues)• social(psychology((Pennebaker and(colleagues)

• Can(we(model(the(lexical(semantics(relevant(to:• sentiment• emotion• personality• mood(• attitudes

2

Page 3: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Why(compute(affective(meaning?• Detecting:

• sentiment(towards(politicians,(products,(countries,(ideas• frustration(of(callers(to(a(help(line• stress(in(drivers(or(pilots• depression(and(other(medical(conditions• confusion(in(students(talking(to(eJtutors• emotions(in(novels((e.g.,(for(studying(groups(that(are(feared(over(time)

• Could(we(generate:• emotions(or(moods(for(literacy(tutors(in(the(children’s(storybook(domain• emotions(or(moods(for(computer(games• personalities(for(dialogue(systems(to(match(the(user

Page 4: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Scherer’s(typology(of(affective(statesEmotion:(relatively(brief(episode(of(synchronized(response(of(all(or(most(organismic(subsystems(in(response(to(the(evaluation(of(an(event(as(being(of(major(significance

angry,(sad,(joyful,(fearful,(ashamed,(proud,(desperate

Mood:(diffuse(affect(state(…change(in(subjective(feeling,(of(low(intensity(but(relatively(long(duration,(often(without(apparent(cause

cheerful,(gloomy,(irritable,(listless,(depressed,(buoyant

Interpersonal(stance:(affective(stance(taken(toward(another(person(in(a(specific(interaction,(coloring(the(interpersonal(exchange

distant,(cold,(warm,(supportive,(contemptuous

Attitudes:(relatively(enduring,(affectively(colored(beliefs,(preferences(predispositions(towards(objects(or(persons(

liking,(loving,(hating,(valuing,(desiring

Personality(traits:(emotionally(laden,(stable(personality(dispositions(and(behavior(tendencies,(typical(for(a(person

nervous,(anxious,(reckless,(morose,(hostile,(envious,(jealous

Page 5: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Detecting(Social(and(Affective(Meaning

Reminder:(Sentiment(Lexicons

Page 6: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Scherer’s(typology(of(affective(statesEmotion:(relatively(brief(episode(of(synchronized(response(of(all(or(most(organismic(subsystems(in(response(to(the(evaluation(of(an(event(as(being(of(major(significance

angry,(sad,(joyful,(fearful,(ashamed,(proud,(desperate

Mood:(diffuse(affect(state(…change(in(subjective(feeling,(of(low(intensity(but(relatively(long(duration,(often(without(apparent(cause

cheerful,(gloomy,(irritable,(listless,(depressed,(buoyant

Interpersonal(stance:(affective(stance(taken(toward(another(person(in(a(specific(interaction,(coloring(the(interpersonal(exchange

distant,(cold,(warm,(supportive,(contemptuous

Attitudes:(relatively(enduring,(affectively(colored(beliefs,(preferences(predispositions(towards(objects(or(persons(

liking,(loving,(hating,(valuing,(desiring

Personality(traits:(emotionally(laden,(stable(personality(dispositions(and(behavior(tendencies,(typical(for(a(person

nervous,(anxious,(reckless,(morose,(hostile,(envious,(jealous

Page 7: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

The(General(Inquirer

• Home(page:(http://www.wjh.harvard.edu/~inquirer• List(of(Categories:( http://www.wjh.harvard.edu/~inquirer/homecat.htm

• Spreadsheet:(http://www.wjh.harvard.edu/~inquirer/inquirerbasic.xls• Categories:

• Positiv (1915(words)(and(Negativ (2291(words)• Strong(vs Weak,(Active(vs Passive,(Overstated(versus(Understated• Pleasure,(Pain,(Virtue,(Vice,(Motivation,(Cognitive(Orientation,(etc

• Free(for(Research(Use

Philip(J.(Stone,(Dexter(C(Dunphy,(Marshall(S.(Smith,(Daniel(M.(Ogilvie.(1966.(The(General(Inquirer:(A(Computer(Approach(to(Content(Analysis.(MIT(Press

Page 8: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

LIWC((Linguistic(Inquiry(and(Word(Count)Pennebaker,(J.W.,(Booth,(R.J.,(&(Francis,(M.E.((2007).(Linguistic(Inquiry(and(Word(Count:(LIWC(2007.(Austin,(TX

• Home(page:(http://www.liwc.net/• 2300(words,(>70(classes• Affective(Processes

• negative(emotion((bad,%weird,%hate,%problem,%tough)• positive(emotion((love,%nice,%sweet)

• Cognitive(Processes• Tentative((maybe,%perhaps,%guess),(Inhibition((block,%constraint)

• Pronouns,(Negation((no,%never),(Quantifiers((few,%many)(• $30(or($90(fee

Page 9: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

MPQA(Subjectivity(Cues(Lexicon

• Home(page:(http://www.cs.pitt.edu/mpqa/subj_lexicon.html• 6885(words(from(8221(lemmas

• 2718(positive• 4912(negative

• Each(word(annotated(for(intensity((strong,(weak)• GNU(GPL9

Theresa Wilson, Janyce Wiebe, and Paul Hoffmann (2005). Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis. Proc. of HLT-EMNLP-2005.

Riloff and Wiebe (2003). Learning extraction patterns for subjective expressions. EMNLP-2003.

Page 10: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Bing(Liu(Opinion(Lexicon

• Bing(Liu's(Page(on(Opinion(Mining• http://www.cs.uic.edu/~liub/FBS/opinionJlexiconJEnglish.rar

• 6786(words• 2006(positive• 4783(negative

10

MinqingHu(and(Bing(Liu.(Mining(and(Summarizing(Customer(Reviews.(ACM(SIGKDDJ2004.

Page 11: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

SentiWordNetStefano(Baccianella,(Andrea(Esuli,(and(Fabrizio(Sebastiani.(2010(SENTIWORDNET(3.0:(An(Enhanced Lexical Resource(for(Sentiment Analysis(and(Opinion(Mining.(LRECJ2010

• Home(page:(http://sentiwordnet.isti.cnr.it/• All(WordNet synsets automatically(annotated(for(degrees(of(positivity,(

negativity,(and(neutrality/objectiveness• [estimable(J,3)](“may(be(computed(or(estimated”(

Pos 0 Neg 0 Obj 1 • [estimable(J,1)](“deserving(of(respect(or(high(regard”(

Pos .75 Neg 0 Obj .25

Page 12: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Detecting(Social(and(Affective(Meaning

Sentiment(Lexicons

Page 13: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Detecting(Social(and(Affective(Meaning

Emotion(

Page 14: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Scherer’s(typology(of(affective(statesEmotion:(relatively(brief(episode(of(synchronized(response(of(all(or(most(organismic(subsystems(in(response(to(the(evaluation(of(an(event(as(being(of(major(significance

angry,(sad,(joyful,(fearful,(ashamed,(proud,(desperate

Mood:(diffuse(affect(state(…change(in(subjective(feeling,(of(low(intensity(but(relatively(long(duration,(often(without(apparent(cause

cheerful,(gloomy,(irritable,(listless,(depressed,(buoyant

Interpersonal(stance:(affective(stance(taken(toward(another(person(in(a(specific(interaction,(coloring(the(interpersonal(exchange

distant,(cold,(warm,(supportive,(contemptuous

Attitudes:(relatively(enduring,(affectively(colored(beliefs,(preferences(predispositions(towards(objects(or(persons(

liking,(loving,(hating,(valuing,(desiring

Personality(traits:(emotionally(laden,(stable(personality(dispositions(and(behavior(tendencies,(typical(for(a(person

nervous,(anxious,(reckless,(morose,(hostile,(envious,(jealous

Page 15: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Two(families(of(theories(of(emotion

• Atomic(basic(emotions• A(finite(list(of(6(or(8,(from(which(others(are(generated

• Dimensions(of(emotion• Valence((positive(negative)• Arousal((strong,(weak)• Control

15

Page 16: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Ekman’s(6(basic(emotions:Surprise,(happiness,(anger,(fear,(disgust,(sadness

Page 17: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Valence/Arousal-Dimensions

High(arousal,(low(pleasure High(arousal,(high(pleasureanger excitement

Low(arousal,(low(pleasure((((((((((((((((((((((Low(arousal,(high(pleasuresadness relaxation

arou

sal

valence

Page 18: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Atomic-units-vs.-Dimensions

Distinctive

• Emotions(are(units.• Limited(number(of(basic(

emotions.• Basic(emotions(are(innate(and(

universal

Dimensional

• Emotions(are(dimensions.• Limited(#(of(labels(but(

unlimited(number(of(emotions.

• Emotions(are(culturally(learned.

Adapted%from%Julia%Braverman

Page 19: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

One-emotion-lexicon-from-each-paradigm!

1. 8(basic(emotions:• NRC(WordJEmotion(Association(Lexicon((Mohammad(and(Turney 2011)

2. Dimensions(of(valence/arousal/dominance• Warriner,(A.(B., Kuperman,(V.,(and(Brysbaert,(M.((2013)

• Both(built(using(Amazon(Mechanical(Turk

19

Page 20: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Plutchick’s wheel-of-emotion

20

• 8(basic(emotions• in(four(opposing(pairs:• joy–sadness(• anger–fear• trust–disgust• anticipation–surprise(

Page 21: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

NRC-WordEEmotion-Association-Lexicon

21

Mohammad(and(Turney 2011

• 10,000(words(chosen(mainly(from(earlier(lexicons• Labeled(by(Amazon(Mechanical(Turk• 5(Turkers per(hit• Give(Turkers an(idea(of(the(relevant(sense(of(the(word• Result:

amazingly anger 0amazingly anticipation 0amazingly disgust 0amazingly fear 0amazingly joy 1amazingly sadness 0amazingly surprise 1amazingly trust 0amazingly negative 0amazingly positive 1

Page 22: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

The-AMT-Hit

22 …

Page 23: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Lexicon-of-valence,-arousal,-and-dominance

• Warriner,(A.(B., Kuperman,(V.,(and(Brysbaert,(M.((2013). Norms(of(valence,(arousal,(and(dominance(for(13,915(English(lemmas. Behavior%Research%Methods%45,%1191B1207.

• Supplementary(data: This(work(is(licensed(under(a Creative(Commons(AttributionJNonCommercialJNoDerivs(3.0(Unported(License.

• Ratings-for-14,000-words-for-emotional-dimensions:• valence (the(pleasantness(of(the(stimulus)(• arousal (the(intensity(of(emotion(provoked(by(the(stimulus)• dominance (the(degree(of(control(exerted(by(the(stimulus)(

23

Page 24: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Lexicon-of-valence,-arousal,-and-dominance• valence (the(pleasantness(of(the(stimulus)(

9:(happy,(pleased,(satisfied,(contented,(hopeful(1:(unhappy,(annoyed,(unsatisfied,(melancholic,(despaired,(or(bored(

• arousal (the(intensity(of(emotion(provoked(by(the(stimulus)9:(stimulated,(excited,(frenzied,(jittery,(wideJawake,(or(aroused1:(relaxed,(calm,(sluggish,(dull,(sleepy,(or(unaroused;

• dominance (the(degree(of(control(exerted(by(the(stimulus)(9:(in(control,(influential,(important,(dominant,(autonomous,(or(controlling1:(controlled,(influenced,(caredJfor,(awed,(submissive,(or(guided

• Again(produced(by(AMT

24

Page 25: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Lexicon-of-valence,-arousal,-and-dominance:Examples

Valence Arousal Dominance

vacation 8.53 rampage 7.56 self 7.74happy 8.47 tornado 7.45 incredible 7.74whistle 5.7 zucchini 4.18 skillet 5.33conscious 5.53 dressy 4.15 concur 5.29torture 1.4 dull 1.67 earthquake 2.14

25

Page 26: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Detecting-Social-and-Affective-Meaning

Other(Useful(Lexicons

Page 27: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Concreteness-versus-abstractness• The(degree(to(which(the(concept(denoted(by(a(word(refers(to(a(perceptible(entity.

• Do(concrete(and(abstract(words(differ(in(connotation?• Storage(and(retrieval?• Bilingual(processing?• Relevant(for(embodied( view(of(cognition((Barsalou 1999(inter(alia)

• Do(concrete(words(activate(brain(regions(involved( in(relevant(perception

• Brysbaert,(M.,(Warriner,(A.(B.,(and Kuperman,(V.((2014) Concreteness(ratings(for(40(thousand(generally(known(English(word(lemmasBehavior%Research%Methods%46,%

904B911.

• Supplementary(data: This(work(is(licensed( under(a Creative(Commons(AttributionJNonCommercialJNoDerivs( 3.0(Unported(License.

• 37,058(English(words(and(2,896(twoJword(expressions(((“zebra(crossing”(and(“zoom(in”),(

• Rating(from(1((abstract)(to(5((concrete)• Calibrator(words:• shirt,(infinity,(gas,(grasshopper,(marriage,(kick,(polite,(whistle,(theory,(and(sugar(27

Page 28: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Concreteness-versus-abstractness• Brysbaert,(M.,(Warriner,(A.(B.,(and Kuperman,(V.((2014) Concreteness(ratings(for(40(thousand(

generally(known(English(word(lemmasBehavior%Research%Methods%46,%904B911.

• Supplementary(data: This(work(is(licensed( under(a Creative(Commons(AttributionJNonCommercialJNoDerivs( 3.0(Unported(License.

• Some(example(ratings(from(the(final(dataset(of(40,000(words(and(phrasesbanana 5bathrobe 5bagel 5brisk 2.5badass 2.5basically 1.32belief 1.19although 1.07

28

Page 29: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Perceptual-Strength-Norms

Connell(and(Lynott norms

29

However, when we examined the original norming instructions used to collect these norms, we found it questionable that participants would have simultaneously considered their sensory experience across all modalities and then managed to aggregate this experience into a single, composite rating per word. Instructions for concreteness ratings, for example, define concrete words as referring to “objects, materials, or persons” and abstract words as referring to something that “ cannot be experienced by the senses” (Paivio, Yuille & Madigan, 1968, p. 5). The resulting ratings, therefore, may reflect different decision criteria at the concrete and abstract ends of the scale, which is consistent with previous observations that the concreteness ratings scale has a bimodal distribution (e.g., Kousta et al., 2011). Imageability ratings are frequently used interchangeably with concreteness ratings (e.g., Binder et al., 2005; Sabsevitz et al., 2005) because of their high correlation and theoretical relationship in dual coding theory. Instructions for imageability ratings repeatedly refer to arousing a “mental image” (Paivio et al., 1968, p. 4), which is likely to lead naïve participants to focus on vision at the expense of other modalities. Both concreteness and imageability ratings could therefore add considerable noise to any dataset that assumed the ratings reflected a smooth continuum of perceptual experience across all modalities.

Our goals in the present paper were twofold. First, we aimed to establish whether concreteness and imageability norms actually reflect the degree with which concepts are perceptually experienced, as is commonly assumed. Second, we examined whether so-called concreteness effects in word processing are better predicted by concreteness/imageability ratings or by strength of perceptual experience. If the former, then forty years of empirical methodology have been validated but the reasons for null and reverse concreteness effects remain unclear. If the latter, then concreteness and imageability ratings are unsuitable for the tasks in which they are employed, and null and reverse concreteness effects are due to the unreliability of perceptual information in these ratings.

Experiment 1

Rather than ask participants to condense their estimations of sensory experience into a single concreteness or imageability rating, modality-specific norming asks people to rate how strongly they experience a variety of concepts using each perceptual modality in turn (i.e., auditory, gustatory, haptic, olfactory or visual: Lynott & Connell, 2009, in prep.; see also Connell & Lynott, 2010; Louwerse

& Connell, 2011).

If concreteness and imageability are a fair reflection of the degree of perceptual information in a concept, then ratings of perceptual strength in all five modalities should be positively related to concreteness and imageability ratings, and these relationships should remain consistent across the rating scale. On the other hand, if we were correct in our hypothesis to the contrary, then we would expect some perceptual modalities to be neglected (i.e., no relationship) or even misinterpreted (i.e., negative relationship) in concreteness and imageability ratings. Specifically, concreteness norming instructions may have led to different decision criteria and therefore distinctly different modality profiles at each end of scale, whereas imageability instructions may have led to a predominantly visual bias.

Method

Materials A total of 592 words were collated that represented the overlap of the relevant sets of norms, so each word had ratings of perceptual strength on five modalities as well as concreteness and imageability (see Table 1 for sample items). Perceptual strength norms came from Lynott and Connell (2009, in prep.), in which participants were asked to rate “to what extent do you experience WORD” (for nouns) or “to what extent do you experience something being WORD” (for adjectives) through each of the five senses (i.e., “by hearing”, “by tasting”, “by feeling through touch”, “by smelling” and “by seeing”), using separate rating scales for each modality. Perceptual strength ratings therefore took the form of a 5-value vector per word, ranging from 0 (low strength) to 5 (high strength). Concreteness ratings were taken from the MRC psycholinguistic database for 522 words, with ratings for the remaining 70 words coming from Nelson, McEvoy and Schreiber (2004). Imageability ratings for 524 words also came from the MRC database, and were supplemented with ratings for a further 68 words from Clark and Paivio (2004). All concreteness and imageability ratings emerged from the same instructions as Paivio et al.'s (1968) original norms, and ranged from 100 (abstract or low-imageability) to 700 (concrete or high-imageability).

Design & Analysis We ran stepwise regression analyses with either concreteness or imageability rating as the dependent variable, and ratings of auditory, gustatory, haptic, olfactory and visual strength as competing predictors. For analysis of consistency across the scales, each dependent variable was split at its midpoint before

Table 1: Sample words, used in Experiments 1 and 2, for which perceptual strength ratings [0-5] match or mismatch ratings

of concreteness and imageability [100-700].

Perceptual strength

Word Auditory Gustatory Haptic Olfactory Visual Concreteness Imageability

soap 0.35 1.29 4.12 4.00 4.06 589 600

noisy 4.95 0.05 0.29 0.05 1.67 293 138

atom 1.00 0.63 0.94 0.50 1.38 481 499

republic 0.53 0.67 0.27 0.07 1.79 376 356

1429

Microsoft Excel Worksheet

Page 30: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Detecting-Social-and-Affective-Meaning

Using(the(lexicons(to(detect(affect

Page 31: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Lexicons-for-detecting-document-affect:Simplest-unsupervised-method

• Sentiment:• Sum(the(weights(of(each(positive(word(in(the(document• Sum(the(weights(of(each(negative(word(in(the(document• Choose(whichever(value((positive(or(negative)((has(higher(sum

• Emotion:• Do(the(same(for(each(emotion(lexicon

31

Page 32: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Lexicons-for-detecting-document-affect:Simplest-supervised-method

• Build(a(classifier• Predict(sentiment((or(emotion,(or(personality)(given(features• Use(“counts(of(lexicon(categories”(as(a(features• Sample(features:• LIWC(category(“cognition”(had(count(of(7• NRC(Emotion(category(“anticipation”(had(count(of(2

• Baseline• Instead(use(counts(of(all the(words(and(bigrams(in(the(training(set• This(is(hard(to(beat• But(only(works(if(the(training(and(test(sets(are(very(similar32

Page 33: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Detecting-Social-and-Affective-Meaning

Prosody

Page 34: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Prosody

• Three(characteristics(of(speech:• pitch• energy• duration/rateJofJspeech

• That(play(a(role(in(conveying(meaning• And(especially(social(meaning

Page 35: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

USC’s-SAIL-LabShri-Narayanan-and-Dani-Byrd

Page 36: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Happy

Page 37: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Sad

37

Page 38: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Larynx and Vocal Folds• The(Larynx((voice(box)

• A(structure(made(of(cartilage(and(muscle• Located(above(the(trachea((windpipe)(and(below(the(pharynx((throat)• Contains(the(vocal(folds• (adjective(for(larynx:(laryngeal)

• Vocal(Folds((older(term:(vocal(cords)• Two(bands(of(muscle(and(tissue(in(the(larynx• Can(be(set(in(motion(to(produce(sound((voicing)

Text from slides by Sharon Rose

Page 39: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

The larynx, external structure, from front

Figure thanks to John Coleman!!

Page 40: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Vertical slice through larynx, as seen from back

Figure thnx to John Coleman!!

Page 41: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Voicing:•Air(comes(up(from(lungs•Forces(its(way(through(vocal(cords,(pushing(open((2,3,4)•This(causes(air(pressure(in(glottis(to(fall,(since:

• when(gas(runs(through(constricted(passage,(its(velocity(increases((Venturi tube-effect)• this(increase(in(velocity(results(in(a(drop(in(pressure((Bernoulli-principle)

•Because(of(drop(in(pressure,(vocal(cords(snap(together(again((6J10)•Single(cycle:(~1/100(of(a(second.

Figure & text from John Coleman�s web site

Page 42: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Voicelessness

• When(vocal(cords(are(open,(air(passes(through(unobstructed

• Voiceless(sounds:(p/t/k/s/f/sh/th/ch• If(the(air(moves(very(quickly,(the(turbulence(causes(a(different(kind(of(phonation:(whisper

Page 43: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Vocal folds open during breathing

From Mark Liberman�s web site, from Ultimate Visual Dictionary

Page 44: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Vocal Fold Vibration

UCLA Phonetics Lab Demo

Page 45: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Sound-waves-are-longitudinal-waves

Dan Russell Figure

Page 46: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

particle dispacment

pressure

Dan Russell Figure

Page 47: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Remember-High-School-PhysicsSimple-Period-Waves-(sine-waves)

Time (s)0 0.02

–0.99

0.99

0• Characterized by:• period: T• amplitude A• phase φ

• Fundamental frequencyin cycles per second, or Hz

• F0=1/T1 cycle

Page 48: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Simple-periodic-waves

• Computing(the(frequency(of(a(wave:• 5(cycles(in(.5(seconds(=(10(cycles/second(=(10(Hz

• Amplitude:• 1

• Equation:• Y(=(A(sin(2πft)

Page 49: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Speech-sound-waves

• A(little(piece(from(the(waveform(of(the(vowel([iy]• X(axis:(time.(• Y(axis:(

• Amplitude(=(air(pressure(at(that(time(• +:(compression• 0:(normal(air(pressure,(• J:(rarefaction

Page 50: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Back-to-waves:Fundamental-frequency

• Waveform(of(the(vowel([iy]

• Frequency:(10(repetitions(/(.03875(seconds(=(258(Hz• This(is(speed(that(vocal(folds(move,(hence(voicing• Each(peak(corresponds(to(an(opening(of(the(vocal(folds• The(low(frequency(of(the(complex(wave(is(called(the(

fundamental(frequency(of(the(wave(or(F0

.03875 0 Time in seconds

Page 51: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

F0-(informally:-pitch)

We(can(compute(F0(mean,(max,(min(for(each(turnAnd(the(standard(deviation(across(turns

Page 52: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Intensity

We(can(compute(intensity(mean,(max,(min(for(each(turnAnd(the(standard(deviation(across(turns

Page 53: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Detecting-Social-and-Affective-Meaning

Prosody

Page 54: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Detecting-Social-and-Affective-Meaning

Prosody(for(Emotion(Detection

Page 55: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Acted-speech:-Emotional-Prosody-Speech-and-Transcripts-Corpus-(EPSaT)

• Recordings(from(LDC(• http://www.ldc.upenn.edu/Catalog/LDC2002S28.html

• 8(actors(read(short(dates(and(numbers(in(15(emotional(styles

Slide(from(Jackson(Liscombe

Page 56: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

EPSaT-Examples

happysadangryconfidentfrustratedfriendlyinterested

Slide(from(Jackson(Liscombe

anxiousboredencouraging

Page 57: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Task-1

• Binary(classification• Detect(the(emotion(the(actor(was(instructed(to(use

Page 58: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Major-Problems-for-Classification:Different-Valence/Different-Activation

slide from Julia Hirschberg

Page 59: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

But….Different-Valence/-Same-Activation

slide from Julia Hirschberg

Page 60: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Extracting-audio-features

• OpenSmile• http://www.audeering.com/research/opensmile“Speech%&%Music%Interpretation%by%LargeBspace%Extraction”

• Praat

Page 61: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Scherer-summary:Prosodic-features-for-emotion

Page 62: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Detecting-Social-and-Affective-Meaning

Prosody(for(Emotion

Page 63: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Detecting-Social-and-Affective-Meaning

Personality(detection

Page 64: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Sample-affective-task:-personality-detection-

64

Page 65: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Scherer’s-typology-of-affective-statesEmotion:(relatively(brief(episode(of(synchronized(response(of(all(or(most(organismic(subsystems(in(response(to(the(evaluation(of(an(event(as(being(of(major(significance

angry,-sad,-joyful,-fearful,-ashamed,-proud,-desperate

Mood:(diffuse(affect(state(…change(in(subjective(feeling,(of(low(intensity(but(relatively(long(duration,(often(without(apparent(cause

cheerful,-gloomy,-irritable,-listless,-depressed,-buoyant

Interpersonal-stance:(affective(stance(taken(toward(another(person(in(a(specific(interaction,(coloring(the(interpersonal(exchange

distant,-cold,-warm,-supportive,-contemptuous

Attitudes:(relatively(enduring,(affectively(colored(beliefs,(preferences(predispositions(towards(objects(or(persons(

liking,-loving,-hating,-valuing,-desiring

Personality-traits:(emotionally(laden,(stable(personality(dispositions(and(behavior(tendencies,(typical(for(a(person

nervous,-anxious,-reckless,-morose,-hostile,-envious,-jealous

Page 66: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Personality

• The(internal(structures(and(propensities(that(explain(a(person’s(characteristic(patterns(of(thought,(emotion,(and(behavior.

• Personality(captures(what(people(are(like.

McGraw-Hill/Irwin Chapter 9

Page 67: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

67

The-Big-Five-Dimensions-of-Personality

Extraversion-vs.-Introversion-sociable,(assertive,(playful(vs.(aloof,(reserved,(shy

Emotional-stability-vs.-Neuroticismcalm,(unemotional(vs.(insecure,(anxious

Agreeableness-vs.-Disagreeable-friendly,(cooperative(vs.(antagonistic,(faultfinding

Conscientiousness-vs.-UnconscientiousselfJdisciplined,(organised vs.(inefficient,(careless

Openness-to-experience-intellectual,(insightful(vs.(shallow,(unimaginative

Page 68: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Big-Five-Personality:-Agreeableness

warm,(kind,(cooperative,(sympathetic,(helpful,(and(courteous.• Strong(desire(to(obtain(acceptance(in(personal(relationships(as(a(means(of(expressing(personality.

• Agreeable(people(focus(on(“getting(along,”(not(necessarily(“getting(ahead.”

McGraw-Hill/Irwin Chapter 9

Page 69: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Big-Five-Personality:-Extraversion

talkative,(sociable,(passionate,(assertive,(bold,(and(dominant• Easiest(to(judge(immediately(on(first(meeting• Prioritize(desire(to(obtain(power(and(influence(within(a(social(

structure(as(a(means(of(expressing(personality.• High(in(positive(affectivity(— a(tendency(to(experience(pleasant,(

engaging(moods(such(as(enthusiasm,(excitement,(and(elation.

McGraw-Hill/Irwin Chapter 9

Page 70: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Big-Five-Personality:-Neuroticism

• experience(unpleasant(moods:(hostility,(nervousness,(and(annoyance.

• more(likely(to(appraise(dayJtoJday(situations(as(stressful.• less(likely(to(believe(they(can(cope(with(the(stressors(that(they(

experience.• related(to(locus(of(control (attribute(causes(of(events(to(

themselves(or(to(the(external(environment)• Neurotics:(external(locus(of(control:(believe(that(the(events(that(occur(around(them(are(driven(by(luck,(chance,(or(fate.

• less(neurotic(people(hold(internal(locus(of(control:(believe(that(their(own(behavior(dictates(events. McGraw-Hill/Irwin Chapter 9

Page 71: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

External-and-Internal-Locus-of-Control

McGraw-Hill/Irwin Chapter 9

Page 72: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Big-Five-Personality:-Openness-to-Experience

curious,(imaginative,(creative,(complex,(sophisticated• Also(called(“Inquisitiveness”(or(“Intellectualness”• high(levels(of(creativity,(the(capacity(to(generate(novel(and(

useful(ideas(and(solutions.• Highly(open(individuals(are(more(likely(to(migrate(into(artistic(

and(scientific(fields.

McGraw-Hill/Irwin Chapter 9

Page 73: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Changes-in-Big-Five-Dimensions-Over-the-Life-Span

McGraw-Hill/Irwin Chapter 9

Page 74: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Aside:-Do-Animals-Have-Personalities?

• Gosling((1998)(studied(spotted(hyenas.(• 4(human(observers(rated(44(personality(traits(of(hyenas(• Ran(PCA(on(the(ratings• Five(dimensions:(Assertiveness,(Excitability,(HumanJDirected(Agreeableness,(Sociability,(and(Curiosity

• Related(to(3(human(dimensions:(neuroticism((excitability),(openness((curiosity),(agreeableness((sociability+agree)

Page 75: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Take-the-Big-Five-Inventory

http://www.outofservice.com/bigfive/

Page 76: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Various-text-corpora-labeled-for-personality-of-author

Pennebaker,(James(W.,(and(Laura(A.(King.(1999.("Linguistic(styles:( language(use(as(an(individual(difference."(Journal%of%personality%and%social%psychology 77,(no.(6.

• 2,479(essays(from(psychology(students((1.9(million(words),(“write(whatever(comes(into(your(mind”(for(20(minutes

Mehl,(Matthias( R,(SD(Gosling,(JW(Pennebaker.(2006.((Personality( in(its(natural(habitat:(manifestations( and(implicit( folk(theories( of(personality( in(daily(life.((Journal(of(personality(and(social(psychology(90((5),(862

• Speech(from(Electronically(Activated(Recorder((EAR)(• Random(snippets(of(conversation(recorded,(transcribed• 96(participants,(total(of(97,468(words(and(15,269(utterances

Schwartz,(H.(Andrew,(Johannes(C.(Eichstaedt,(Margaret(L.(Kern,(Lukasz(Dziurzynski,(Stephanie(M.(Ramones,(Megha Agrawal,(AchalShah(et(al.(2013.("Personality,(gender,(and(age(in(the(language(of(social(media:( The(openJvocabulary(approach."(PloS one 8,(no.(9(

• Facebook• 75,000(volunteers• 309(million(words• All(took(a(personality(test

Page 77: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Ears-(speech)-corpus-(Mehl et-al.)

Page 78: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Essays-corpus-(Pennebaker and-King)

Page 79: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Classifiers

• Mairesse,(François,(Marilyn(A.(Walker,(Matthias(R.(Mehl,(and(Roger(K.(Moore.("Using(linguistic(cues(for(the(automatic(recognition(of(personality(in(conversation(and(text."(Journal%of%artificial% intelligence%research (2007):(457J500.• Various(classifiers,(lexiconJbased(and(prosodic(features

• Schwartz,(H.(Andrew,(Johannes(C.(Eichstaedt,(Margaret(L.(Kern,(Lukasz(Dziurzynski,(Stephanie(M.(Ramones,(Megha Agrawal,(Achal Shah(et(al.(2013.("Personality,(gender,(and(age(in(the(language(of(social(media:(The(openJvocabulary(approach."(PloS one 8,(no.• regression(and(SVM,(lexiconJbased(and(allJwords

79

Page 80: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Sample-LIWC-FeaturesLIWC (Linguistic Inquiry and Word Count)

Pennebaker, J.W., Booth, R.J., & Francis, M.E. (2007). Linguistic Inquiry and Word Count: LIWC 2007. Austin, TX

Page 81: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Dialog-act-of-utterance

Labeled(by(parsing(each(utterance(and(then(using(heuristic(rules(based(on(parse(tree:

Commands:(imperatives,(“can(you”,(etc.Backchannels:(yeah,(ok,(uhJhuh,(huhQuestionsAssertions (anything(else)

Page 82: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Prosodic-features

Computed%via%Praat

pitch (mean,(min,(max,(sd):intensity (mean,(min,(max,(sd)voiced(timerate(of(speech((words/second)

Page 83: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Normalizing-LIWC-category-features(Schwartz-et-al-2013,-Facebook-study)

83

• Mairesse:Raw(LIWC(counts

• Schwartz((et(al:Normalized(per(writer:

Page 84: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Sample-results• Agreeable:(

• +Family,(+Home,(JAnger,(JSwear

• Extravert• +Friend,(+Religion,(+Self

• Conscientiousness:• JSwear,(JAnger,(JNegEmotion,(

• Emotional(Stability:(• JNegEmotion,(+Sports,(

• Openness• JCause,(JSpace

84

Page 85: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Decision-tree-for-predicting-extraversionin-essay-corpus-(Mairesse et-al)Mairesse, Walker, Mehl & Moore

Words per sentence

Familiarity

Up

Positive emotions

Grooming

17.91

> 17.91

> 599.7

> 1.66

> 0.11

> 0.64 0.64

599.7

1.66

0.11

Introvert

Extravert

Introvert

Introvert

Introvert

Extravert

Apostrophes

2.57

> 2.57

Achievement

> 1.52

1.52

Extravert

Introvert

Sadness

> 1.44 1.44

Extravert

Introvert

Parentheses

> 0.64 0.64

Introvert

Sexuality

Articles

7.23

> 7.23

> 0.12

Introvert

2.57

> 2.57

0.12

Figure 1: J48 decision tree for binary classification of extraversion, based on the essayscorpus and self-reports.

Remarkably, we can see that the LIWC features outperform the MRC features for everytrait, and the LIWC features on their own always perform slightly better than the fullfeature set. This clearly suggests that MRC features aren’t as helpful as the LIWC featuresfor classifying personality from written text, however Table 13 shows that they can stilloutperform the baseline for four traits out of five.

Concerning the algorithms, we find that AdaboostM1 performs the best for extraversion(56.3% correct classifications), while SMO produces the best models for all other traits. Itsuggests that support vector machines are promising for modelling personality in general.The easiest trait to model is still openness to experience, with 62.5% accuracy using LIWCfeatures only.

4.2 EAR Corpus

Classification accuracies for the EAR corpus are in Table 14. We find that extraversion isthe easiest trait to model using observer reports, with both Naive Bayes and AdaboostM1

476

85

Page 86: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Feature-analysis:-Observed-Extraversion

more(wordshigher(pitchmore(concrete,(imageable wordsgreater(variation(in(intensitygreater(mean(intensitymore(word(repetitions

M5’(Regression(Tree

Page 87: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Using-all-words-instead-of-lexiconsFacebook-study

Schwartz(et(al.((2013)• Choosing(phrases(with(pmi >(2*length(([in(words]

• Only(use(words/phrases(used(by(at(least(1%(of(writers• Normalize(counts(of((words(and(phrases(by(writer

87

Page 88: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Facebook-study,-Learned-words,-Extraversion-versus-Introversion

Page 89: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Facebook-study,-Learned-wordsNeuroticism-versus-Emotional-Stability

89

Page 90: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Evaluating-Schwartz-et-al-(2013)-Facebook-Classifier

• Train(on(labeled(training(data• LIWC(category(counts(• words(and(phrases((nJgrams(of(size(1(to(3,(passing(a(collocation(filter(

• Tested(on(a(heldJout(set• Correlations(with(human(labels

• LIWC(((.21J.29• All(Words((.29J.41

90

Page 91: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

What-about-predicting-personality-from-what-someone-likes?YouyouWu,(Michal(Kosinski,(and(David(Stillwell.("ComputerJbased(personality(judgments(are(more(accurate(than(those(made(by(humans."(Proceedings%of%the%National%Academy%of%Sciences (2015)

1. 86,220(volunteers(filled(in(a(100Jitem(questionnaire• International(Personality(Item(Pool((IPIP)(FiveJFactor(Model(of(personality

2. They(then(used(Facebook(likes(to(predict(personality(for(70,520(people

3. And(asked(their(friends(to(answer(a(10Jitem(questionnaire(for(17,622(people

Page 92: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Wu-et-al.-Algorithm:

judged by two friends. A diagram illustrating the methods ispresented in Fig. 1.

ResultsSelf-Other Agreement. The primary criterion of judgmental accuracyis self-other agreement: the extent to which an external judgmentagrees with the target’s self-rating (17), usually operationalized asa Pearson product-moment correlation. Self-other agreement wasdetermined by correlating participants’ scores with the judgmentsmade by humans and computer models (Fig. 1). Since self-otheragreement varies greatly with the length and context of the re-lationship (18, 19), we further compared our results with thosepreviously published in a meta-analysis by Connely and Ones (20),including estimates for different categories of human judges:friends, spouses, family members, cohabitants, and work colleagues.To account for the questionnaires’ measurement error, self-

other agreement estimates were disattenuated using scales’Cronbach’s α reliability coefficients. The measurement error ofthe computer model was assumed to be 0, resulting in the lower(conservative) estimates of self-other agreement for computer-based judgments. Also, disattenuation allowed for direct com-parisons of human self-other agreement with those reported byConnely and Ones (20), which followed the same procedure.The results presented in Fig. 2 show that computers’ average

accuracy across the Big Five traits (red line) steadily grows withthe number of Likes available on the participant’s profile (x axis).Computer models need only 100 Likes to outperform an averagehuman judge in the present sample (r = 0.49; blue point).†

Compared with the accuracy of various human judges reportedin the meta-analysis (20), computer models need 10, 70, 150, and300 Likes, respectively, to outperform an average work col-league, cohabitant or friend, family member, and spouse (graypoints). Detailed results for human judges can be found inTable S1.How accurate is the computer, given an average person? Our

recent estimate of an average number of Likes per individual is227 (95% CI = 224, 230),‡ and the expected computer accuracyfor this number of Likes equals r = 0.56. This accuracy is sig-nificantly better than that of an average human judge (z = 3.68,P < 0.001) and comparable with an average spouse, the best ofhuman judges (r = 0.58, z = −1.68, P = 0.09). The peak computerperformance observed in this study reached r = 0.66 for partic-ipants with more than 500 Likes. The approximately log-linearrelationship between the number of Likes and computer accu-racy, shown in Fig. 2, suggests that increasing the amount ofsignal beyond what was available in this study could further boostthe accuracy, although gains are expected to be diminishing.Why are Likes diagnostic of personality? Exploring the Likes

most predictive of a given trait shows that they represent activ-ities, attitudes, and preferences highly aligned with the Big Fivetheory. For example, participants with high openness to experi-ence tend to like Salvador Dalí, meditation, or TED talks; par-ticipants with high extraversion tend to like partying, Snookie(reality show star), or dancing.Self-other agreement estimates for individual Big Five traits

(Fig. 2) reveal that the Likes-based models are more diagnostic of

Fig. 1. Methodology used to obtain computer-based judgments and estimate the self-other agreement. Participants and their Likes are represented as a matrix,where entries are set to 1 if there exists an association between a participant and a Like and 0 otherwise (second panel). The matrix is used to fit five LASSO linearregression models (16), one for each self-rated Big Five personality trait (third panel). A 10-fold cross-validation is applied to avoid overfitting: the sample israndomly divided into 10 equal-sized subsets; 9 subsets are used to train the model (step 1), which is then applied to the remaining subset to predict the per-sonality score (step 2). This procedure is repeated 10 times to predict personality for the entire sample. The models are built on participants having at least 20Likes. To estimate the accuracy achievable with less than 20 Likes, we applied the regression models to random subsets of 1–19 Likes for all participants.

†This figure is very close to the average human accuracy (r = 0.48) found in Connelly andOnes’s meta-analysis (20).

‡Estimate based on a 2014 sample of n = 100,001 Facebook users collected for a separateproject. Sample used in this study was recorded in the years 2009–2012.

Youyou et al. PNAS | January 27, 2015 | vol. 112 | no. 4 | 1037

PSYC

HOLO

GICALAND

COGNITIVESC

IENCE

S

Page 93: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Results:

some traits than of others. Especially high accuracy was observedfor openness—a trait known to be otherwise hard to judge due tolow observability (21, 22). This finding is consistent with previousfindings showing that strangers’ personality judgments, based ondigital footprints such as the contents of personal websites (23),are especially accurate in the case of openness. As openness islargely expressed through individuals’ interests, preferences, andvalues, we argue that the digital environment provides a wealth ofrelevant clues presented in a highly observable way.Interestingly, it seems that human and computer judgments

capture distinct components of personality. Table S2 lists cor-relations and partial correlations (all disattenuated) betweenself-ratings, computer judgments, and human judgments, basedon a subsample of participants (n = 1,919) for whom bothcomputer and human judgments were available. The averageconsensus between computer and human judgments (r = 0.37) isrelatively high, but it is mostly driven by their correlations withself-ratings, as represented by the low partial correlations (r =0.07) between computer and human judgments. Substantialpartial correlations between self-ratings and both computer (r =0.38) and human judgments (r = 0.42) suggest that computer andhuman judgments each provide unique information.

Interjudge Agreement. Another indication of the judgment accu-racy, interjudge agreement, builds on the notion that two judgesthat agree with each other are more likely to be accurate thanthose that do not (3, 24–26).The interjudge agreement for humans was computed using

a subsample of 14,410 participants judged by two friends. As thejudgments were aggregated (averaged) on collection (i.e., we didnot store judgments separately for the judges), a formula was usedto compute their intercorrelation (SI Text). Interjudge agreement

for computer models was estimated by randomly splitting theLikes into two halves and developing two separate models fol-lowing the procedure described in the previous section.The average consensus between computer models, expressed

as the Pearson product-moment correlation across the Big Fivetraits (r = 0.62), was much higher than the estimate for humanjudges observed in this study (r = 0.38, z = 36.8, P < 0.001) or inthe meta-analysis (20) (r = 0.41, z = 41.99, P < 0.001). All resultswere corrected for attenuation.

External Validity. The third measure of judgment accuracy, ex-ternal validity, focuses on how well a judgment predicts externalcriteria, such as real-life behavior, behaviorally related traits, andlife outcomes (3). Participants’ self-rated personality scores, aswell as humans’ and computers’ judgments, were entered intoregression models (linear or logistic for continuous and di-chotomous variables respectively) to predict 13 life outcomesand traits previously shown to be related to personality: lifesatisfaction, depression, political orientation, self-monitoring,impulsivity, values, sensational interests, field of study, substanceuse, physical health, social network characteristics, and Face-book activities (see Table S3 for detailed descriptions). The ac-curacy of those predictions, or external validity, is expressed asPearson product-moment correlations for continuous variables,or area under the receiver-operating characteristic curve (AUC)for dichotomous variables.§As shown in Fig. 3, the external validity of the computer

judgments was higher than that of human judges in 12 of the 13

Fig. 2. Computer-based personality judgment accuracy (y axis), plotted against the number of Likes available for prediction (x axis). The red line representsthe average accuracy (correlation) of computers’ judgment across the five personality traits. The five-trait average accuracy of human judgments is positionedonto the computer accuracy curve. For example, the accuracy of an average human individual (r = 0.49) is matched by that of the computer models based onaround 90–100 Likes. The computer accuracy curves are smoothed using a LOWESS approach. The gray ribbon represents the 95% CI. Accuracy was averagedusing Fisher’s r-to-z transformation.

§AUC is an equivalent of the probability of correctly classifying two randomly selectedparticipants, one from each class, such as liberal vs. conservative political views. Note thatfor dichotomous variables, the random guessing baseline corresponds to an AUC = 0.50.

1038 | www.pnas.org/cgi/doi/10.1073/pnas.1418680112 Youyou et al.

Page 94: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Summary-on-Personality-Detection

• From-text-and-speech• Not(a(solved(task• Text(and(prosodic(features(both(somewhat(useful• Especially(hard(to(extract(selfJlabeled(personality• Especially(hard(to(detect(openness

• From-likes:• Seems(to(be(an(easier(task• ComputerJbased(judgment(of(personality((r(=(0.56)(correlates(more(strongly(with(selfJratings(than(average(human(judgments(do((r(=(0.49)

Page 95: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Detecting-Social-and-Affective-Meaning

Detecting(Personality

Page 96: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Detecting-Social-and-Affective-Meaning

Other(tasks:(Detecting(student(uncertainty(and(disinterest(for(online(education

Page 97: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Detecting-student-confusion-in-a-tutoring-system

ITSpoke:-Intelligent(Tutoring(Spoken(Dialogue(SystemDiane(Litman,(Katherine(ForbesJRiley,(Scott(Silliman,(Mihai(Rotaru• Jackson(Liscombe,(Julia(Hirschberg,(Jennifer(J.(Venditti.(2005.(Detecting(

Certainness(in(Spoken(Tutorial(Dialogues• ForbesJRiley,(Kate,(and(Diane(Litman.("Benefits(and(challenges(of(realJtime(

uncertainty(detection(and(adaptation(in(a(spoken(dialogue(computer(tutor."(Speech(Communication(53,(no.(9((2011):(1115J1136.

Page 98: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Tutorial-corpus:-how-certain-is-the-student

• 151(dialogues(from(17(subjects• student(first(writes(an(essay,(then(discusses(with(tutor

• both(are(recorded(with(microphones• manually(transcribed(and(segmented(into(turns• 6778(student(utterances((average(2.3(seconds)• each(utterance(handJlabeled(for(certainty

Jackson(Liscombe,(Julia(Hirschberg,(Jennifer(J.(Venditti.(2005.(Detecting(Certainness(in(Spoken(Tutorial(Dialogues

Page 99: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Uncertainty-in-ITSpoke

Slide(from(Jackson(Liscombe

[71-67-1:92-113]

um <sigh> I don’t even think I have anidea here ...... now .. mass isn’t weight...... mass is ................ the ..........space that an object takes up ........ isthat mass?

um <sigh> I don’t even think I have anidea here ...... now .. mass isn’t weight...... mass is ................ the ..........space that an object takes up ........ isthat mass?

um <sigh> I don’t even think I have anidea here ...... now .. mass isn’t weight...... mass is ................ the ..........space that an object takes up ........ isthat mass?

um <sigh> I don’t even think I have anidea here ...... now .. mass isn’t weight...... mass is ................ the ..........space that an object takes up ........ isthat mass?

um <sigh> I don’t even think I have anidea here ...... now .. mass isn’t weight...... mass is ................ the ..........space that an object takes up ........ isthat mass?

Page 100: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Does-it-help?-A-tutorial-system-that-adapts-to-uncertaintyForbesJRiley,(Kate,(and(Diane(Litman.("Benefits(and(challenges(of(realJtime(uncertainty(detection(and(adaptation(in(a(spoken(dialogue(computer(tutor."(Speech(Communication(53,(no.(9((2011):(1115J1136.

ager, where states represent tutor questions and responsesand transitions between states represent student answervalues. The dialogue manager determines the semantic clas-sification and correctness value for each student answerusing Levenshtein distance between the answer string anda set of strings corresponding to semantic concepts repre-senting anticipated correct and incorrect answers to thecurrent tutor question. For instance, the semantic conceptdownward is the semantic classification for answer stringssuch as “down”, “towards earth”, etc. There are 51 differ-ent semantic concepts shared among the 142 different tutorstates; the set of answer strings which represent eachsemantic concept were assembled from prior ITSPOKEcorpora. If an answer string is successfully classified as ananticipated semantic concept, it receives the correctnessvalue associated with that concept, otherwise, the answerstring is labeled as incorrect.

In non-adaptive ITSPOKE, the next tutor state dependsonly on the correctness value of the student answer. Inuncertainty-adaptive ITSPOKE, the next tutor statedepends on both the correctness and uncertainty value ofthe student answer, where the uncertainty value is deter-mined by prosodic features and other features of the stu-dent answer (see Sections 4 and 5).

Finally, the tutor response produced by the dialoguemanager is sent to the Cepstral text-to-speech synthesis sys-tem8 and played to the student through the headphone.The tutor text is also displayed on the web interface, asshown in Fig. 2.

Earlier implementations of the non-adaptive ITSPOKEsystem have been used in prior experiments as a platformto compare human-computer versus human-human spokendialogue tutoring, spoken versus typed dialogue computertutoring, and synthesized versus pre-recorded tutor speech(Litman et al., 2006; Forbes-Riley et al., 2006). In theseprior experiments, significant overall learning has beenassociated with ITSPOKE tutoring (as measured by pretestand posttest before and after system use, respectively).

However, there is still significant room for improvement;for example, the average student posttest score after usingthe ITSPOKE system has never exceeded 75% (out of100%).

4. UNC-ITSPOKE: automatically adapting to uncertainty

The automatic uncertainty adaptation used in thisexperiment was based on the hypothesis that student learn-ing could be significantly improved by detecting and adapt-ing to student uncertainty during the computer tutoring. Inparticular, the adaptation was motivated by research thatviews uncertainty as well as incorrectness as a signal of a“learning impasse”, i.e., as an opportunity for the studentto better learn the material about which s/he is uncertainor incorrect; this research further suggests that experienc-ing uncertainty can motivate the student to take an activerole in constructing a better understanding of the materialbeing tutored (e.g. VanLehn et al., 2003; Kort et al., 2001).

We further observed that to be motivated to resolve alearning impasse, the student must first perceive that itexists. Incorrectness and uncertainty differ in this percep-tion. Incorrectness simply signals the student has reachedan impasse, while uncertainty signals the student perceivess/he has reached an impasse. Based on this, we associatedeach combination of binary uncertainty (UNC,CER) andcorrectness (INC,COR) with an “impasse severity”, as inFig. 6. COR + CER corresponds to a state where a studentis not experiencing an impasse, since s/he is correct and notuncertain about. INC + CER corresponds to a state wherea student is experiencing the most severe type of impasse,since s/he is incorrect and not aware of it. INC + UNCand COR + UNC answers indicate impasses of lesserseverity: the student is incorrect but aware s/he may be,and the student is correct but uncertain if s/he is, respec-tively. In (Forbes-Riley et al., 2008) we show empirical sup-port for distinguishing impasse severities.

We hypothesized that student learning would increase ifUNC-ITSPOKE would provide additional substantivecontent to remediate all learning impasse states. As dis-cussed in Section 3, both types of incorrectness impasse

Fig. 5. ITSPOKE architecture.

8 The Cepstral system is a commercial outgrowth of the Festival system(Black, 1997).

1120 K. Forbes-Riley, D. Litman / Speech Communication 53 (2011) 1115–1136

Page 101: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Give-more-information-if-the-student- is-uncertainForbesJRiley,(Kate,(and(Diane(Litman.("Benefits(and(challenges(of(realJtime(uncertainty(detection(and(adaptation(in(a(spoken(dialogue(computer(tutor."(Speech(Communication(53,(no.(9((2011):(1115J1136.

Page 102: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Features-for-uncertainty

Page 103: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Conclusions

• Uncertainty(is(very(hard(to(detect• FJscore(of(.27

• Even(so,(using(the(uncertainty(detector(improved(learner(outcomes(a(bit(over(not(using(it.

• But(need(better(detection(of(uncertainty,(and(also(better(detection(of(correct(answers.

Page 104: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Detecting-disengagementKate(ForbesJRiley,(Diane(Litman,(Heather(Friedberg,(Joanna(Drummond.(2012.((Intrinsic(and(Extrinsic(Evaluation(of(an(Automatic(User(Disengagement(Detector(for(an(UncertaintyJAdaptive(Spoken(Dialogue(System.((NAACL(2012.

Page 105: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Disengagement-Features

Most(important(feature:(Pause(prior(to(start(of(turn(<250ms(means(disengagement

Page 106: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Does-disengagement-detection-help-the-system?

YesBut(not(for(all(students

Only(for(male(students((improved(their(performance(significantly)

Litman,(Diane,(and(Kate(ForbesJRiley.("Evaluating(a(Spoken(Dialogue(System(that(Detects(and(Adapts(to(User(Affective(States."(In(15th%Annual%Meeting%of%the%Special%Interest%Group%on%Discourse%and%

Dialogue,(p.(181.(2014

Page 107: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Detecting-Social-and-Affective-Meaning

Interpersonal(Stance(Detection

Page 108: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Scherer’s-typology-of-affective-statesEmotion:(relatively(brief(episode(of(synchronized(response(of(all(or(most(organismic(subsystems(in(response(to(the(evaluation(of(an(event(as(being(of(major(significance

angry,-sad,-joyful,-fearful,-ashamed,-proud,-desperate

Mood:(diffuse(affect(state(…change(in(subjective(feeling,(of(low(intensity(but(relatively(long(duration,(often(without(apparent(cause

cheerful,-gloomy,-irritable,-listless,-depressed,-buoyant

Interpersonal-stance:(affective(stance(taken(toward(another(person(in(a(specific(interaction,(coloring(the(interpersonal(exchange

distant,-cold,-warm,-supportive,-contemptuous

Attitudes:(relatively(enduring,(affectively(colored(beliefs,(preferences(predispositions(towards(objects(or(persons(

liking,-loving,-hating,-valuing,-desiring

Personality-traits:(emotionally(laden,(stable(personality(dispositions(and(behavior(tendencies,(typical(for(a(person

nervous,-anxious,-reckless,-morose,-hostile,-envious,-jealous

Page 109: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Automatically-Extracting-Social-Meaning-from-Speed-Dates

Rajesh(Ranganath,(Dan(Jurafsky,(and(Daniel(A.(McFarland.(2013.(Detecting(friendly,(flirtatious,(awkward,(and(assertive(speech(in(speedJdates.(Computer%Speech%and%Language%27:1,(89J115.

McFarland,(Daniel,(Dan(Jurafsky,(and(Craig(M.(Rawlings.(2013.("Making(the(Connection:(Social(Bonding(in(Courtship(Situations.”(American%

Journal%of%Sociology.

Page 110: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Detecting-stance

1000(4Jminute(speed(datesSubjects(labeled(selves and(each-other-for(

• friendly((each(on(a(scale(of(1J10)• awkward• flirtatious• assertive

Page 111: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Linguistic-features-we-examined

• Words:• HEDGES:((kind(of,(sort(of,(a(little,(I(don’t(know,(I(guess• NEGEMOTION:-bad,(weird,(crazy,(problem,(tough,(awkward,(boring• LOVE:-love,(loved,(loving,(passion,(passions,(passionate• WORK:(research,(advisor,(lab,(work,(finish,(PhD,(department• I: I,(me,(mine,(my,(myself,(you,(your,(yours,(etc.

• Prosody• pitch(ceiling,(pitch(floor,(energy,(rate(of(speech

Page 112: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Dialog-act-features

• Clarification-questionsWhat?(((Excuse(me?

• Laughter[Beginning%of%turn]%%[End%of%turn]

• AppreciationsAwesome!%%%%%That’s%amazing!%%%%%Oh,%great

• SympathyThat%sounds%terrible!%%That’s%awful!%%%That%sucks!

Page 113: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Positive-and-negative-assessments

Sympathy(that’s|that(is|that(seems|it(is|that(sounds)(very|really|a( little|sort(of)?((terrible|awful|weird|sucks|a( problem|tough|too( bad)

Appreciations-(“Positive-feedback”)

(Oh)?((Awesome|Great|All( right|Man|No( kidding|wow|my(god)

That((‘s|is|sounds|would(be)((((so|really)?(great|funny|good|interesting|neat|amazing|nice|not( bad|fun)

(Goodwin, 1996; Goodwin and Goodwin, 1987; Jurafsky et al., 1998)

Page 114: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Interruptions

A:((Not(necessarily.((I(mean(it(happens(to(not(necessarily(be(my(thing,(but(there(are(plenty(ofJJB:((No,(no,(I(understand(your(point.

Page 115: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Model

• Multinomial(logistic(regression(classifier• Predict(whether(a(conversation(side(is(labeled(

flirt/friendly/assertive/awkward• Given(linguistic(features

• Then(look(at(the(feature(weights

Page 116: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Linguistic-signs-of-awkwardness

• Awkward(men(and(women(use(more(hedges• kind%of,%sort%of,%a%little

• People(who(are(so(uncomfortable(in(the(date• So(in(need(of(distancing(themselves

• That(they(can’t(even(commit(to(their(sentence.

Page 117: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

What-makes-someone-seem-friendly?“Collaborative-conversational-style”

• Friendly-people:• laugh(at(themselves• don’t(use(negative(emotions

• Friendly-men• are(sympathetic(and(agree(more(often• don’t(interrupt• don’t(use(hedges

• Friendly-women:• higher(max(pitch• laugh(at(their(date

Page 118: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

What-do-flirters-do?

Men:! raise(pitch(floor! laugh(turnJinitially((at(

their(date,(teasing?)! say(“you”(! don’t(use(words(

related(to(academics

Women:! raise(pitch(ceiling! laugh(turnJfinally(

(at(themselves?)! say(“I”(! use(negation((don’t,%

no,%not)

Page 119: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Unlikely-words-for-male-flirting

• academia• interview• teacher• phd• advisor• lab• research• management• finish

Page 120: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

How-consistently-do-people-read-each-others-linguistic-cues?

Each-dater-labeled-themselves

• I(flirted((from(1J10)• I(was(friendly((from(1J10)• I(was(awkward((from(1J10)• I(was(assertive((from(1J10)

And-labeled-their-partner

• My(date(flirted((from(1J10)• …

How-much-do-these-labels-agree?--

• If(I(thought(I(was(friendly,(does(my(date(agree?

Page 121: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Not-at-all

Page 122: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Not-at-all!

People(disagree(with(their(date(about(stance:

Male is flirting- (1E10)

Male(101 says: 8

Female(127 says: 1

Page 123: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Why?

People(assume(their(date(is(behaving(like(themselves:

Male-is-flirting

Female-is-flirting

Male(101 says: 8 7

Female(127 says: 1 1

Page 124: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Correlations-between-my-behavior-and-what-I-say-about-my-date

I label-myself---x I-label-date

I-label-date-x date-labels-themselves

Flirting .73 .15

Friendly .77 .05

Awkward .58 .07

Assertive .58 .09

Page 125: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

We-are-not-very-good-at-modeling-each-other’s-intentions

• At(least(not(in(4(minutes• Speakers(instead(base(their(judgments(on(their(own(behavior(or(intentions

Page 126: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

How-does-clicking-happen?

• Sociology(literature:• bonding(or(“sense(of(connection”(is(caused(by

• homophily:(select(mate(who(shares(your(attributes(and(attitudes• motives-and-skills-

• mutual-coordination-and-excitement

• (Durkheim:(religious(rituals,(unison(singing,(military)

• But(what(is(the(role(of(language?• Background:(speed(dating(has(power(asymmetry• women-are-pickier• Lot(of(other(asymmetric(role(relationships((teacherJstudent,(doctorJpatient,(bossJemployee,(etc.)(

Page 127: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Our-hypothesis:-targeting-of-the-empowered-party

• The(conversational(target(is(the(woman• both(parties(should(talk(about(her(more

• The(woman’s(face(is(important• the(man(should(align(to(the(woman(and(show(understanding

• The(woman’s(engagement(is(key• in(a(successful(bonding,(she(should(be(engaged

Page 128: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Results:-Clicking-associated-with:

• both(parties(talk(about(the(woman• women(use(I,%• men(use(you

• man(supports(woman’s(face• men(use(appreciationsand(sympathy,(• men(accommodatewomen’s(laughter• men interrupt(with(collaborative%completions

• woman(is(engaged• women(raise(their(pitch,(vary(loudness(and(pitch• women(avoid(hedges

Hierarchical(regression(dyad(model,(net(of(actor,(partner,(dyad(features

Page 129: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Function-of-Interruptions?

Theory-1:-Control:(Used(by(men(to(take(the(floor((Zimmerman(and(West(1975;(West(1985)Theory-2:-Shared-meaning,-alignment,-engagement:-(Tannen 1994;(Coates(1996,(1997),(collaborative(floor((Edelsky 1981)

Page 130: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

We-found:--interruptions-are-joint-construction-(“collaborative-completions”)

• a(turn(where(a(speaker(completes(the(utterance(begun(by(the(alter((Lerner,(1991;(Lerner,(1996).

So(are(you(almostJJ

On(my(way(out,(yeah.

Page 131: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Or-showing-shared-understanding

Female:((I(didn’t(used(to(like(it(but(now(I’m—Male:(((((Oh(same(for(me.…

Page 132: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Other-factors-that-might-matter• Height(• BMI(((Body(mass(index)• ForeignJborn• Dating(experience(• Looking(for(relationship?• Order(of(date(in(evening(• Met(before• Age(difference(• Hobby(similarities

Page 133: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

These-factors-do-influence-stance

• More(likely(to(flirt:• highJBMI(men(or(women• taller(men(and(shorter(women• men(later(in(the(evening

• More(likely(to(be(flirted(with• lowJBMI(women• highJBMI(men

• Bigger((taller,(heavier)(men(say(they(are(more(assertive

Page 134: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

But-language-still-matters

Language Traits Language-+-Traits

Male(flirt

66% 64 72

Female(flirt

74 55 76

accuracy((baseline(is(50%)

Page 135: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Conclusions-from-Dating

• We(are(not(very(good(at(reading(intentions• How-friendly-people-sound

• be(sympathetic,(ask(clarification(questions,(agree,(accommodate

• How-to-date:• Don’t(talk(about(your(advisor• Daters-focus-on-the-empowered-party• We(can(detect(this(with(relatively(simple(linguistic(features

Page 136: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Detecting-Social-and-Affective-Meaning

Interpersonal(Stance(Detection

Page 137: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Detecting-Social-and-Affective-Meaning

Summary

Page 138: CS 124/LINGUIST 180 From Languages to Information · CS 124/LINGUIST 180 From Languages to Information Detecting(Social(and(Affective(Meaning Dan(Jurafsky Stanford(University

Scherer’s-typology-of-affective-statesEmotion:(relatively(brief(episode(of(synchronized(response(of(all(or(most(organismic(subsystems(in(response(to(the(evaluation(of(an(event(as(being(of(major(significance

angry,-sad,-joyful,-fearful,-ashamed,-proud,-desperate

Mood:(diffuse(affect(state(…change(in(subjective(feeling,(of(low(intensity(but(relatively(long(duration,(often(without(apparent(cause

cheerful,-gloomy,-irritable,-listless,-depressed,-buoyant

Interpersonal-stance:(affective(stance(taken(toward(another(person(in(a(specific(interaction,(coloring(the(interpersonal(exchange

distant,-cold,-warm,-supportive,-contemptuous

Attitudes:(relatively(enduring,(affectively(colored(beliefs,(preferences(predispositions(towards(objects(or(persons(

liking,-loving,-hating,-valuing,-desiring

Personality-traits:(emotionally(laden,(stable(personality(dispositions(and(behavior(tendencies,(typical(for(a(person

nervous,-anxious,-reckless,-morose,-hostile,-envious,-jealous