language modeling - sameer singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdflanguage...

38
Language Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017 Based on slides from Dan Jurafsky, Noah Smith, and everyone else they copied from.

Upload: others

Post on 05-Oct-2020

8 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

LanguageModeling

Prof.SameerSinghCS295:STATISTICALNLP

WINTER2017

January24,2017

BasedonslidesfromDanJurafsky,NoahSmith,andeveryoneelsetheycopiedfrom.

Page 2: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

Outline

CS295:STATISTICALNLP(WINTER2017) 2

Wrapup WordEmbeddings

IntroductiontoLanguageModels

N-GramBasedLanguageModels

SmoothingLanguageModels

Page 3: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

Outline

CS295:STATISTICALNLP(WINTER2017) 3

Wrapup WordEmbeddings

IntroductiontoLanguageModels

N-GramBasedLanguageModels

SmoothingLanguageModels

Page 4: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

Predictsurroundingwords

CS295:STATISTICALNLP(WINTER2017) 4

Abottleoftezguino isonthetable.

u

v

Page 5: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

NegativeSampling

CS295:STATISTICALNLP(WINTER2017) 5

Page 6: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

NeuralViewofEmbeddings

CS295:STATISTICALNLP(WINTER2017) 6

Page 7: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

Wordembeddings

CS295:STATISTICALNLP(WINTER2017) 7

Variations

• Skip-gram:predictcontextfromword• CBOW:predictwordfromcontextbagofwords• Dependencies:abetterdescriptionofcontext

Uses

• Similarity:• Grammar:• Analogies

• Gender:• Facts:

Page 8: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

Outline

CS295:STATISTICALNLP(WINTER2017) 8

Wrapup WordEmbeddings

IntroductiontoLanguageModels

N-GramBasedLanguageModels

SmoothingLanguageModels

Page 9: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

LanguageModels

CS295:STATISTICALNLP(WINTER2017) 9

ProbabilityofaSentence

• Isagivensentencesomethingyouwouldexpecttosee?• Syntactically(grammar)andSemantically(meaning)

ProbabilityoftheNextWord

• Predictwhatcomesnextforagivensequenceofwords.• ThinkofitasV-wayclassification

Page 10: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

Task:SpeechRecognition

CS295:STATISTICALNLP(WINTER2017) 10

“eyesaweofan”

“Isawavan”

OR

Page 11: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

Task:MachineTranslation

CS295:STATISTICALNLP(WINTER2017) 11

Page 12: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

Task:HandwritingRecognition

CS295:STATISTICALNLP(WINTER2017) 12

http://www.cedar.buffalo.edu/handwriting/HRoverview.html

Page 13: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

Task:ImageCaptioning

CS295:STATISTICALNLP(WINTER2017) 13

Page 14: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

Task:SpellingCorrection

CS295:STATISTICALNLP(WINTER2017) 14

Theofficeisaboutfifteenminuetsfrommyhouse

P(aboutfifteenminutesfrom)>>P(aboutfifteenminuetsfrom)

Page 15: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

OtherApplications

CS295:STATISTICALNLP(WINTER2017) 15

Summarization

QuestionAnswering

DialogSystems

Page 16: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

EvaluatingLanguageModels

CS295:STATISTICALNLP(WINTER2017) 16

Bestchoice:Extrinsic

2ndchoice:Intrinsic

Page 17: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

Perplexity

CS295:STATISTICALNLP(WINTER2017) 17

Page 18: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

GeneratingTextfromanLM

CS295:STATISTICALNLP(WINTER2017) 18

Page 19: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

Outline

CS295:STATISTICALNLP(WINTER2017) 19

Wrapup WordEmbeddings

IntroductiontoLanguageModels

N-GramBasedLanguageModels

SmoothingLanguageModels

Page 20: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

DirectLanguageModeling

CS295:STATISTICALNLP(WINTER2017) 20

P(“Idonotlikegreeneggsandham”)

P(w|“Idonotlikegreeneggsand”)

Page 21: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

ApplyingtheChainRule

CS295:STATISTICALNLP(WINTER2017) 21

Page 22: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

MarkovAssumption

CS295:STATISTICALNLP(WINTER2017) 22

Page 23: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

UnigramLanguageModel

CS295:STATISTICALNLP(WINTER2017) 23

Page 24: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

BigramLanguageModel

CS295:STATISTICALNLP(WINTER2017) 24

Page 25: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

BerkeleyRestaurantProject

CS295:STATISTICALNLP(WINTER2017) 25

Page 26: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

BerkeleyRestaurantProject

CS295:STATISTICALNLP(WINTER2017) 26

Page 27: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

N-GramLanguageModels

CS295:STATISTICALNLP(WINTER2017) 27

“ThecomputerwhichIhadjustputintothediningroomonthefifthfloorcrashed.”

“ThecomputerwhichIhadjustputintothediningroomonthefifthfloorhadlunch.”

Page 28: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

Shakespeare

CS295:STATISTICALNLP(WINTER2017) 28

Page 29: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

WallStreetJournal

CS295:STATISTICALNLP(WINTER2017) 29

Page 30: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

ImplementationTips

CS295:STATISTICALNLP(WINTER2017) 30

UseLogs

• Preventunderflow• Sums,insteadofproducts

Filteroutn-grams

• Raren-gramsarenoisy/havelowprob• Useunigramstofilterbigrams…

Page 31: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

Outline

CS295:STATISTICALNLP(WINTER2017) 31

Wrapup WordEmbeddings

IntroductiontoLanguageModels

N-GramBasedLanguageModels

SmoothingLanguageModels

Page 32: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

ZeroProbabilityProblem

CS295:STATISTICALNLP(WINTER2017) 32

• Truthiness• #letalonethehashtags• bigly

NewwordsRarewords/combinations Mispellings

• “minuets”• Becausecorpusisfinite..

Trainingset:…deniedtheallegations…deniedthereports…deniedtheclaims…deniedtherequest

P(“offer”|deniedthe)=0

• Testset…deniedtheoffer…deniedtheloan

Page 33: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

LaplaceSmoothing

CS295:STATISTICALNLP(WINTER2017) 33

Page 34: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

IntuitionBehindSmoothing

CS295:STATISTICALNLP(WINTER2017) 34

Whenwehavesparsestatistics:

P(w|deniedthe)3allegations2reports1claims1request7total

allegatio

ns

repo

rts

claims

attack

requ

est

man

outcome

P(w|deniedthe)2.5allegations1.5reports0.5claims0.5request2other7total

allegatio

ns

attack

man

outcome

…allegatio

ns

repo

rts

claims

requ

est

Stealprobabilitymasstogeneralizebetter

Page 35: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

BerkeleyRestaurantProject

CS295:STATISTICALNLP(WINTER2017) 35

Page 36: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

BerkeleyRestaurantProject

CS295:STATISTICALNLP(WINTER2017) 36

Page 37: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

Backoff andInterpolation

CS295:STATISTICALNLP(WINTER2017) 37

• Usetrigram,unlessrare• Thenusebigram,unlessrare• Thenuseunigram..

Backoff

• Combineallthree!• Linearfunctionwithparameters• Learnonheldoutdata

Interpolation

Page 38: Language Modeling - Sameer Singhsameersingh.org/.../wi17/slides/lecture-0124-lang-models.pdfLanguage Modeling Prof. Sameer Singh CS 295: STATISTICAL NLP WINTER 2017 January 24, 2017

Upcoming…

CS295:STATISTICALNLP(WINTER2017) 38

• Homework1isdue:January26,2017• Write-up,data,andcodeforHomework2isup• Homework2isdue:February9,2017

Homework

• Proposalisdue:February7,2017(~2weeks)• Makethingsmoreconcrete:approach,metrics,baselines• Mentionprogress,andaddressmyconcerns,ifany• Only2pages

Project