language acquisition computational intelligence...

71
Language Acquisition Computational Intelligence 4/7/05 LouAnn Gerken

Upload: others

Post on 26-Sep-2020

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Language Acquisition Computational Intelligence

4/7/05LouAnn Gerken

Page 2: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Overview

• The language learner’s problem space• A sample of computational capacities

of human infants• Turning the induction problem into a

deduction problem

Page 3: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

The Language Learner’sProblem Space

Page 4: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Language vs. Communication

Mental message (codedin some lingua mentis)

Did it hurt to get all ofthose holes in your ear?

Create hierarchicalstructure and find wordsto convey message

Ordered sequence oflanguage specific sub-meaning units

Recover wordsfrom acoustics

Parse sentence structure

Interpret structure

Page 5: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Language vs. Communication

• Language researchers generallyreserve the term ‘language’ to refer to asystem with ‘duality of patterning’– Combinable sub-meaning units– Combinable meaning units

• Communication is any system thatconveys information from one organismto another.

Page 6: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

What problems face alanguage learner?

• Induction problem (at 2 levels)• Poverty of the stimulus (at 2 levels)• Noisy data (at 2 levels)

Page 7: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

The Induction Problem

• You’re about to see 5 slides of 3colored bars in frames.

• The first 4 slides exhibit a property thatyou need to learn.

• Decide whether or not the 5 slideexhibits the property in question.

Page 8: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs
Page 9: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs
Page 10: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs
Page 11: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs
Page 12: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

?

Page 13: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Answer

• NO• The property in question in whether the

area covered by the bars is greaterthan 50% of the area of the rectangle.

• This generalization is more natural forpigeons to learn than for humans.

Page 14: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Try it again

Page 15: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs
Page 16: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs
Page 17: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs
Page 18: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs
Page 19: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

?

Page 20: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Answer

• YES• Property in question in whether the 3

bars are unequal in height.• This generalization is more natural for

humans than for pigeons.

Page 21: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Try it again

Page 22: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs
Page 23: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs
Page 24: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs
Page 25: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs
Page 26: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

?

Page 27: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Answer

• YES• You only saw examples decreasing in

height from left to right. But thegeneralization was still that the barsonly had to be different heights.

Page 28: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

The Induction Problem

• You have to be able to discern therelevant dimension(s)

• Any set of input data potentially allowsan infinite number of generalizations.

• If the learner receives a random subsetof data, many incorrect generalizationscan be ruled out.

Page 29: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Poverty of the Stimulus

• Generalization in any domain requires thelearner to solve the induction problem (aswell as the problem of noisy data).

• But some linguists think that there’s anadditional problem for language learners:Some types of input that could rule outincorrect generalizations is entirely missingduring the acquisition process. E.g., …

Page 30: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Poverty of the Stimulus• Statement-question pair with one clause

– The man is tall.– Is the man tall?

• Statement with 2 clauses– The man who is tall is nice.

• Possible questions– Is the man who tall is nice?– Is the man who is tall nice?

• It is argued that children don’t hear the correct inputbefore they start to produce the correct 2-clausequestion.

Page 31: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Poverty of the Stimulus

• It’s my sense that such claims willeventually be seen as incorrect.

• However, if they ARE true, thenlanguage learning presents a learningproblem above and beyond theinduction problem.

Page 32: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Computational Capacities ofHuman Infants

Page 33: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Headturn PreferenceProcedure

Page 34: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs
Page 35: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Learning in the lab -General procedure

• Infants are exposed for about 2 min. tofamiliarization stimuli.

• They are then tested on stimuli that areeither consistent or inconsistent withthe familiarization stimuli.

• The dependent measure is listeningtime for consistent vs. inconsistent.

• Any significant difference counts

Page 36: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Novely vs. Familiarity

0

10

20

30

40

50

60

70

80

90

100

familiarity novelty

Lea

rnin

g S

tren

gth

Page 37: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Overview of InfantComputation

• Descriptive statistics• Conditional probabilities• Patterns of identity• Input with multiple generalizations• Categories

Page 38: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Descriptive Statistics

Maye, Werker & Gerken, 2001

Page 39: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Which acoustic differencesmark category distinctions?

• The same acoustic differences that arenot used to mark meaning distinctionsappear allophonically.

• E.g., ‘zip lip’ vs. ‘back lip’ in English vs.Mid-Waghi

• How do infants learn which acousticdifferences reflect different categories?

Page 40: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Maye, Werker & Gerken, 2001Training Distribution

0

1

2

3

4

1 2 3 4 5 6 7 8

Token in Continuum

Fre

qu

en

cy

Unimodal

Bimodal

Page 41: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Maye et al., 2001

• Infants were familiarized with strings ofsyllables from the continuum plus fillers.

• Half heard a unimodal and half heard abimodal distribution from the continuum.

• At test, they either heard one of theend points multiple times in successionof the two end points alternating.

Page 42: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Maye et al., 2001

0

1

2

3

4

5

6

7

8

6 mos.

Unimodal

8 mos.

Unimodal

6 mos.

Bimodal

8 mos.

Bimodal

Alternating

Non-Alternating

Page 43: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Conditional Probabilities

Page 44: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Infants’ Computational Prowess Saffran et al., 1996

Listen to the following stimulus and try to find the words

• You heard 3 3-syllable words stuck togetherbidakugolabupadotigolabubidaku

• da follows bi 100% of the time, whereas gofollows bu 33% of the time.

• 7.5-month-old infants can use conditionalprobabilities across adjacent syllables, asshown by the fact that they listen longer tostatistical part words like bupado thanstatistical words like padoti

Page 45: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Patterns of Identity

Page 46: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

AAB Stimuli from Marcus et al., 1999

B

A

di je li we

le leledi leleje leleli lelewe

wi wiwidi wiwije wiwili wiwiwe

ji jijidi jijije jijili jijiwe

de dededi dedeje dedeli dedewe

Page 47: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Marcus et al. 1999

• Half of the infants were familiarized with AABand half on ABA strings.

• Over 12 test trials, all infants heard AAB andABA strings instantiated in new syllables(popoga, kokoba and pogapo, kobako)

• Infants listened significantly longer duringinconsistent test trials.

Page 48: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Input with MultipleGeneralizations

Gerken (in press)

Page 49: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Different subsets of an input set supportdifferent generalizationsB

A

di je li we

le leledi leleje leleli lelewe

wi wiwidi wiwije wiwili wiwiwe

ji jijidi jijije jijili jijiwe

de dededi dedeje dedeli dedewe

Page 50: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Different subsets of an input set supportdifferent generalizations

• 2 groups of 9-month-olds were familiarizedwith synthesized tokens from either diagonalor column of previous table.

• Half in each group heard AAB or ABA strings.• At test, all infants heard 2 AAB and 2 ABA

strings instantiated in new syllables (popoga,kokoba and pogapo, kobako)

Page 51: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Generalization to New StringsDiagonal vs. Column

Page 52: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Did the Column-ExposedMake a Less AbstractGeneralization (dee)?

• A third group of infants was familiarizedwith the column stimuli (AAB or ABA).

• They were then tested on stringscontaining “dee” popodi, kokodi orpodipo, kodiko)

Page 53: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Infants Make Less AbstractGeneralization from Column

Page 54: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Categories

(Gerken, Wilson & Lewis, 2005)

Page 55: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Russian Gender Categories

???kornyemmedved-yem

stroiltel-yem

uchitel-yem

zhitel-yem

pisaryakornya???stroitelyauchitelyazhitelya

???kniguvannuruchkurubashkupolku

korovojknigoj???ruchkojrubashkojpolkoj

tested on 6 trials each of

grammatical: vannoj korovu medvedya pisaryem

ungrammatical: vannya korovyem medvedoj pisaru

Page 56: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Can Infants Learn RussianGender Categories?

• 2 groups of 17-month-olds– Group 1 was familiarized for 2 min. to

words in which a subset were double-marked

– Group 2 received an identical test, but wasfamiliarized with stimuli containing onlycase ending cues to gender (no wordsending in “tel” or “k” were included)

Page 57: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Correlated Cues for CategoryInduction

0

2

4

6

8

10

12

Correlated Cues Case Marking Only

Familiarization Condition

Mean

Lis

ten

ing

Tim

e (

sec.)

grammatical

ungrammatical

Page 58: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Turning Induction intoDeduction

Page 59: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Turning Induction intoDeduction

• There are 3 approaches to theinduction problem for language:– ANN’s

• generalize, but do so by pre-coding therelevant dimensions of the problem space

• produce ‘weird’ errors• don’t account for non-linearities in behavior …

Page 60: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Abrupt vs. Gradual Learning

black triangle or white square vs.white triangle or black square

black vs. white

Page 61: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Abrupt vs. Gradual Learning

Page 62: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Turning Induction intoDeduction

– Hypothesis testing (often Bayesian), like invisual object identification

– Universal Grammar …

Page 63: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Motivation for UG Approach

• Only humans have it• Language creation by individual

children (and writing system creation)• Similarities across languages• Poverty of the stimulus

Page 64: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

The UG Approach

• Languages of the world differ fromeach other in a limited number of ways(parametric variation).

• Humans are born with all parametervalues latently available, and minimalinput is needed to set a parameter.

• Early utterances and de novo creationof language reflects default settings.

Page 65: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Overall Summary

• Language learning entails solving theinduction problem at 2 levels.– sub-meaning units (phonology)– meaning units (syntax)

Page 66: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Overall Summary

• Language learning also might entaildealing with stimulus poverty, in whichsome critical information is withheldfrom young learners.

Page 67: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Overall Summary

• Infants have many computational skills.

Page 68: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Overall Summary

• But we don’t know how they might usethem to solve the induction problem.

Page 69: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Overall Summary

• Alternatively, infants might be engagedin deduction. But the empiricalevidence for a particular view ofdeduction (parameter setting) is notvery strong.

Page 70: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs

Overall Summary

• Research areas:– computational approaches to induction– more information about the computational

skills of infants– exploring deductive constraints on learners

Page 71: Language Acquisition Computational Intelligence 4/7/05elmo.sbs.arizona.edu/sandiway/ling696/louann.pdf · black triangle or white square vs. white triangle or black square black vs