chapter 7 the associative structure of instrumental conditioning

26
Chapter 7 The Associative Structure of Instrumental Conditioning

Upload: christopher-lawson

Post on 18-Jan-2018

240 views

Category:

Documents


0 download

DESCRIPTION

3 main components in operant learning situation 1.Stimulus – S (or sometimes S d )  the discriminative stimulus sets the occasion for reward by signaling when the response will be followed by the reinforcer 2. Response – R 3. Outcome – O

TRANSCRIPT

Page 1: Chapter 7 The Associative Structure of Instrumental Conditioning

Chapter 7

The Associative Structure of Instrumental Conditioning

Page 2: Chapter 7 The Associative Structure of Instrumental Conditioning

The way we talk about Pavlovian conditioning is very ‘cognitive’

we say that animals form mental representations of the relationships among stimuli animal has a representation of the CS that gets associated with some representation of the US when we present a CS, it calls up a representation of the US Instrumental/operant conditioning is now viewed in the same way subjects/animals are information-processors, not only with respect to stimuli (Pavlovian) but also with respect to their own behavior (operant)

Page 3: Chapter 7 The Associative Structure of Instrumental Conditioning

3 main components in operant learning situation

1. Stimulus – S (or sometimes Sd) the discriminative stimulus sets the occasion for reward by signaling when the response will be followed by the reinforcer

2. Response – R

3. Outcome – O

Page 4: Chapter 7 The Associative Structure of Instrumental Conditioning

Associations develop among each of these elements:S-R association

the discriminative stimulus can become directly associated with the response

S-O association the discriminative stimulus can become associatedwith the outcome (basically a Pavlovian association)

R-O association the response becomes associated with the outcome

Page 5: Chapter 7 The Associative Structure of Instrumental Conditioning

In recent years, this notion that animals develop mental representations of behavior is probably best shown in the work of Rescorla

Rescorla and colleagues have done a # of experiments demonstrating that animals develop R-O associations

The typical way to demonstrate R-O association is to train rats to make a response for a particular outcome and then devalue that outcome — should lead to a decrease in responding

Page 6: Chapter 7 The Associative Structure of Instrumental Conditioning

R-O associationColwill & Rescorla (1985)

Training Devaluation TestR1 O1

LP Sucrose

R2 O2

CP food(Same rats get both)

O1 LiCLO2 nothing

R1 and R2

Everything is counterbalanced, but for thesake of simplicity say LP = sucrose andCP = food and the sucrose is devalued

Page 7: Chapter 7 The Associative Structure of Instrumental Conditioning

R-O association

animals should have developed 2 different R-O associations

during test, 20 min with both responses available they could LP or CP but no outcome was given, i.e., essentially an extinction test

if R1 evokes memory for devalued or aversive outcome, but R2 does not, then should see a decrease in R1

Page 8: Chapter 7 The Associative Structure of Instrumental Conditioning

1

2

34567

Meanresp/min

R1 -outcomewas devalued

Time

R2 -outcomenot devalued

R-O associationResults:

Page 9: Chapter 7 The Associative Structure of Instrumental Conditioning

when outcome (reinforcer) was devalued by pairings with LiCl, the response that produced the reinforcer declined

the reason is that subjects remembered the reinforcer as being aversive and therefore devalued the response that was associated with that outcome

so, memory for, or representation of, the goal object is crucial for the execution of the response

R-O associationResults:

Page 10: Chapter 7 The Associative Structure of Instrumental Conditioning

S-O association

Like Pavlovian CSs, Sd (discriminative stimuli) also become associated with outcomes

Colwill & Rescorla (1988)

Sd training Response training Test

S1 R1 O1

N LP Suc

S2 R2 O2

L CP food

R3 O1

R4 O2

2 new responsesAll rats get both

S1: R3 vs R4

S2: R3 vs R4

Page 11: Chapter 7 The Associative Structure of Instrumental Conditioning

This is essentially what happened

S-O association

if rat has S-O association (i.e., knew which outcome went with which Sd), then when given S1 on test, should perform the response that was associated with the same outcome

i.e., when given S1 — should perform R3 S2 — should perform R4

Page 12: Chapter 7 The Associative Structure of Instrumental Conditioning

S-O association

Results:

2

4

6

10

Meanresp/min Different

outcome

Trials

Sameoutcome

8

Page 13: Chapter 7 The Associative Structure of Instrumental Conditioning

S-O association

Results:

in the presence of a particular Sd, the rats performed the response that was associated with the same outcome, more than the response associated with the different outcome

evidence for S-O association

Page 14: Chapter 7 The Associative Structure of Instrumental Conditioning

S-R association somewhat simpler to demonstrate

T BP food

see more BP during the T than in its absence Rescorla has shown with devaluation experiments that even with complete devaluation, see some responding due to S-R association

for ex., devalue foodin the presence of the T, rat still barpresses (but won’t eat the food)

Page 15: Chapter 7 The Associative Structure of Instrumental Conditioning

In addition to the simple associations of 2 elements (i.e., S-R, S-O, R-O), can also have hierarchical associations

the Sd becomes an occasion setter that signals when the response will be followed by a reinforcer

S R OR nothing

so, the Sd signals the relationship between a response and its outcome

S [R O]

Hierarchical Associations

Page 16: Chapter 7 The Associative Structure of Instrumental Conditioning

Hierarchical Associations

Recall from Pavlovian conditioning that a CS is only powerful when it reliably predicts a US

When the CS provides no reliable information about the occurrence of the US, then conditioning is weak

The same idea has been applied to the learning of ahierarchical association

Page 17: Chapter 7 The Associative Structure of Instrumental Conditioning

S R OR nothing

Hierarchical Associations

In this situation, the S is informative about when the R will be followed by the O

S R OR O

However, in the second situation, the S is provides noinformation about when the R will be followed by the O

Page 18: Chapter 7 The Associative Structure of Instrumental Conditioning

Hierarchical Associations

Rescorla (1990) used this idea to obtain evidence for a hierarchical association

Training Test

S1 [R1 O1]S1 [R2 O2]

S2 [R1 O2]

S2 [R2 O1]But also,R1 O1

R2 O2

S1: R1 vs R2

S2: R1 vs R2

4 30-s presentations of bothdiscriminative stimuli, with both responses available

Which Sd is informative about theR-O relation???

Page 19: Chapter 7 The Associative Structure of Instrumental Conditioning

Results:

Hierarchical Associations

1

2

345

67

Meanresp/min

S2 - informative

Trials

S1 -not informative

Page 20: Chapter 7 The Associative Structure of Instrumental Conditioning

Theories of Reinforcement

1. Reinforcement as stimulus presentation

What identifies a reinforcer?

Thorndike

a stimulus that is satisfying the problem with this definition is that it is circular

Page 21: Chapter 7 The Associative Structure of Instrumental Conditioning

Theories of Reinforcement

Hull’s Drive Reduction Theory a biological need upsets the body’s homeostasis andinduces a drive state any stimulus that satisfies the biological need, restoreshomeostasis, and thus reduces the drive state serves as areinforcer the problem with this definition is that many reinforcers do not restore/maintain homeostasis incentive motivation (response elicited by reinforcer), curiosity, praise, criticism

Page 22: Chapter 7 The Associative Structure of Instrumental Conditioning

Theories of Reinforcement2. Reinforcement as behaviorA. The Premack Principle rather than talking about reinforcing stimuli, Premack focused on reinforcing responses so, instead of saying food is a reinforcing stimulus,Premack said eating is a reinforcing response the only difference between an operant responseand a reinforcer is the probability of occurrence avoided circularity by defining a reinforcer as a moreprobable behavior than the ‘operant’ behavior that is, high probability behaviors will reinforce lowprobability behaviors, but not the reverse

Page 23: Chapter 7 The Associative Structure of Instrumental Conditioning
Page 24: Chapter 7 The Associative Structure of Instrumental Conditioning

When a rat is water deprived (E1), it drinks more than it runs. Therefore, drinking reinforces running, but running does not reinforce drinking. When a rat is not waterdeprived (E2), it runs more than it drinks. Then running reinforces drinking, but drinking does not reinforce running

Page 25: Chapter 7 The Associative Structure of Instrumental Conditioning

Theories of Reinforcement

B. Behavioral Regulation Approaches expanded on Premack Principle took into account the animal’s repertoire of behaviorin a context this established the “bliss point”, the optimum distribution of responding for the subject operant contingencies shift the subject away from the bliss point; the subject behaves so as to approach the optimum distribution as closely as possible

Page 26: Chapter 7 The Associative Structure of Instrumental Conditioning

Time Studying