learning. associative and cognitive learning classical conditioning: learning to link two stimuli in...

Learning

Associative and Cognitive Learning

Classical conditioning: learning to link two stimuli in a

way that helps us anticipate an

event to which we have a reaction

Operant conditioning:

changing behavior

choices in response to

consequences

Cognitive learning: acquiring new behaviors and information

through observation and information, rather than by

direct experience

Associative Learning

Child associates his “response” (behavior) with consequences. Child learns to repeat behaviors (saying “please”) which were

followed by desirable results (cookie). Child learns to avoid behaviors (yelling “gimme!”) which were

followed by undesirable results (scolding or loss of dessert).

Associative Learning: Operant Conditioning

Cognitive LearningCognitive learning refers to acquiring new behaviors and information mentally, rather than by direct experience.Cognitive learning occurs: 1.by observing events and the behavior of others. 2.by using language to acquire information about events experienced by others.

Adaptation to the Environment

Learning—any process through which experience at one time can alter an individual’s behavior at a future time

Behaviorism

The attempt to understand observable activity in terms of observable stimuli and observable responses

John B. Watson (1913)B. F. Skinner (1938)

Pavlov’s Dogs

Digestive reflexes and salivation

Psychic secretion

Neutral Stimulus—Bell

Does not normally elicit a response or reflex action by itself a bell ringing a color a furry object

Unconditioned Stimulus—Food

Always elicits a reflex action: an unconditioned response food blast of air noise

Unconditioned Response —Salivation

A response to an unconditioned stimulus—naturally occurring Salivation at smell of food Eye blinks at blast of air Startle reaction in babies

Conditioned Stimulus—Bell

The stimulus that was originally neutral becomes conditioned after it has been paired with the unconditioned stimulus

Will eventually elicit the unconditioned response by itself

Conditioned Response

The original unconditioned response becomes conditioned after it has been elicited by the neutral stimulus

Classical Conditioning Phenomenon

ExtinctionSpontaneous recoveryGeneralizationDiscrimination training

15

AcquisitionWhat gets “acquired”? The association between a neutral stimulus (NS) and an unconditioned stimulus (US). How can we tell that acquisition has occurred? The UR now gets triggered by a CS (drooling now gets triggered by a bell).

Timing For the association to be acquired, the neutral stimulus (NS) needs to repeatedly appear before the unconditioned stimulus (US)…about a half-second before, in most cases. The bell must come right before the food.

Acquisition refers to the initial stage of learning/conditioning.

John B. Watson and Little Albert

Conditioned emotional responses

GeneralizationExtinction

Classical Conditioning and Drug Use

Regular use may produce “placebo response” where user associates sight, smell, taste with drug effect

Cognitive Aspects of Classical Conditioning

Reliable and unreliable signals Actively process information Robert Rescorla

Taste Aversions

Early Operant Conditioning

E. L. Thorndike (1898)Puzzle boxes and cats

Scratch at bars

Push at ceiling

Dig at floorSituation:stimuliinside ofpuzzle box

Howl

Etc.

Etc.

Press lever

First Trialin Box

Scratch at bars

Push at ceiling

Dig at floorSituation:stimuliinside ofpuzzle box

Howl

Etc.

Etc.

Press lever

After ManyTrials in Box

Thorndikes Puzzle Box

B. F. Skinner’s Operant Conditioning

Did not like Thorndike’s term “satisfying state of affairs”

Interested in emitted behaviorsOperant—voluntary response that

acts on the environment to produce consequences

Reinforcement—the occurrence of a stimulus following a response that increases the likelihood of the response being repeated

Operant Conditioning

Reinforcers

Primary—a stimulus that is inherently reinforcing for a species (biological necessities)

Conditioned—a stimulus that has acquired reinforcing value by being associated with a primary reinforcer

Reinforcement Reinforcement:

feedback from the environment that makes a behavior more likely to be done again. Positive +

reinforcement: the reward is adding something desirable

Negative - reinforcement: the reward is ending something unpleasant

For the meerkat, this warm light is desirable.

This meerkat has just completed a task out in the cold

Behavior shaped by accidental reinforcement

Punishment

Presentation of a stimulus following a behavior that acts to decrease the likelihood that the behavior will be repeated

Don’t think about the beach

Don’t think about the waves, the sand, the towels and sunscreen, the sailboats and surfboards. Don’t think about the beach. Are you obeying the instruction? Would you obey this instruction more if you were punished for thinking about the beach?

Problem: Punishing focuses on what NOT to do, which does not guide people to a desired behavior.Even if undesirable behaviors do stop, another problem behavior may emerge that serves the same purpose, especially if no replacement behaviors are taught and reinforced.

Lesson: In order to teach desired behavior, reinforce what’s right more often than punishing what’s wrong.

Problems with Punishment

Does not teach or promote alternative, acceptable behavior

May produce undesirable results such as hostility, passivity, fear

Likely to be temporary May model aggression

Operant Conditioning Terms

ShapingExtinctionSpontaneous RecoveryDiscriminative StimuliSchedules of Reinforcement

School: long before tablet computers, B.F. Skinner proposed machines that would reinforce students for correct responses, allowing students to improve at different rates and work on different learning goals.

Sports: athletes improve most in the shaping approach in which they are reinforced for performance that comes closer and closer to the target skill (e.g., hitting pitches that are progressively faster).

Work: some companies make pay a function of performance or company profit rather than seniority; they target more specific behaviors to reinforce.

Applications of Operant Conditioning

Discrimination

Discrimination: the ability to become more and more specific in what situations trigger a response.

Shaping can increase discrimination, if reinforcement only comes for certain discriminative stimuli.

For examples, dogs, rats, and even spiders can be trained to search for very specific smells, from drugs to explosives.

Pigeons, seals, and manatees have been trained to respond to specific shapes, colors, and categories.

Bomb-finding rat

Manatee that selects shapes

Reinforcement Schedules

Continuous—every correct response is reinforced; good way to get a low frequency behavior to occur

Partial—only some correct responses are reinforced; good way to make a behavior resistant to extinction

Partial Schedules—Ratio

Ratio schedules are based on number of responses emitted

Fixed ratio (FR)—a reinforcer is delivered after a certain (fixed) number of correct responses

Variable ratio (VR)—a reinforcer is delivered after an average number of responses, but varies from trial to trial

Partial Schedules—Interval

Interval schedules are based on time. Fixed interval (FI)—reinforcer is delivered for

the first response after a fixed period of time has elapsed

Variable interval (VI)—reinforcer is delivered for the first response after an average time has elapsed, differs between trials

Why we might work for money If we repeatedly introduce a neutral

stimulus before a reinforcer, this stimulus acquires the power to be used as a reinforcer.

A primary reinforcer is a stimulus that meets a basic need or otherwise is intrinsically desirable, such as food, sex, fun, attention, or power.

A secondary/conditioned reinforcer is a stimulus, such as a rectangle of paper with numbers on it (money) which has become associated with a primary reinforcer (money buys food, builds power).

How often should we reinforce?

Do we need to give a reward every single time? Or is that even best?

B.F. Skinner experimented with the effects of giving reinforcements in different patterns or “schedules” to determine what worked best to establish and maintain a target behavior.

In continuous reinforcement (giving a reward after the target every single time), the subject acquires the desired behavior quickly.

In partial/intermittent reinforcement (giving rewards part of the time), the target behavior takes longer to be acquired/established but persists longer without reward.

Cognitive Aspects of Operant Conditioning

Cognitive map—term for a mental representation of the layout of a familiar environment

Latent learning—learning that occurs in the absence of reinforcement, but is not demonstrated until a reinforcer is available

Learned helplessness—phenomenon where exposure to inescapable and uncontrollable aversive events produces passive behavior

Learned Helplessness can be produced when negative events are perceived as uncontrollable

Biological Predispositions

Animal training issues

Instinctive drift—naturally occurring behaviors that interfere with operant responses - raccoons rubbed coins

Classical Conditioning vs. Operant Conditioning

Observation Learning

Observation Modeling Imitation Albert Bandura and the Bobo doll study

Antisocial Effects of Observational Learning

What happens when we learn from models who demonstrate antisocial behavior, actions that are harmful to individuals and society?

Children who witness violence in their homes, but are not physically harmed themselves, may hate violence but still may become violent more often than the average child.

Perhaps this is a result of “the Bobo doll effect”? Under stress, we do what has been modeled for us.

Have a wonderful day. Love you. :) 😃

Media Models of ViolenceDo we learn antisocial behavior such as violence from indirect observations of others in the media?

Research shows that viewing media violence leads to increased aggression (fights) and reduced prosocial behavior (such as helping an injured person).This violence-viewing effect might be explained by imitation, and also by desensitization toward pain in others.

Do what I say, not what I do—

This will teach you to hit your brother—

Why do you do that, you know you get in trouble for it—

Famous last words???

learning. associative and cognitive learning classical conditioning: learning to link two stimuli in...

Documents

response behavior

unconditioned stimulus

placebo response

neutral stimulus ns

consequencescognitive

reaction operant conditioning

new behaviors

behavior choices