How do animals “know” when a schedule is on extinction?

Slides:



Advertisements
Similar presentations
Theories of Learning Chapter 4 – Theories of Conditioning
Advertisements

Learning Unit 5. Topics in Learning Unit Defining Learning Classical Conditioning Operant Conditioning Cognitive Learning.
Extinction of Learned Behavior Lesson 14. Extinction of Reinforced Behavior n Behavior changes relatively permanent l Modifiable if situation changes.
Chapter 9 – Extinction of Conditioned Behavior Outline –Effects of Extinction Procedures Decreased responding Increased variability in responding –Extinction.
PSY 402 Theories of Learning Chapter 9 – Motivation.
There’s never just one reinforcer Hmm…what to do?
Aversive Control of Behavior: Punishment & Avoidance Lesson 16.
Lecture 20: Extinction (Pavlovian & Instrumental) Learning, Psychology 5310 Spring, 2015 Professor Delamater.
How do animals “know” when a schedule is on extinction?
Extinction of Conditioned Behavior Effects of Extinction Extinction and Original Learning Paradoxical Effects in Extinction.
PSY 402 Theories of Learning Chapter 5 – The Role of Conditioning in Behavior.
Extinction of Conditioned Behavior Effects of Extinction Extinction and Original Learning Paradoxical Effects in Extinction.
PSY 402 Theories of Learning Chapter 4 – Theories of Conditioning.
Punishment.
PSY 402 Theories of Learning Chapter 9 – Motivation.
Last Day To Register  This is the last day to register for the November special election.  To register, go to: Rock the Vote website:
Negative Reinforcement
Classical Conditioning
Chapter 10 Aversive Control: Avoidance and Punishment.
Operant Conditioning Unit 4 - AoS 2 - Learning. Trial and Error Learning An organism’s attempts to learn or solve a problem by trying alternative possibilities.
Learning Prof. Tom Alloway. Definition of Learning l Change in behavior l Due to experience relevant to what is being learned l Relatively durable n Conditioning.
Operant Conditioning Unit 4 - AoS 2 - Learning. Trial and Error Learning An organism’s attempts to learn or solve a problem by trying alternative possibilities.
Learning Theories Learning To gain knowledge, understanding, or skill, by study, instruction, or experience.
4 th Edition Copyright 2004 Prentice Hall5-1 Learning Chapter 5.
Classical Conditioning
CHAPTER 4 Pavlovian Conditioning: Causal Factors.
Chapter 5 Learning. chapter 5 What is Learning? Occurs whenever experience or practice results in a relatively permanent change in behavior.
Operant Conditioning E.L. Thorndike and B.F. Skinner.
Psychology 2250 Last Class Characteristics of Habituation and Sensitization -time course -stimulus-specificity -effects of strong extraneous stimuli (dishabituation)
Learning Experiments and Concepts.  What is learning?
PRINCIPLES OF APPETITIVE CONDITIONING Chapter 6 1.
PSY402 Theories of Learning Chapter 6 (Cont.) Factors Affecting Appetitive Learning.
Operant conditioning (Skinner – 1938, 1956)
Factors Influencing Conditioning  CS and US Intensity, and Attention to the CS  Temporal relationship  Predictiveness  Preparedness  Redundancy 1.
Learning Chapter 5.
Knowledge acquired in this way.
Extinction of Conditioned Behavior Effects of Extinction  the rate of responding decreases  response variability increases  experiment by Neuringer,
Blocking The phenomenon of blocking tells us that what happens to one CS depends not only on its relationship to the US but also on the strength of other.
Unit 1 Review 1. To say that learning has taken place, we must observe a change in a subject’s behavior. What two requirements must this behavioral change.
Extinction of Conditioned Behavior Chapter 9 Effects of Extinction Extinction and Original Learning What is learned during Extinction.
PSY402 Theories of Learning Monday February 10, 2003.
Learning Principles & Applications 7-9% of AP Exam.
Chapter 6 LEARNING. Learning Learning – A process through which experience produces lasting change in behavior or mental processes. Behavioral Learning.
Becoming Extinct. Extinction of Conditioned behavior Extinction is a typical part of the learning process – Pavlovian or Instrumental contingency can.
Rescorla-Wagner Model  US-processing model  Can account for some Pavlovian Conditioning phenomena: acquisition blocking unblocking with an upshift conditioned.
More Extinction.
Mechanisms of Extinction
Chapter 5 Learning © 2013 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution.
Extinction of Conditioned Behavior
Factors Affecting Performance on Reinforcement Schedules
How do animals “know” when a schedule is on extinction?
PSY 402 Theories of Learning
Learning.
Module 20 Operant Conditioning.
PSY402 Theories of Learning
Case Study: The Little Albert Experiment
Extinction of Learned Behavior
PSY402 Theories of Learning
Operant Conditioning Unit 4 - AoS 2 - Learning.
Learning and Conditioning
Learning liudexiang.
UNIT 4 BRAIN, BEHAVIOUR & EXPERIENCE
Chapter 5 Learning.
PSY402 Theories of Learning
LEARNING.
Module 27 – Operant Conditioning 27
PSY 402 Theories of Learning
Part 1- Behaviorist Learning Theory
Agenda To Get: To Do: Guided notes Intro Unit 7: Learning
How do animals “know” when a schedule is on extinction?
Presentation transcript:

How do animals “know” when a schedule is on extinction?

Remember Operant conditioning extinction differs from classical conditioning extinction Responds decreases to near zero for both Operant conditioning: Transient increase Extinction induced aggression

Neuringer, Kornell & Olufs, 2001 2 groups of rats Examined response variability Experimental chamber: 2 response levers: one lever on opposite walls During reinforcement: 3 responses in row to obtain food pellet (in any combination) One group reinforced for variability of responses Got food only if sequence was different that what did on an earlier trial Second group: just any 3 responses Then both groups shifted to EXT condition: Predictions on rate of EXT in each group?

Results: Examined Last 4 sessions of reinforcement phase First 4 sessions of EXT Rats in variability group INCREASED variety of responses during initial extinction, but reduced responses overall Rats in control group showed increase in variability as well!

So: Why is this study important? Extinction: Decreased overall responses But INCREASED variability of those responses Animals tried something DIFFERENT Remember this is for operant conditioning Animal is in control of contingency Try something NEW in case the contingency changed But quit with no reinforcement

Emotional effects of Extinction Frustration: Frustrative non-reward energizes behavior! Aggression: Frustration aggression Will aggress towards model if model placed in cage with rat or pigeon during initial extinction Will bite or attack objects Will attack a real cohort Does this occur in humans?

What happens to learning during EXT? Extinction does NOT reverse effects of acquisition Reduces responding Emotional outcomes But does not erase learning! Four lines of research demonstrate this: Spontaneous recovery Renewal Reinstatement Reinforcer devaluation

Extinction and Spontaneous Recovery EXT produces decline in conditioned responding, but this decline dissipates with time That is, the response begins to come back Because there is no reinstatement of the contingency, this is called spontaneous recovery Spontaneous recovery = return of previously extinguished response Critical factor: introducing period of rest between end of extinction training and assessment of responding Behavior that is suppressed during extinction training recovers after a period of rest Original learning trumps extinction condition Better the response is originally learned, and the shorter the extinction training, the more likely spontaneous recovery

Renewal of Responding Renewal Recovery of excitatory responding to an extinguished stimulus Produced by shift away from the contextual cues that were present during extinction. Responding will recover from extinction if extinction trials occur in contextually different situation than original training

Renewal of Responding Bouton and King, 1983 Used conditioned suppression to study acquisition and extinction of conditioned fear in rats Rats first trained to press response lever for food Then paired CS with foot shock: Trained in 1 of 2 operant chambers Shock condition had distinctively different contextual cues Rats assigned to 1 of 3 groups for extinction phase: 2 groups: 20 EXT trials of CS no shock Group A: extinction in SAME context as original fear conditioning Group B: extinction in different context as original fear conditioning 3rd group: no extinction trials = control group

Renewal of Responding Results: Great suppression at start of EXT for both groups Then: following EXT in either distinct or similar context: series of test trials in ORIGINAL setting LEAST suppression for group that received training in different context Training in different context resulted in less generalization

Renewal of Responding Why? Renewal effect occurs because memory of extinction is specific to cues that were present during extinction trials Shift away from the context of extinction disrupts memory of extinction Appears that original learning of response-reinforcement is better remembered than sequence of extinction

Renewal of Responding Important applied implications! Even if a therapeutic intervention successfully results in extinction of a response, the original response may return if the person or animal encounters contexts that are highly similar to the original learning! Extinction must be well generalized to overcome effects of response-reinforcer contingency!

Reinstatement Reinstatement Recovery of responding to an extinguished stimulus produced by exposure to unconditioned stimulus Again- problematic for applied situations: Reduce behavior through EXT A pairing of the response and reinforcer occurs You just “reinstated” the behavior

Reinstatement Tends to be somewhat context specific Supports theory that conditioning occurs due to contextual stimuli S+: Response-reinforcer situation is often fairly context- specific EXT then, is also context specific and may easily be reinstated Context of conditioning PLUS any remaining excitement to CS/S+ could summate into a reinstatement of responding In a way, is a more context specific form of renewal

What happens to R-Sr relation and EXTINCTION? Does the organism retain knowledge of the reinforcer? Evidence suggests YES! IF conditioned behavior reflects an S-O responding or R-O association, devaluation of the reinforcer should produce decrement in responding similar to that found in EXT! What do the data say? S-O associations are not forgotten or lost during CC procedures Specificity of reinstatement behavior shows that animal must have knowledge of reinforcer

Ostlunk & Balleine, 2007 Rats trained to lever press for 2 reinforcers: 1 lever = food pellet 1 lever = sugar water Quick extinction procedure for BOTH Then gave 1 reinforcer for next lever press on either food lever or water lever If food given: Responding on that lever increased If water given: Responding on that lever increased NOT both or random lever: Rats remembered which lever that reinforcer was for!

Enhancing Extinction So is extinction just worthless? No, several things can be done to increase the effectiveness of extinction Procedures for enhancing extinction Carefully choose the number and spacing of extinction trials Reduce spontaneous recovery Reduce opportunities for renewal Use compounding extinction stimuli

Number and spacing of EXT trials Increase the number of extinction trials! Results in further decrease in responding Conduct extinction trials in massed trials, not spaced out No opportunity for reinstatement or recovery No other contextual pairings can occur

Reduce Spontaneous Recovery Repeat periods of extinction, rest and retest Successive cycles reduce the likelihood of spontaneous recovery Alternatively (and more often used in appetitive work): increase the interval between training and extinction Introduce cues associated with extinction

Reducing Renewal Conduct extinction trials in a variety of contextual settings Basically, generalization of extinction Make extinction a generalized response Present reminder cues that EXT is in effect Again, providing specific cues NOT to respond

Compounding Extinction Stimuli Present 2 stimuli at same time that both have undergone EXT Deepens the extinction cues for BOTH cues That 2 extinction cues increases the level of extinction suggests that extinction operates in an error-correction process (e.g., RW model) If organism responds and gets what expected- no change If organism responds and gets MORE than expected- more responding If organism responds and gets LESS than expected- EXT 2 cues provide better signaling of the outcome!

Compounding Extinction Stimuli VERY interesting prediction here for treatment of fear: If EXT cue is compounded with conditioned inhibitor cue during extinction training Conditioned inhibitor = signal for ABSCENCE of a US Conditioned inhibitor = safety signal indicating that the aversive US won’t happen IF safety signal compounded with fear stimulus during extinction: Absence of US fully predicted by safety signal Won’t be any error to encourage learning that the fear stimulus no longer ends in shock Safety signal should BLOCK extinction of the fear stimulus SHOULD be interference rather than facilitation of the EXT process

What IS learned in EXT? Inhibitory S-R associations Remember: learning involves S-O, R-O and S-R associations Extinction procedures appear to not change S-O and S-R associations Instead: changes in S-R associations occur Non reinforcement produces inhibitory S-R associations Nonreinforcement of a response in the presence of a specific stimulus produces an inhibitory S-R association that serves to suppress that response whenever that S is present What? Learn “don’t respond when presented with that S”

What IS learned in EXT? Inhibitory S-R associations This helps to explain frustration and aggression Expect that S will have a certain outcome During extinction, that outcome for S changes Omission of expected reward creates disappointment and frustration Nonreinforcement triggers aversive frustrative reaction which in turn discourages responding

Pardoxical Reward Effects Assume that decline in responding during EXT is due to frustative effects of an unexpected loss of reinforcement, Then would expect more rapid extinction following training that establishes GREATER expectations of reward. This is supported by data showing the Overtraining extinction effect Magnitude reinforcement extinction effect Partial reinforcement extinction effect (PREE)

Other Extinction Effects Magnitude reinforcement extinction effect Less persistence of instrumental behavior in extinction following training with a large reinforcer More persistance of responding with a small or moderate reinforcer. Effect is most prominent with continuous reinforcement.

Other Extinction Effects Overtraining extinction effect Less persistence of instrumental behavior in extinction following extensive training with reinforcement (overtraining) Faster extinction following moderate levels of reinforcement training. Again, effect most prominent with continuous reinforcement

Partial Reinforcement Extinction Effect: PREE Extinction occurs at different rates depending on the schedule: Continuous reinforcement: FAST extinction Partial reinforcement schedules: SLOWER extinction Variable schedules show slower extinction than fixed (rate or time) schedules. Partial Reinforcement Extinction Effect: PREE used to describe greater persistence in instrumental responding during extinction after partial (or intermittent) reinforcement training Faster extinction after continuous reinforcement training. Partial reinforcement schedules show RESISTANCE TO EXTINCTION

Why do we get these effects? Several theories to suggest PREE and similar effects: Discrimination hypothesis Frustration theory Sequential Theory Behavioral momentum

Discrimination and Frustration Discrimination hypothesis: Mowrer and Jones 1945 In order for subjects’ behavior to change during extinction, the subject must be able to discriminate the change in reinforcement contingencies With CRF: This is immediately noticeable With PRF: not immediately noticeable More discriminative on fixed schedules Less discriminative on variable schedules Evidence does not completely support this

Generalization Decrement Hypothesis Capaldi, 1966 Generalization decrement: Decreased responding observed in generalization test when test stimuli become less and less similar to training stimulus Responding during extinction is weak if the stimuli present during extinction are different from those during the reinforcement phase Responding during extinction is STRONG if the stimuli present during extinction are very similar to those during reinforcement phase.

Generalization Decrement Hypothesis Large generalization decrement when schedule moves from CRF to EXT Subject never experienced situation in which some of its responses are not reinforced Not been taught to keep responding in absence of a reinforcer Small generalization decrement when schedule moves from PRF to EXT Subject has experience in situation where some of its responses are not reinforced HAS been taught to keep responding in absence of a reinforcer

Sequential Theory Sequential theory: Memory of reward vs. non reward Cognitive theory Fast extinction after CRF Extinction occurs quickly because the instrumental response has NOT been conditioned to the memory of nonreward Slow extinction during PREE extinction is slowed after partial reinforcement because the instrumental response becomes conditioned to the memory of nonreward.

Behavioral Momentum Behavioral momentum: Degree to which behavior is susceptible to disruption Based on physics: a rolling object continues until something disrupts it Momentum of a physical object is the product of its mass and its speed a fast moving bullet and a slow moving train both have high momentum But a slow moving train has more mass Both the bullet and the train are hard to stop because of their momentum x their mass Behavior is similar: behavior has both speed and mass The better learned the response, the harder it is to stop The higher the rate of reinforcement, the harder it is to stop

Behavioral Momentum Suggests that PREE occurs because the animal has a high momentum of reinforcement and it is more difficult to stop this momentum Two major conclusions about behavioral momentum Behavioral momentum is directly related to the rate of reinforcement Behavioral momentum is not tied to response rate! What can disrupt behavioral momentum? Extra food between experimental sessions Drugs that alter value of reinforcement Introduce alternative sources of reinforcement Anything that devalues the S-O relationship!

Behavioral Momentum Timberlake and Lucas 1985: Ball bearing studies Rolled ball bearing across cage; rats had to let it go past to receive reinforcer Played with the ball bearing, slowing reinforcement During extinction (ballbearing but no food): played with ball bearings MORE Does suggest that animals show strong patterns of behavior that may interfere and thus slow the extinction process But not a complete explanation

Bottom Line Rate of extinction is affected by: Why frustration? Discriminability of the situation Previous experience with non-reinforced responding Possibly, memory of non-reinforced responding Why frustration? Not getting what expected

Bottom Line Because of contingency between response and reinforcer: Subject “controls” responding, and thus reward During EXT: this contingency is disrupted But it is the S-O relationship that is disrupted The S no longer predicts reward! That pisses the organism off! Subject becomes “frustrated” in that Tries other responses Increases magnitude of responding Increases intensity of responding Pushed into fear/flight/fight sequence: aggression is elicited Aggression interferes with responding S-O relation takes on new meaning IN THAT CONTEXT

To make EXT effective: Be sure to use: Lots of EXTINCTION trials Massed trials Many training, extinction, retest trials Alter the cues that predict extinction so they become contextual cues Then generalize those cues to all the situations that EXT should be in effect Remember the organism doesn’t forget the old S+:R-O situation, so if that reoccurs, the old response comes back!