Chapter 7 Operant Conditioning: Schedules and Theories Of Reinforcement

Now that we have discussed reinforcement . . .
It is time to discuss just HOW reinforcers can and should be delivered. In other words, there is more to consider than just WHAT the reinforcer should be!

Think about this! If you were going to reinforce your puppy for going to the bathroom outside, how would you do it? Would you give him a Liv-a-Snap every time? Some of the time? Would you keep doing it the same way or would you change your method as you go along?

What is a schedule of reinforcement? A schedule of reinforcement is the response requirement that must be met in order to obtain reinforcement. In other words, it is what you have to do to get the goodies!

Continuous vs. Intermittent Reinforcement
Continuous: A continuous reinforcement schedule (CRF) is one in which each specified response is reinforced.
Intermittent: An intermittent reinforcement schedule is one in which only some responses are reinforced.

Intermittent Schedules
When you want to reinforce based on a certain number of responses occurring (for example, doing a certain number of math problems correctly), you can use a ratio schedule.
When you want to reinforce the first response after a certain amount of time has passed (for example, when a teacher gives a midterm test), you can use an interval schedule.

Four Types of Intermittent Schedules
Ratio schedules: fixed ratio, variable ratio
Interval schedules: fixed interval, variable interval

Fixed Ratio Schedule
On a fixed ratio schedule, reinforcement is contingent upon a fixed, predictable number of responses.
Characteristic pattern: a high rate of response, with a short pause following each reinforcer.
Reading a chapter and then taking a break is an example.
A good strategy for “getting started” is to start with an easy task.

Fixed Ratio, continued
Higher ratio requirements result in longer post-reinforcement pauses. Example: the longer the chapter you read, the longer the study break!
Ratio strain: a disruption in responding due to an overly demanding response requirement.
Movement from a “dense/rich” schedule to a “lean” schedule should be done gradually.

Fixed Ratio: FR
Fixed Ratio is abbreviated “FR,” followed by a number showing how many responses must be made to get the reinforcer.
Ex. FR 5 (5 responses needed to get a reinforcer)
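The FR notation can be illustrated with a small sketch. The class name `FixedRatio` and its `respond` method are hypothetical names chosen for this example, not part of the original slides; the sketch simply counts responses and delivers a reinforcer on every Nth one.

```python
class FixedRatio:
    """Hypothetical FR schedule: every `ratio`-th response is reinforced."""

    def __init__(self, ratio):
        self.ratio = ratio
        self.count = 0  # responses made since the last reinforcer

    def respond(self):
        """Record one response; return True if it earns a reinforcer."""
        self.count += 1
        if self.count == self.ratio:
            self.count = 0  # start counting toward the next reinforcer
            return True
        return False


# FR 5: the 5th and 10th responses are reinforced.
fr5 = FixedRatio(5)
outcomes = [fr5.respond() for _ in range(10)]
print(outcomes)
```

With a ratio of 1, every response is reinforced, which is the continuous reinforcement (CRF) case described earlier.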

Variable Ratio Schedule
On a variable ratio schedule, reinforcement is contingent upon a varying, unpredictable number of responses.
Characteristic pattern: a high and steady rate of response, with little or no post-reinforcement pause.
Hunting, fishing, golfing, shooting hoops, and telemarketing are examples of behaviors on this type of schedule.

Other facts about Variable Ratio Schedules
Behaviors on this type of schedule tend to be very persistent. This includes unwanted behaviors like begging, gambling, and being in abusive relationships.
“Stretching the ratio” means starting out with a very dense, rich reinforcement schedule and gradually decreasing the amount of reinforcement.
The spouse, gambler, or child who is the “victim” must work harder and harder to get the reinforcer.

Variable Ratio: VR
Variable Ratio is abbreviated “VR,” followed by a number showing the average number of responses that must be made to get the reinforcer.
Ex. VR 50 (an average of 50 responses is needed to get a reinforcer, with the requirement varying between, say, 1 and 100; it could be the next try, or it could take 72!)
Gambling is the classic example!
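A VR schedule can be sketched the same way. The names below (`VariableRatio`, `respond`) are hypothetical, and drawing each requirement uniformly from 1 to 2N-1 is only one assumed way of getting an average of N; the slides do not specify a distribution.

```python
import random


class VariableRatio:
    """Hypothetical VR schedule: the response requirement varies around a mean."""

    def __init__(self, mean_ratio, rng=None):
        self.mean_ratio = mean_ratio
        self.rng = rng if rng is not None else random.Random()
        self.count = 0  # responses made since the last reinforcer
        self.required = self._draw()

    def _draw(self):
        # Assumption: uniform over 1..(2 * mean - 1), which averages to mean_ratio.
        return self.rng.randint(1, 2 * self.mean_ratio - 1)

    def respond(self):
        """Record one response; return True if it earns a reinforcer."""
        self.count += 1
        if self.count >= self.required:
            self.count = 0
            self.required = self._draw()  # the next requirement is unpredictable
            return True
        return False


# VR 50: over many responses, roughly one reinforcer per 50 responses.
vr50 = VariableRatio(50, rng=random.Random(0))
reinforcers = sum(vr50.respond() for _ in range(100_000))
print(100_000 / reinforcers)  # average responses per reinforcer, close to 50
```

Any single reinforcer might come after 1 response or after 99, which is what makes the schedule unpredictable from the responder's point of view.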

Fixed Interval Schedules
On a fixed interval schedule, reinforcement is contingent upon the first response after a fixed, predictable period of time.
Characteristic pattern: a “scallop” pattern produced by a post-reinforcement pause followed by a gradually increasing rate of response as the time interval draws to a close.
Glancing at your watch during class provides an example! Student study behavior provides another!

Fixed Interval: FI
Fixed Interval is abbreviated “FI,” followed by a number showing how much time must pass before the reinforcer is available.
Ex. FI 30-min (reinforcement is available for the first response after 30 minutes have passed)
Ex. Looking down the tracks for the train if it comes every 30 minutes
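Interval schedules depend on the clock rather than on a response count. In this sketch the class name `FixedInterval` is hypothetical, times are in minutes, and restarting the interval from the moment of the reinforced response is a modeling assumption.

```python
class FixedInterval:
    """Hypothetical FI schedule: first response after `interval` minutes is reinforced."""

    def __init__(self, interval):
        self.interval = interval
        self.available_at = interval  # time at which the reinforcer becomes available

    def respond(self, now):
        """Record a response at time `now`; return True if it earns a reinforcer."""
        if now >= self.available_at:
            # Assumption: the next interval starts at the reinforced response.
            self.available_at = now + self.interval
            return True
        return False


# FI 30-min: responses before the 30 minutes are up earn nothing.
fi30 = FixedInterval(30)
response_times = [5, 20, 31, 40, 59, 62]
results = [fi30.respond(t) for t in response_times]
print(results)  # [False, False, True, False, False, True]
```

Responding early gains nothing, which fits the pause after each reinforcer and the rising response rate as the interval draws to a close.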

Variable Interval Schedule
On a variable interval schedule, reinforcement is contingent upon the first response after a varying, unpredictable period of time.
Characteristic pattern: a moderate, steady rate of response with little or no post-reinforcement pause.
Looking down the street for the bus if you are waiting and have no idea how often it comes provides an example!

Variable Interval: VI
Variable Interval is abbreviated “VI,” followed by a number showing the average time interval that must pass before the reinforcer is available.
Ex. VI 30-min (reinforcement is available for the first response after an average of 30 minutes has passed)
Ex. Hilary’s boyfriend, Michael, gets out of school and turns on his phone some time between 3:00 and 3:30. The “reward” of his answering his phone puts her calling behavior on a VI schedule, so she calls every few minutes until he answers.
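A VI schedule can be sketched by letting the waiting time vary around its average. The class name `VariableInterval` is hypothetical, and the uniform draw between 0 and twice the mean is an assumption made only so that the average wait equals the stated value.

```python
import random


class VariableInterval:
    """Hypothetical VI schedule: the wait before a reinforcer is available varies."""

    def __init__(self, mean_interval, rng=None):
        self.mean_interval = mean_interval
        self.rng = rng if rng is not None else random.Random()
        self.available_at = self._draw()  # time at which the reinforcer becomes available

    def _draw(self):
        # Assumption: uniform over 0..(2 * mean), which averages to mean_interval.
        return self.rng.uniform(0, 2 * self.mean_interval)

    def respond(self, now):
        """Record a response at time `now`; return True if it earns a reinforcer."""
        if now >= self.available_at:
            self.available_at = now + self._draw()  # unpredictable next wait
            return True
        return False


# VI 30-min, with one response every minute for 30,000 minutes.
vi30 = VariableInterval(30, rng=random.Random(0))
earned = sum(vi30.respond(minute) for minute in range(1, 30_001))
print(earned)  # roughly one reinforcer per 30 minutes
```

Because the responder cannot tell when the reinforcer will become available, checking at a steady, moderate rate is a sensible strategy.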

Noncontingent Reinforcement What happens when reinforcement occurs randomly, regardless of a person or animal’s behavior? Weird Stuff! Like what?

Superstitious Behavior
Examples include:
Rituals of gamblers, baseball players, etc.
Elevator-button-pushing behavior
Noncontingent reinforcement can sometimes be used for GOOD purposes (not just weird or useless behaviors!)

Good, useful examples
Giving noncontingent attention to children:
Some bad behaviors, like tantrums, are used to try to get attention from caregivers.
These behaviors can be diminished by giving attention noncontingently.
Children need both contingent AND noncontingent attention to grow up healthy and happy!

Theories of Reinforcement In the effort to answer the question, “What makes reinforcers work?”, theorists have developed some . . . . . THEORIES!!!!!

So here’s the first one: If you are hungry and go looking for food and eat some, you will feel more comfortable because the hunger has been reduced. The desire to have the uncomfortable “hunger drive” reduced motivates you to seek out and eat the food.

Drive Reduction Theory
So this is one thing that can make reinforcers work: an event is reinforcing to the extent that it is associated with a reduction in some type of physiological drive.
This type of approach may explain some behaviors (like sex) but not others (like playing video games).

Incentive Motivation
Sometimes, we just do things because they are FUN! When this happens, we can say that motivation is coming from some property of the reinforcer itself rather than from some kind of internal drive.
Examples include playing games and sports, putting spices on food, etc.

We can also think about how we use reinforcers. We can use a behavior we love (a high-probability behavior) to reinforce a behavior we don’t like to do very much (a low-probability behavior).
This is the Premack principle, sometimes called “Grandma’s Principle”: “Bobby, you can read those comic books once you have mowed the grass!”
To use this theory, you have to know the “relative probability” of each behavior.
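The “relative probability” comparison can be written out as a small check. The function name `can_reinforce` and the minutes in the example are hypothetical; time spent on each activity during free time stands in for its probability.

```python
def can_reinforce(baseline_minutes, reinforcer, target):
    """Return True if `reinforcer` is the more probable behavior than `target`.

    `baseline_minutes` maps each behavior to the time freely spent on it,
    used here as a stand-in for the behavior's probability.
    """
    return baseline_minutes[reinforcer] > baseline_minutes[target]


# Hypothetical free-time observations for Bobby (minutes per day).
bobby = {"reading comic books": 90, "mowing the grass": 5}

print(can_reinforce(bobby, "reading comic books", "mowing the grass"))  # True
print(can_reinforce(bobby, "mowing the grass", "reading comic books"))  # False
```

The check needs a baseline for both behaviors, which is why the relative probability of each must be known.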

What do you do if you only know the “probability” for one? You can use the next theory!
Let’s say you know that a person likes to play video games. You can use playing video games as a reinforcer IF you:
Restrict access to playing
Make sure the person is getting to play less frequently than they prefer to

This is the “Response Deprivation Hypothesis”
Any behavior can be used as a reinforcer if you can restrict access to it and keep it below the person or animal’s preferred level of doing it.
Think of some examples!
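The condition in the hypothesis comes down to comparing the allowed level of a behavior with its preferred level. The function name and the hours below are hypothetical, chosen only to illustrate that comparison.

```python
def can_serve_as_reinforcer(preferred_hours, allowed_hours):
    """Return True if access is kept below the preferred (free-access) level."""
    return allowed_hours < preferred_hours


# Hypothetical case: someone who would freely play video games 2 hours a day.
print(can_serve_as_reinforcer(preferred_hours=2.0, allowed_hours=1.0))  # True
print(can_serve_as_reinforcer(preferred_hours=2.0, allowed_hours=3.0))  # False
```

Only the baseline for the one restricted behavior is needed, which is what sets this approach apart from comparing two behaviors.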

Behavioral Bliss Point
The Response Deprivation Hypothesis assumes that there is an optimal or best level of behavior that a person or animal tries to maintain.
If you could do ANYTHING at all you wanted to do, how would you distribute your time? This would tell you your “behavioral bliss point” for each activity or behavior.

Behavioral Bliss Point cont’d
An organism that has free access to alternative activities will distribute its behavior in such a way as to maximize overall reinforcement.
In other words, if you can do anything you want, you will spend time on each thing you do in a way that will give you the most pleasure.

But this is real life!
This means that you can almost never achieve your “behavioral bliss point.” So you have to compromise by coming as close as you can, given your circumstances.
No wonder we hate to leave our childhoods behind!
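“Coming as close as you can” can be given a rough numerical form. The sketch below assumes two activities, a constraint that ties them together in a fixed proportion, and straight-line (Euclidean) distance as the measure of closeness; the function name and all of the numbers are hypothetical and do not come from the slides.

```python
def closest_to_bliss(bliss, per_unit):
    """Find the point on a proportional constraint that is nearest the bliss point.

    `bliss` is the preferred (activity_a, activity_b) allocation in hours.
    `per_unit` is the (activity_a, activity_b) hours the constraint ties together,
    e.g. (1, 1) for "one hour of games per hour of homework".
    Assumption: closeness is measured as straight-line distance.
    """
    dot = bliss[0] * per_unit[0] + bliss[1] * per_unit[1]
    length_sq = per_unit[0] ** 2 + per_unit[1] ** 2
    scale = dot / length_sq  # projection of the bliss point onto the constraint line
    return (scale * per_unit[0], scale * per_unit[1])


# Bliss point: 3 hours of video games and 1 hour of homework per day.
# Constraint: every hour of games must be matched by an hour of homework.
compromise = closest_to_bliss(bliss=(3, 1), per_unit=(1, 1))
print(compromise)  # (2.0, 2.0)
```

In this example the compromise means less gaming and more homework than the person would freely choose.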