Operant Conditioning Skinner, positive & negative reinforcement, response cost, punishment and schedules of reinforcement.

Presentation transcript:

Three-phase model of operant conditioning
Skinner used the term “operant conditioning”; Thorndike called it “instrumental learning”.
An operant is a response that “operates” or acts upon the environment to produce some kind of effect. Eg. in Thorndike’s experiment the operants were the cat biting on the bar and clawing the box.
Operant conditioning is based on Thorndike’s law of effect: an organism will tend to repeat a behaviour (operant) that has a desirable consequence (cat gets fish) or that enables it to avoid an undesirable consequence (detention). An organism will also tend not to repeat a behaviour that has an undesirable consequence (speeding fine = speed less).

Components of operant conditioning: S.R.C
S = Stimulus that comes before the operant response
R = Operant Response to the stimulus
C = Consequence of the operant response
Example: Thorndike’s cat puzzle box experiment
S = box
R = sequence of movements to open the door (operating on the environment)
C = escaping the box and getting fish

Stimulus (S) Response (R) Consequence (C)

Skinner box

Reinforcement and Punishment
Skinner’s and Thorndike’s studies provide evidence for the concept of reinforcement, because learning through operant conditioning occurs as a result of the consequences of behaviour.
Reinforcement and punishment are the main aspects of operant conditioning.

Reinforcement
Reinforcement occurs when a stimulus (object or event) strengthens or increases the likelihood or frequency of the response that it follows.
A reinforcer is any stimulus (object or event) that increases the likelihood of the response that it follows. The reinforcer is the stimulus that allows reinforcement to occur.

Positive and Negative reinforcement
Positive Reinforcement (adds something) +
Presenting a stimulus (positive reinforcer) that strengthens or increases the likelihood of a desired response by providing a satisfying consequence. Eg. being well behaved in class to get a gold star next to your name; cleaning your room to get pocket money.
Negative Reinforcement (takes something away) -
Removing or avoiding an unpleasant stimulus, which strengthens or increases the likelihood of a desired response. Eg. leaving home early one day and finding no traffic on the road may encourage you to leave home early again (response) in the future to avoid heavy traffic (removal of unpleasant stimulus).

Schedules of reinforcement
Schedules of reinforcement are the programs that determine how often reinforcement is given in relation to the correct response.
Continuous reinforcement = reinforcement is provided immediately after every correct / desired response
Partial reinforcement = reinforcement is provided for some correct / desired responses but not all of them

4 types of Partial reinforcement
Fixed-ratio schedule
A reinforcer is given after a set (fixed) number of correct responses (ratio) are made. Eg. a ratio of 1:5 means one reinforcer for every five correct responses. Eg. factory workers may be paid a certain amount for every 5 garments that they make.
Variable-ratio schedule
A reinforcer is given after an unpredictable (variable) number of correct responses (ratio) are made. Eg. one reinforcer for a mean of 5 correct responses, but delivered after a different number of responses each time (after 1, 7, 11 etc).
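The two ratio schedules can be sketched as a small simulation. This is only an illustration, not part of the original slides: the function names, the seed parameter and the choice to draw each variable gap uniformly between 1 and 2 x mean - 1 (so the gaps average out to the mean) are all assumptions made for the example.

```python
import random


def fixed_ratio(n_responses, ratio):
    """Return the (1-based) response numbers that earn a reinforcer
    when every `ratio`-th correct response is reinforced."""
    return [i for i in range(1, n_responses + 1) if i % ratio == 0]


def variable_ratio(n_responses, mean_ratio, seed=0):
    """Return the response numbers that earn a reinforcer when the gap
    between reinforcers is unpredictable but averages `mean_ratio`.

    Assumption for this sketch: each gap is drawn uniformly from
    1 .. 2 * mean_ratio - 1, which has a mean of `mean_ratio`.
    """
    rng = random.Random(seed)
    reinforced = []
    next_at = rng.randint(1, 2 * mean_ratio - 1)
    for i in range(1, n_responses + 1):
        if i == next_at:
            reinforced.append(i)
            next_at = i + rng.randint(1, 2 * mean_ratio - 1)
    return reinforced


# One reinforcer for every five correct responses (ratio 1:5).
print(fixed_ratio(20, 5))        # [5, 10, 15, 20]
# Unpredictable gaps that average five responses.
print(variable_ratio(20, 5))
```

On the fixed schedule the learner can predict exactly which response pays off; on the variable schedule it cannot, which is the feature the slides later link to gambling.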

Partial Reinforcement
Fixed-interval schedule
A reinforcer is given after a specific fixed period of time has elapsed (interval) since the previous reinforcer, provided the correct response has been made. Eg. workers who are given monthly reviews may work harder in the weeks leading up to their review than in the days after it.
Variable-interval schedule
A reinforcer is given after irregular (variable) periods of time have passed (interval), provided the correct response has been made. The intervals have a mean length, but each one is of unpredictable length.
On both interval schedules, responses made before the interval has passed will not be reinforced, even if they are correct.
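The interval schedules can be sketched in the same way. This is a minimal illustration under stated assumptions: the function name, the use of plain numbers for response times and the particular interval lengths are invented for the example and do not come from the slides.

```python
from itertools import cycle, repeat


def interval_schedule(response_times, intervals):
    """Return the times of the responses that are reinforced.

    A response is reinforced only if the current interval has elapsed
    since the previous reinforcer (or since time 0 for the first one).
    `intervals` supplies the required wait before each reinforcer and
    should be endless (e.g. built with `repeat` or `cycle`).
    """
    intervals = iter(intervals)
    wait = next(intervals)
    last_reinforcer = 0
    reinforced = []
    for t in sorted(response_times):
        if t - last_reinforcer >= wait:
            reinforced.append(t)
            last_reinforcer = t
            wait = next(intervals)
        # Responses made before the interval has passed earn nothing.
    return reinforced


times = [2, 9, 10, 11, 19, 20, 25, 30, 40]

# Fixed interval: always 10 time units.
print(interval_schedule(times, repeat(10)))         # [10, 20, 30, 40]
# Variable interval: 4, 16, 10, ... (mean of 10 time units).
print(interval_schedule(times, cycle([4, 16, 10]))) # [9, 25, 40]
```

In both runs the early responses at times 2, 11 and 19 go unreinforced even though they are correct, because the interval had not yet passed.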

Punishment
Punishment is the delivery of an unpleasant consequence following a response, or the removal of a pleasant consequence following a response.
Eg. delivery of an unpleasant consequence following a response (smacking a child after they misbehave)
Eg. removal of a pleasant consequence following a response (losing money through a fine)
Punishment is different to negative reinforcement. Negative reinforcement (NR) is the removal of an unpleasant stimulus, which increases the likelihood of a response recurring. Punishment imposes an unpleasant consequence (or removes a pleasant one) and decreases or weakens the likelihood of the response occurring. Also, punishment is ‘given’ or ‘applied’, whereas in negative reinforcement the unpleasant stimulus is avoided or prevented.

Positive and Negative punishment
Positive punishment +
The presentation of an unpleasant stimulus that decreases or weakens the likelihood of the response occurring again. Eg. having arrived late to sport training, you are made to run 5 laps to decrease the likelihood that you will be late again.
Negative punishment (response cost) -
The removal of a pleasant or valued stimulus that decreases or weakens the likelihood of the response occurring again. Eg. having your mobile phone taken away for using it in class.
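The four kinds of consequence differ on only two questions: is a stimulus added or removed, and does the response become more or less likely? A lookup table makes that explicit. This is an illustrative sketch; the function name and the string labels are assumptions made for the example.

```python
def classify_consequence(stimulus_change, effect_on_response):
    """Name the type of consequence.

    stimulus_change:    "added" or "removed"
    effect_on_response: "increases" or "decreases"
    """
    table = {
        ("added", "increases"): "positive reinforcement",
        ("removed", "increases"): "negative reinforcement",
        ("added", "decreases"): "positive punishment",
        ("removed", "decreases"): "negative punishment",
    }
    return table[(stimulus_change, effect_on_response)]


# Gold star added, good behaviour increases.
print(classify_consequence("added", "increases"))    # positive reinforcement
# Heavy traffic avoided, leaving early increases.
print(classify_consequence("removed", "increases"))  # negative reinforcement
# Laps added, lateness decreases.
print(classify_consequence("added", "decreases"))    # positive punishment
# Phone removed, phone use in class decreases.
print(classify_consequence("removed", "decreases"))  # negative punishment
```

Reading the table row by row shows why negative reinforcement is not punishment: both reinforcement rows increase the response, and both punishment rows decrease it.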

Factors that influence the effectiveness of reinforcement and punishment: OAT
O = Order of presentation. To be effective, the reinforcement or punishment must be presented after the response, never before.
A = Appropriateness. The reinforcement or punishment must be appropriate for the behaviour or response that has occurred. It must also suit the characteristics of the individual.
T = Timing. Reinforcement and punishment should be given immediately after the response has occurred.

Key processes in Operant Conditioning
Acquisition
The establishment of a response through reinforcement. The types of behaviour learned through operant conditioning are more complex than the simple responses of classical conditioning.
Extinction
Gradual decrease in the strength of a conditioned (learned) response following consistent non-reinforcement of that response. Eg. when Skinner’s rats stopped receiving food pellets, their conditioned response (pressing the lever) was extinguished. Extinction is slower after partial reinforcement than after continuous reinforcement. (Eg. gamblers are less likely to stop as the reward is unpredictable.)

Spontaneous recovery
After apparent extinction and a rest period, the organism again exhibits the response in the absence of reinforcement. The response is weaker and does not last long.
Stimulus generalisation
Occurs when the correct response is made to another stimulus that is similar to the original one. Eg. the sound of a car backfiring may cause athletes to generalise this sound to a starter’s pistol and begin running.

Stimulus discrimination
The organism makes the correct response to a stimulus and is reinforced, but does not make the response to other, similar stimuli. Eg. a sniffer dog will only bark at certain smells (drugs and specific plant matter), not at every smell.

Applications of Operant Conditioning
Applications of behaviour modification include shaping and token economies.
Shaping
Also known as the ‘method of successive approximations’. It means giving reinforcement for any response that successively approximates or moves towards the desired response or behaviour. Eg. shaping may be used when teaching and encouraging young children to swim.
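Shaping by successive approximations can be sketched as a criterion that tightens each time a response meets it. This is a simplified illustration, not a description of any particular training procedure: scoring each response as a whole number for how close it is to the desired behaviour, and raising the criterion by a fixed step, are assumptions made for the example.

```python
def shape(responses, target, start=1, step=1):
    """Reinforce successive approximations of a target behaviour.

    Each response is a score for how close it comes to the desired
    behaviour. A response is reinforced if it meets the current
    criterion; the criterion then moves one step closer to `target`.
    Returns a log of (response, criterion, reinforced) tuples.
    """
    criterion = start
    log = []
    for r in responses:
        reinforced = r >= criterion
        log.append((r, criterion, reinforced))
        if reinforced and criterion < target:
            criterion = min(target, criterion + step)
    return log


# Hypothetical scores, e.g. 1 = enters the water, 2 = floats, 3 = swims.
for response, criterion, reinforced in shape([1, 0, 2, 2, 3], target=3):
    print(response, criterion, reinforced)
# 1 1 True
# 0 2 False
# 2 2 True
# 2 3 False
# 3 3 True
```

Note the fourth response: a score of 2 was reinforced earlier but no longer earns a reinforcer once the criterion has moved on to 3.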

Token Economies
Settings in which an individual who exhibits desired behaviour receives tokens (reinforcers). The tokens are collected and can be exchanged for other reinforcers in the form of actual tangible rewards. Eg. in prison, an inmate’s good behaviour may earn him a token which could be cashed in for special rewards such as cigarettes and privileges. Token economies can easily fail, especially if people feel they are being manipulated.

Comparison of Classical and Operant conditioning See handout for similarities and differences