PSY402 Theories of Learning

Slides:



Advertisements
Similar presentations
Transposition: Spence’s Behavioral Solution Transposition seems to support the cognitive view that organisms learn by discovering how things are related,
Advertisements

PSY402 Theories of Learning Chapter 10 – Stimulus Control of Behavior.
Lecture 20: Extinction (Pavlovian & Instrumental) Learning, Psychology 5310 Spring, 2015 Professor Delamater.
Lecture 18&19: Stimulus Control (Pavlovian & Instrumental) Learning, Psychology 5310 Spring, 2015 Professor Delamater.
Developing Stimulus Control. Peak Shift Phenomena where the peak of the generalization curve shifts AWAY from the S- – Means that the most responding.
PSY 402 Theories of Learning Chapter 8 – Stimulus Control How Stimuli Guide Instrumental Action.
Spence’s Theory of Discrimination and Generalization in an animated graph.
PSY402 Theories of Learning
Stimulus Control of Behaviour. Stimulus Control Differential responding and stimulus discrimination Complex environment “Signal” from the “noise” What.
PSY 402 Theories of Learning
PSY 402 Theories of Learning Chapter 9 – Motivation.
Stimulus Control Chapter 17.
Stimulus Control.
Stimulus Control. Stimulus Control of Behavior Having stimulus control means that the probability of the behavior varies depending upon the stimuli present.
Generalization, Discrimination, and Stimulus Control
Stimulus Control of Operant Behavior Discrimination Generalization Generalization Gradients Peak Shift Concepts Overview of stimulus control of operant.
Chapter 3 Space. Three Kinds of Space Space as format: size, scale, and presentation. Space as the relationships among objects and the areas surrounding.
CHAPTER 4 Pavlovian Conditioning: Causal Factors.
Psychology 2250 Last Class Characteristics of Habituation and Sensitization -time course -stimulus-specificity -effects of strong extraneous stimuli (dishabituation)
Psychology of Learning EXP4404
Discriminated Operants: Stimulus Control Discrimination in the Vernacular The Nature of Discriminated Operants Signal Detection: Breast Self-Examination.
STIMULUS CONTROL OF BEHAVIOR Chapter 10. Stimulus Control of Behavior  Generalization Responding in the same manner to similar stimuli.  Discrimination.
PSY402 Theories of Learning Chapter 6 – Appetitive Conditioning.
1 ABA 635 Concept Formation Caldwell College Applied Behavior Analysis Dr. Ken Reeve.
Learning Experiments and Concepts.  What is learning?
Pavlovian Conditioning Basic Principles Thomas G. Bowers, Ph.D. Penn State Harrisburg.
Spontaneous Recovery A Skinnerian interpretation: By Jack Michael.
Discrimination & Complex Stimulus Control Chs12 & 13.
Extinction of Conditioned Behavior Effects of Extinction  the rate of responding decreases  response variability increases  experiment by Neuringer,
Paradoxical Effects of Reward Overtraining extinction effect: more training leads to faster extinction Reinforcement magnitude effect: Big rewards lead.
The Associative Structure of Instrumental Conditioning Simple, Binary Associations  S-R association.
Extinction of Conditioned Behavior Chapter 9 Effects of Extinction Extinction and Original Learning What is learned during Extinction.
PSY 402 Theories of Learning Chapter 8 – Stimulus Control How Stimuli Guide Instrumental Action.
Attention During Discrimination Learning If a bird reinforced (S+) for responding to a red circle but not reinforced (S-) for responding to a blue circle.
Stimulus Control. Stimulus Control of Behavior Having stimulus control means that the probability of the behavior varies depending upon the stimuli present.
Basic Learning Processes Robert C. Kennedy, PhD University of Central Florida
PSY 402 Theories of Learning Chapter 3 – Nuts and Bolts of Conditioning (Mechanisms of Classical Conditioning)
Dr. Steven I. Dworkin Extinction and Stimulus Control Chapter 8.
Learning Factors in Stimulus Control. Learning Factors Why does stimulus generalization occur? – CS transfers to other stimuli with similar physical properties.
Stimulus Control of Behavior
PSY402 Theories of Learning
PSY402 Theories of Learning
Context Cues and Conditional Relations
Discrimination learning: Introduction
Conditional learning Charlotte Bonardi
Classical Conditioning Operant Conditioning Learning by Observation
Operant Conditioning – Chapter 8
Experimental Psychology PSY 433
PSY 402 Theories of Learning
PSY402 Theories of Learning
Conditioning: ways in which we learn based upon an association between two events by repeated exposure Classic and Operant.
PSY402 Theories of Learning
COMPLEX LEARNING TASKS (Part B)
Experimental Psychology PSY 433
Classical Conditioning and prediction
PSY402 Theories of Learning
Ch. 7: Principles of Learning
PSY402 Theories of Learning
PSY402 Theories of Learning
Stimulus Control.
PSY 402 Theories of Learning
PSY402 Theories of Learning
Experimental Psychology PSY 433
PSY 402 Theories of Learning Chapter 7 – Behavior & Its Consequences
PSY402 Theories of Learning
Chapter 7: Learning.
Learning.
(Do Now) Journal What is psychophysics? How does it connect sensation with perception? What is an absolute threshold? What are some implications of Signal.
Agenda To Get: To Do: Guided notes Intro Unit 7: Learning
Errorless Learning and the Feature Positive Effect
Presentation transcript:

PSY402 Theories of Learning Chapter 10 – Stimulus Control of Behavior

The Role of Environmental Stimuli In operant conditioning, the stimulus becomes associated with the reinforcer or punishment. Reward or punishment is the UCS. The stimulus signaling reward or punishment is the CS. The CR then motivates operant behavior. Operant responding can be used as a measure of the strength of a CR.

Definitions of Terms Stimulus control -- Environmental stimuli signal the opportunity for reward or punishment. Generalization – responding in the same way to similar stimuli. Discrimination – responding to some stimuli but not to others.

Generalization Gradient Degrees of generalization occur. In some situations, the same response occurs to similar stimuli. In other situations, the amount of response varies along with the similarity. Generalization gradient – a graph showing how the strength of response changes with similarity. Steep gradients mean narrow response (stimuli must be very similar).

Kinds of Gradients Excitatory conditioning (S+) – a CS-UCS response to a stimulus is learned. Excitatory gradient – the S+ is varied and the CR is measured. Inhibitory conditioning (S-) – a CS signals absence of the UCS and thus inhibits the CR. Inhibitory gradient – the S- is varied and the CR is measured.

Wavelengths of Light

Visible Color Spectrum

yellow yellow-orange yellow-green orange-yellow green orange-red bouton-fig-08-07-0.jpg blue-green orange red

Gradients Using Four Wavelengths 580 = yellow 550 = green

Gradient Using Tone & Shock The less the tone sounds like the original stimulus, the less fear (measured in galvanic skin response, GSR)

Discrimination The shape of the gradient can be changed by training. When birds are exposed to two different tones (S+ or S-), they must discriminate between them. Responding is less generalized because the competing tone produces no reward. The shape of the gradient becomes steeper and more narrow at the top.

With no discrimination, subjects Respond to every tone. With two or more tones, requiring discrimination, only the rewarded tone elicits responses, depending on the S- tone used during training.

The sharpness of the generalization gradient depends on the type of training bouton-fig-08-08-0.jpg

Flat Gradients A flat gradient means all stimuli are being responded to as if they were the same. Responding with a gradient to a tone occurred only when the tone signaled reward during training.

Tone vs No-Tone During Training Flat gradient Experimental subjects were trained to attend to the tone whereas control subjects were not. tone

Generalization of Inhibition Inhibition example: fear of dating. A good experience with one person leads to less fear of dating the next person. Inhibition gradients are similar to excitatory gradients – the more the stimulus varies, the less inhibition.

Excitatory and Inhibitory Generalization with Line Tilt Stimuli bouton-fig-08-09-0.jpg

Inhibitory Gradients – Line Tilt

Explanation Lashley-Wade theory – people and animals generalize because they are unable to discriminate. Can’t tell the difference between stimuli A contrast is needed during training to enable discrimination. Discrimination training leads to steeper generalization gradients (see Fig 10.3). Perceptual experience matters (Fig 10.5).

Ducks Raised in Monochromatic Light Cannot Discriminate Based on Color Ducks in monochromatic light Ducks in white light (with all wavelengths)

Discrimination Learning In survival terms, it is important to recognize when reinforcement is not available so that responding can be withheld. Discriminative stimulus: SD – reinforcement is available (S+) SD – reinforcement is unavailable (S-) Conditioned stimuli always produce a response. Discriminative stimuli signal the opportunity to respond.

Two-Choice Discrimination Tasks The discriminative stimuli are on the same dimension: Red vs green light. Dimension = hue. Need not be presented simultaneously. Two-choice discrimination includes one SD and one SD . Other tasks can use multiple SD or multiple SD.

Categorization and Discrimination Animals respond to stimuli in ways that suggest they form categories. Pigeons can classify a variety of items, including new images not seen before. The items to be learned as members of a category are SD and signal opportunity for food. The items that are not members of the category are SD and signal that pecking will not be rewarded.

Test Slides – Tree Category bouton-fig-08-02-1.jpg

Test Slides – Water Category bouton-fig-08-02-2.jpg

Test Slides -- Margaret Category bouton-fig-08-02-3.jpg

More Complex Tasks Later pigeons were asked to place images into four categories by pressing one of four buttons (rewarded by food if correct). They are “naming” the object shown. Pigeons do equally well with natural and manufactured objects (cars, chairs). Transfer to new stimuli is worse but above chance.

Apparatus (Part 1) bouton-fig-08-03-1.jpg

Examples of positive images bouton-fig-08-03-2.jpg

Examples of positive images bouton-fig-08-03-3.jpg

Three Phases Subjects begin by responding equally to both stimuli – prediscrimination phase. Discrimination phase -- with training, response to SD increases and response to SD declines. Shift back to non-differential reinforcement to show that behavior was caused by reinforcement.

As Reinforcement Changes, so Does Responding

Conditional Discrimination Availability of reinforcement depends on the condition of a stimulus. The stimulus does not always signal the same thing. More difficult to learn. Nissen’s chimpanzees: Large, small squares, white or black. SD = large when white but small when black.

Behavioral Contrast Behavioral contrast – the increased responding to the differential stimulus, decreased response to SD Contrast also occurs with changes in the duration of reinforcement. VI-10 to VI-3 Local contrast – may be emotional, fades Sustained contrast – related to the differential reinforcement.

Occasion Setting A conditioned stimulus (CS1) can create the conditions for operant responding to a second conditioned stimulus (CS2). Occasion setting – ability of one stimulus to enhance the response to another stimulus. The facilitating stimulus does not produce a CR by itself – so this is not higher order conditioning.

SD as an Occasion Setter A Pavlovian occasion-setter can increase operant responding. Example: A meal elicits CR craving for cigarette. Requesting a cigarette after a meal – an operant behavior caused by CR. Conditional occasion-setting: Second stimulus modifies meaning of first discriminative stimulus.

How it Works

Conclusions An occasion-setter can increase operant responding. A discriminative stimulus (SD) can increase response to a CS (Pavlovian conditioning). This implies interchangeability of Pavlovian occasion-setters and discriminative stimuli.

Occasion Setters Increase Responding

Peak Shift When both inhibitory and excitatory stimuli are conditioned, inhibition changes the shape of the gradient. Peak shift – maximum responding occurs to a stimulus not previously trained as the S+. The peak shifts away from the S- stimulus. The amount of response is the difference between inhibitory and excitatory conditioning.

Hypothetical Excitatory and Inhibitory gradients Spence subtracts the inhibition on the next slide from this excitation bouton-fig-08-10-1.jpg

Hypothetical Excitatory and Inhibitory Gradients Overall predicted response is less because this amount of inhibition is subtracted from it. bouton-fig-08-10-2.jpg

Peak Shift When the inhibitory stimulus S- is to the right, the peak shifts left bouton-fig-08-11-0.jpg

Errorless Discrimination Learning When an S is gradually introduced the pigeon learns to inhibit response without making mistakes. Three fading steps are involved: Brief introduction of S for 5 sec-30 sec Slowly change color of S from dark to green Slowly increase duration of S from 30 sec to 3 minutes

Errorless Discrimination Training

Implications of Errorless Training Errorless learning seems to condition response to SD without inhibition to S. This means that errorless learning is not aversive. As a result, no peak shift occurs. Errorless learning is harder to condition to some stimuli than others (e.g., colors but not lines).

Application of Errorless Training Examples with humans: Preschool children recognizing shapes using a fading technique. Oral reading. Dorry & Zeaman taught mentally handicapped children to identify vocabulary words (pictures faded out). Not all training works – problems with transfer and with reversed consequences.

Is Learning Relational? Are animals learning the relationships between stimuli rather than an absolute response? Transposition occurs when stimuli are changed: The brighter of two lights, louder of two tones is responded to. Different results support both views of learning: Hull-Spence & Kohler.

Absolute vs Relational View

Predictive Value of SD

Mackintosh’s Attentional View Stimuli with multiple dimensions arouse the relevant dimension analyzer. This depends on the salience and intensity of the dimension. The predictive value of the dimension determines arousal. Discrimination learning depends on predictiveness.

8.17 Examples of computer stimuli presented to pigeons by Cook (Part 1) bouton-fig-08-17-1.jpg

8.17 Examples of computer stimuli presented to pigeons by Cook (Part 2) bouton-fig-08-17-2.jpg

8.17 Examples of computer stimuli presented to pigeons by Cook (Part 3) Less popout with conjoined features. bouton-fig-08-17-3.jpg

8.18 “Same” and “different” displays used in the experiment by Wasserman et al bouton-fig-08-18-0.jpg

Continuity Theory Hull-Spence suggest that excitation and inhibition gradually increase with trials. Excitation to SD, inhibition to S. Non-continuity theory suggests that a hypothesis is formed & tested. Learning occurs rapidly with attention to the right dimension. There is support for both theories.