PSY402 Theories of Learning

Chapter 10 – Stimulus Control of Behavior

The Role of Environmental Stimuli
In operant conditioning, the stimulus becomes associated with the reinforcer or punisher. In Pavlovian terms, the reward or punishment is the UCS and the stimulus signaling it is the CS. The resulting CR then motivates operant behavior, so operant responding can be used as a measure of the strength of a CR.

Definitions of Terms
Stimulus control – environmental stimuli signal the opportunity for reward or punishment. Generalization – responding in the same way to similar stimuli. Discrimination – responding to some stimuli but not to others.

Generalization Gradient
Degrees of generalization occur. In some situations, the same response occurs to similar stimuli. In other situations, the amount of response varies along with the similarity. Generalization gradient – a graph showing how the strength of response changes with similarity. Steep gradients mean narrow response (stimuli must be very similar).
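
One common way to compare gradients is to express responding to each test stimulus as a fraction of responding to the trained stimulus. The sketch below does that for two gradients. The function name `relative_gradient` and the response counts are made up for illustration and are not data from the chapter; the stimulus values are assumed to be wavelengths in nm.

```python
def relative_gradient(responses, s_plus):
    """Express responding to each test stimulus as a fraction of responding to the S+."""
    baseline = responses[s_plus]
    if baseline <= 0:
        raise ValueError("response to the S+ must be positive")
    return {stimulus: count / baseline for stimulus, count in responses.items()}


# Hypothetical response counts keyed by wavelength in nm (illustration only).
steep = {530: 5, 540: 20, 550: 100, 560: 20, 570: 5}
flat = {530: 95, 540: 98, 550: 100, 560: 97, 570: 96}

for label, counts in (("steep", steep), ("flat", flat)):
    print(label, relative_gradient(counts, s_plus=550))
```

In the steep case responding falls off quickly as the test stimulus moves away from the S+ (narrow generalization); in the flat case all stimuli are treated almost alike.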

Kinds of Gradients
Excitatory conditioning (S+) – a response to a stimulus is learned through CS-UCS pairing. Excitatory gradient – the S+ is varied and the CR is measured. Inhibitory conditioning (S-) – a CS signals the absence of the UCS and thus inhibits the CR. Inhibitory gradient – the S- is varied and the CR is measured.

Wavelengths of Light
Figure: visible color spectrum. Labeled hues include blue-green, green, yellow-green, yellow, yellow-orange, orange-yellow, orange, orange-red and red.

Gradients Using Four Wavelengths
580 nm = yellow; 550 nm = green

Gradient Using Tone & Shock
The less the tone sounds like the original stimulus, the less fear is shown (measured as galvanic skin response, GSR).

Discrimination
The shape of the gradient can be changed by training. When birds are exposed to two different tones (S+ or S-), they must discriminate between them. Responding is less generalized because the competing tone produces no reward. The gradient becomes steeper and narrower at the top.

With no discrimination training, subjects respond to every tone. With two or more tones, requiring discrimination, responding concentrates on the rewarded tone; how sharply depends on the S- tone used during training.

The sharpness of the generalization gradient depends on the type of training

Flat Gradients
A flat gradient means all stimuli are being responded to as if they were the same. Responding with a gradient to a tone occurred only when the tone signaled reward during training.

Tone vs No-Tone During Training
Experimental subjects were trained to attend to the tone, whereas control subjects were not. (Figure label: flat gradient.)

Generalization of Inhibition
Inhibition example: fear of dating. A good experience with one person leads to less fear of dating the next person. Inhibition gradients are similar to excitatory gradients – the more the stimulus varies, the less inhibition.

Excitatory and Inhibitory Generalization with Line Tilt Stimuli

Inhibitory Gradients – Line Tilt

Explanation
Lashley-Wade theory – people and animals generalize because they are unable to discriminate; they cannot tell the difference between the stimuli. A contrast is needed during training to enable discrimination. Discrimination training leads to steeper generalization gradients (see Fig 10.3). Perceptual experience matters (Fig 10.5).

Ducks Raised in Monochromatic Light Cannot Discriminate Based on Color
Figure panels: ducks raised in monochromatic light; ducks raised in white light (with all wavelengths).

Discrimination Learning
In survival terms, it is important to recognize when reinforcement is not available so that responding can be withheld. Discriminative stimuli: SD – reinforcement is available (S+); SΔ – reinforcement is unavailable (S-). Conditioned stimuli always produce a response. Discriminative stimuli signal the opportunity to respond.

Two-Choice Discrimination Tasks
The discriminative stimuli are on the same dimension, for example a red vs a green light (dimension = hue). They need not be presented simultaneously. Two-choice discrimination includes one SD and one SΔ. Other tasks can use multiple SD or multiple SΔ.

Categorization and Discrimination
Animals respond to stimuli in ways that suggest they form categories. Pigeons can classify a variety of items, including new images not seen before. The items to be learned as members of a category are SD and signal the opportunity for food. The items that are not members of the category are SΔ and signal that pecking will not be rewarded.

Test Slides – Tree Category

Test Slides – Water Category

Test Slides -- Margaret Category

More Complex Tasks
Later, pigeons were asked to place images into four categories by pressing one of four buttons (rewarded by food if correct). They are “naming” the object shown. Pigeons do equally well with natural and manufactured objects (cars, chairs). Transfer to new stimuli is worse but above chance.

Apparatus (Part 1)

Examples of positive images

Three Phases
Prediscrimination phase – subjects begin by responding equally to both stimuli. Discrimination phase – with training, response to SD increases and response to SΔ declines. Final phase – a shift back to non-differential reinforcement shows that the behavior was caused by reinforcement.

As Reinforcement Changes, so Does Responding

Conditional Discrimination
Availability of reinforcement depends on the condition of a stimulus. The stimulus does not always signal the same thing, which makes the discrimination more difficult to learn. Nissen’s chimpanzees: the stimuli were large and small squares, either white or black. The SD was the large square when the squares were white but the small square when they were black.
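
The rule in Nissen's task can be written as a small function, which makes it clear that neither size nor color alone predicts reinforcement. This is only an illustrative sketch; the function name `reinforcement_available` and the string labels are assumptions, not part of the original study description.

```python
def reinforcement_available(size: str, color: str) -> bool:
    """Return True when choosing a square of this size is reinforced, given its color."""
    if color == "white":
        return size == "large"   # large is the SD when the squares are white
    if color == "black":
        return size == "small"   # small is the SD when the squares are black
    raise ValueError(f"unexpected color: {color!r}")


for color in ("white", "black"):
    for size in ("large", "small"):
        print(color, size, reinforcement_available(size, color))
```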

Behavioral Contrast
Behavioral contrast – increased responding to the SD and decreased responding to the SΔ under differential reinforcement. Contrast also occurs with changes in the duration of reinforcement (VI-10 to VI-3). Local contrast – may be emotional and fades. Sustained contrast – related to the differential reinforcement.

Occasion Setting
A conditioned stimulus (CS1) can create the conditions for operant responding to a second conditioned stimulus (CS2). Occasion setting – the ability of one stimulus to enhance the response to another stimulus. The facilitating stimulus does not produce a CR by itself, so this is not higher-order conditioning.

SD as an Occasion Setter
A Pavlovian occasion-setter can increase operant responding. Example: a meal elicits a CR (craving for a cigarette). Requesting a cigarette after a meal is an operant behavior caused by the CR. Conditional occasion-setting: a second stimulus modifies the meaning of the first discriminative stimulus.

How it Works

Conclusions
An occasion-setter can increase operant responding. A discriminative stimulus (SD) can increase response to a CS (Pavlovian conditioning). This implies interchangeability of Pavlovian occasion-setters and discriminative stimuli.

Occasion Setters Increase Responding

Peak Shift
When both inhibitory and excitatory stimuli are conditioned, inhibition changes the shape of the gradient. Peak shift – maximum responding occurs to a stimulus not previously trained as the S+. The peak shifts away from the S- stimulus. The amount of response is the difference between excitatory and inhibitory conditioning.

Hypothetical Excitatory and Inhibitory gradients
Spence subtracts the inhibition (shown on the next slide) from this excitation.

Hypothetical Excitatory and Inhibitory Gradients
Overall predicted response is less because this amount of inhibition is subtracted from it.

Peak Shift
When the inhibitory stimulus (S-) is to the right, the peak shifts left.
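
Spence's subtraction can be sketched numerically. The example below assumes bell-shaped (Gaussian) excitatory and inhibitory gradients and picks the stimulus with the largest net response. The Gaussian shape, the heights and width, the stimulus values (treated as wavelengths in nm) and the function names are all illustrative assumptions, not values from the chapter.

```python
import math


def gradient(x, center, height, width):
    """Hypothetical bell-shaped generalization gradient centered on a trained stimulus."""
    return height * math.exp(-((x - center) ** 2) / (2 * width ** 2))


def net_response(x, s_plus, s_minus, exc_height=1.0, inh_height=0.6, width=15.0):
    """Predicted responding: excitation around the S+ minus inhibition around the S-."""
    excitation = gradient(x, s_plus, exc_height, width)
    inhibition = gradient(x, s_minus, inh_height, width)
    return excitation - inhibition


def peak_stimulus(s_plus, s_minus, stimuli, inh_height=0.6):
    """Return the test stimulus with the highest predicted net response."""
    return max(stimuli, key=lambda x: net_response(x, s_plus, s_minus, inh_height=inh_height))


stimuli = range(500, 601)  # assumed test wavelengths in nm
print("peak with S- at 560:", peak_stimulus(550, 560, stimuli))
print("peak with no inhibition:", peak_stimulus(550, 560, stimuli, inh_height=0.0))
```

With the S- placed to the right of the S+ (560 vs 550), the predicted peak falls below 550, that is, on the side away from the S-. With the inhibitory gradient switched off, the peak stays at the S+.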

Errorless Discrimination Learning
When an S is gradually introduced the pigeon learns to inhibit response without making mistakes. Three fading steps are involved: Brief introduction of S for 5 sec-30 sec Slowly change color of S from dark to green Slowly increase duration of S from 30 sec to 3 minutes

Errorless Discrimination Training

Implications of Errorless Training
Errorless learning seems to condition a response to the SD without inhibition to the SΔ. This means that errorless learning is not aversive. As a result, no peak shift occurs. Errorless learning is harder to establish with some stimuli than others (e.g., colors but not lines).

Application of Errorless Training
Examples with humans: Preschool children recognizing shapes using a fading technique. Oral reading. Dorry & Zeaman taught mentally handicapped children to identify vocabulary words (pictures faded out). Not all training works – problems with transfer and with reversed consequences.

Is Learning Relational?
Are animals learning the relationships between stimuli rather than an absolute response? Transposition occurs when the stimuli are changed: the animal still responds to the brighter of two lights or the louder of two tones. Different results support both views of learning: Hull-Spence (absolute) and Köhler (relational).

Absolute vs Relational View

Predictive Value of SD

Mackintosh’s Attentional View
Stimuli with multiple dimensions arouse the relevant dimension analyzer. This depends on the salience and intensity of the dimension. The predictive value of the dimension determines arousal. Discrimination learning depends on predictiveness.

8.17 Examples of computer stimuli presented to pigeons by Cook (Part 1)

Less popout with conjoined features.

8.18 “Same” and “different” displays used in the experiment by Wasserman et al

Continuity Theory
Hull and Spence suggest that excitation and inhibition gradually increase with trials: excitation to the SD, inhibition to the SΔ. Non-continuity theory suggests that a hypothesis is formed and tested, so learning occurs rapidly once attention is on the right dimension. There is support for both theories.
