Learning Chapter 9
Classical Conditioning
Classical Conditioning Ivan Pavlov Digestion Learning Noticed correlation between salivation and a stimulus Signalization
Stimulus Response “Stimulus” simply means the thing that starts it while “response” means the thing that ends it. A stimulus elicits and a response is elicited. Unconditioned stimulus: a thing that can already elicit a response. Unconditioned Response: a thing that is already elicited by a stimulus. Conditioned Stimulus: a new stimulus delivered at the same time as the old stimulus. Conditioned Response: the elicited response due to the conditioned stimulus
Acquisition- the process of learning the response (dog salivates at bell) Extinction- response is forgotten because stimulus is random Spontaneous Recovery- reappearance of the conditioned response due to implementation of the conditioned response Full Circle
Generalization Same conditioned response to similar stimuli Discrimination is the opposite of generalization Those trucks are nice!
Higher-Order Conditioning When you pair an unconditioned stimulus with a conditioned stimulus Bell and ball
Operant Conditioning
Edward L Thorndike Suggested that learning depends less on reflexes and more on feedback form behavior Cats and puzzle boxes Law of Effect
Operant Conditioning: Skinner Skinner Box Father of Operant Conditioning
Skinner Box Response is reinforced Response- running to eat Reinforced- with food Shaping (training) to obtain the target behavior
Consequence of behavior that increases the likelihood that a response will occur Reinforcement
Reinforcement Positive is always more productive Positive Negative Think of it as adding something in order to increase a response Child gets an allowance for doing chores Think of it as taking something negative away in order to increase a response Wife is nagging at her husband… he takes out the trash and the nagging stops Positive is always more productive
Reinforcement Schedule Continuous Intermittent Reinforced every time Reinforced some of the time There are four types of intermittent reinforcement schedules Reinforcement Schedule Intermittent Fixed Ratio Fixed Interval Variable Ratio Variable Interval Continuous
Fixed Reinforcement Schedule Fixed Ratio Fixed Interval A reinforcer is given after a specified number of correct responses Ex. Jobs that pay based on units delivered It is the best schedule for learning a new behavior The first response is rewarded only after a specified amount of time has elapsed Paychecks work on this schedule - every two weeks I get one
Variable Reinforcement Schedule Variable Ratio Variable Interval Reinforcers are distributed based on the average number of correct behaviors On average, one out of every three tables I wait on give me a good tip. Reinforcers will be distributed after a varying amount of time Mail With a random schedule, there is no correlation between the animal's behavior and the consequence. This is how Fate works N
Consequence of a behavior that decreases the likelihood that a response will occur Punishment
Punishment Positive Negative When a negative consequence is applied in response to an undesired behavior Spanking When something desired is taken away as a consequence of a certain behavior Frequency in which an employee is late for work decreases as a result of losing the right to listen to music
Classical vs. Operant Classical: does not produce new behavior but does explain some learning Operant: produces new behavior and can explain almost all complex behaviors in animals