Presentation is loading. Please wait.

Presentation is loading. Please wait.

NSCI 5702 PRINCIPLES OF TRAINING. 1.What is training? 2.Why is it useful to train animals? 3.How do animals learn? 4.Training techniques used with animals.

Similar presentations


Presentation on theme: "NSCI 5702 PRINCIPLES OF TRAINING. 1.What is training? 2.Why is it useful to train animals? 3.How do animals learn? 4.Training techniques used with animals."— Presentation transcript:

1 NSCI 5702 PRINCIPLES OF TRAINING

2 1.What is training? 2.Why is it useful to train animals? 3.How do animals learn? 4.Training techniques used with animals LECTURE OUTLINE

3 “The shaping of an animal so that it behaves in a way that humans desire.” UFAW (1992) WHAT IS TRAINING?

4  Husbandry and health purposes  Safety of handler  Treatment of problem behaviour  Human assistance  Entertainment/Education  Enrichment WHY IS IT USEFUL TO TRAIN ANIMALS?

5 1.Classical Conditioning 2.Desensitisation 3.Counter Conditioning 4.Operate Conditioning I.Shaping II.Positive Reinforcement III.Negative Reinforcement IV.Combinations of positive and negative reinforcement 5.Flooding 6.Punishment To understanding how to train animals we must first have an understanding of how they learn – Learning Theory TRAINING TECHNIQUES USED WITH ANIMALS

6 HOW ANIMALS LEARN

7  There are a number of different forms of learning. We will discuss the following four: 1.Sensitisation 2.Habituation 3.Imprinting 4.Associate Learning  Classical Conditioning  Operant Conditioning HOW ANIMALS LEARN

8 “The increasing of a response to a repeated stimulus” (Broom and Fraser: Domestic Animal Behaviour and Welfare, 2007)  The animal learns to respond to a stimulus  Adaptive for survival.  E.g. Gazelles reacting to the sound of a twig breaking (signals approaching predator?)  E.g. A rat that has just experiences an aversive stimulus, such as a bright light will immediately afterwards be extra sensitive to other cues, such as noises or lights, that it would not normally respond to. SENSITISATION

9  A decrement in response that is produced by gradual exposure to a stimulus that elicits the response.  Commonly used in training.  E.g. Using a tape recording of a particular sound which a dog is fearful of. Tape is played very softly at first and only gradually increased in volumes at increments designed to elicit no response. DESENSITISATION

10  The animal learns not to respond to irrelevant stimuli.  This decline in response is specific to a given stimulus.  Animals will not habituate to relevant stimuli e.g. those associated with predators, food or mates.  Advantageous in that it saves energy that would be wasted on repeated response to trivial stimulus.  E.g. Zoo animal habituate to the presence of visitors.  E.g. Sheep habituate to the sound of passing traffic. HABITUATION

11  Phase-sensitive learning that is rapid and apparently independent of the consequences of behaviour  Phase sensitive learning = learning occurring at a particular age or a particular life stage.  E.g. Chicks hatch with an innate tendency to approach and follow their mother.  They have already imprinted on her vocalisations.  After hatching (24-36hrs) they imprint on her visual appearance.  Other young animals imprint on olfactory cues from their mothers.  Also has an impact upon the animals future choice of a sexual partner  http://en.wikipedia.org/wiki/File:Anas_platyrhynchos_- Boston_Harbor,_Massachusetts,_USA-_parent_and_chicks-8.ogv http://en.wikipedia.org/wiki/File:Anas_platyrhynchos_- Boston_Harbor,_Massachusetts,_USA-_parent_and_chicks-8.ogv IMPRINTING

12

13  There are 2 types of associative learning: 1.Classical (Pavlovian) Conditioning 2.Operant (Instrumental) Conditioning ASSOCIATIVE LEARNING

14 CLASSICAL (PAVLOVIAN) CONDITIONING

15  When an animal learns to associate a conditioned stimulus (bell ringing) with an unconditioned stimulus (food) and eventually elicits a conditioned response (salivation)  Definitions:  Primary or Unconditioned Stimulus (US) = Stimuli that animals react to without training  Secondary or Conditioned Stimulus (CS) = Stimuli that have been associated with a primary (unconditioned) Stimulus CLASSICAL (PAVLOVIAN) CONDITIONING

16  When an animal learns to associate a conditioned stimulus (bell ringing) with an unconditioned stimulus (food) and eventually elicits a conditioned response (salivation) CLASSICAL (PAVLOVIAN) CONDITIONING

17

18 Examples:  Knock on the door and bark – people  Keys and run to you – leaving/car  Tin opening and cat meowing – food  Hay basket and squeaking GP’s food  http://www.youtube.com/watch?v=WfZfMIHwSkU http://www.youtube.com/watch?v=WfZfMIHwSkU CLASSICAL (PAVLOVIAN) CONDITIONING

19 Application  In training to be used prior to a reinforcer  E.g. a Bridge  Clicker  Whistle CLASSICAL (PAVLOVIAN) CONDITIONING

20  If the food is no longer presented with the bell, causing the dog to salivate less in response to the bell CLASSICAL CONDITIONING: EXTINCTION

21  Thorndike (1898) put hungry cats into a ‘puzzle box’ with a lever mechanism that opened a door which lead to a food reward. OPERANT CONDITIONING

22  Thorndike concluded that it was merely a process of trail and error.  The box invoked a series of trial and error voluntary actions. The cat learnt to press the lever to escape (a rewarding experience).  “a response that is followed by a reward is more likely to recur whereas one that is followed by an unpleasant experience is less likely to occur again? (laws of effect)  Learning is the result of associations forming between stimuli and responses. Associations are weakened or strengthened by the nature and frequency of stimulus-response (S-R pairings)  The animal learns to associate its own behaviour with a particular outcome. If the outcome is rewarding e.g. access to food, the animal learns to repeat the behaviour that resulted in food access previously. OPERANT CONDITIONING

23  Operant conditioning is “the type of learning in which the probability of a behaviour recurring is increased or decreased by the consequences that follows.  This includes positive/negative reinforcement and positive/negative punishment.  Forms an association between a behaviour (voluntary) and a consequence. OPERANT CONDITIONING

24 Definitions:  Anything that increases a behaviour = Reinforcer  Anything that decreased a behaviour = Punisher Consequences: 1.Something good can START or be presented = behaviour increases (Positive Reinforcement) 2.Something bad can END or be taken away = behaviour increases (Negative Reinforcement) 3.Something bad can START or be presented = behaviour decreases (Positive Punishment) 4.Something good can end or be taken away = behaviour decreases (Negative Punishment) REINFORCEMENT AND PUNISHMENT

25

26  Examples 1.Positive Reinforcement (R+)  Adding something good to increase behaviour  Food 2.Negative Reinforcement (R-)  Removing something bad to increase behaviour  Elephant training, horse reins 3.Positive Punishment (P+)  Adding something bad to decrease a behaviour  Shock collar, physical punishment 4.Negative Punishment (P-)  Removing something good to decrease a behaviour  Time out. REINFORCEMENT AND PUNISHMENT

27  Definitions are based on their actual effect on the behaviour in question.  They must reduced or strengthen the behaviour (to be defined as a punishment or reinforcer).  Pleasures meant as rewards but that do not strengthen the behaviour are indulgences not reinforcement.  Aversives meant as a behaviour weakener but which do not weaken behaviour are abuses, not punishment. REINFORCEMENT AND PUNISHMENT

28  Rewards mean difference things to different animals (e.g. food, toys, affection, other animals).  You must first establish what motivates the animal.  This could be:  Food  Social contact  Toys  Praise  clicker REINFORCERS AND MOTIVATION TO LEARN

29  Primary Reinforcer:  (e.g. food)  A stimulus or event that is inherently rewarding to the animal  Secondary Reinforcer:  (e.g. clicker)  Initially meaningless stimuli or event becomes inherently rewarding after repeated association with primary reinforcer http://www.youtube.com/watch? v=hgDHWLyztCI&feature=related REINFORCERS

30  Timing is everything! The reward must occur within 1-2 seconds of the behaviour.  The frequency of the rewards is also important. During training by rewarding for every correct behaviour then gradually switching to intermittent variable rates.  Don’t phase out rewards too quickly.  Help to make training session enjoyable and to strengthen the human animal bond.  http://www.youtube.com/watch?v= bDZCyObMfkA http://www.youtube.com/watch?v= bDZCyObMfkA REWARDS

31  What is punishment?  “an aversive action or unpleasant sensation (not necessarily physical) applied either during or within on e second of a particular behaviour that reduces the likelihood of that behaviour being repeated in the future.”  Differs from negative reinforcement (where the aversive stimulus is applied before the behavioural response)  E.g. Hitting a Zebra a few seconds after they have bitten you. PUNISHMENT

32 1.Pain, fear, anxiety, learned helplessness and stress which are all welfare concerns. 2.Pain, fear, anxiety learned helplessness and stress also interfere with the animals ability to learn and focus. 3.It can intensify the occurrence and severity of behaviour problems. 4.Difficulty in getting the timing right causing association with the wrong things. 5.It becomes meaningless (desensitisation). 6.Breakdown in the trainer-animal relationship. WHY PUNISHMENT MIGHT NOT WORK

33  Specific behaviour may be “shaped”  Involves teaching the desired behaviour pattern one step at a time through operant conditioning.  The animal needs to be rewarded for behaviour that resembles the eventual behavioural goal.  Initially reinforcement is given to an approximation of the behavioural goal.  Reinforcement continues as the animals behavioural approximations develop to resemble the final behaviour more closely.  Eventually, only the more precise behaviour is rewarded. http://www.youtube.com/watch?v=g6F0bRTurPk&feature=relate d SHAPING

34 Animal is encouraged to engage in another behaviour that is more pleasurable and which cannot be performed simultaneously with fear responses in the presence of the triggering stimulus. E.g. Feeding a vet phobic Giraffe whilst earing a vets uniform. COUNTER CONDITIONING

35  Prolonged exposure to a negatively perceived stimulus at a level that provokes the response so that the animal eventually gives up.  VERY STRESSFUL AND POTENTIALLY DAMAGING  E.g. confining a dog in an area and playing the tape at a louder than appropriate level until the dog no longer reacts fearfully  Used as a last resort and executed in the most humane way. FLOODING

36  Trainers should use a variety of different learning methods.  Animal must be motivated to learn.  Minimal use of negative reinforcement because of stressor/& fear – not a good learning state for the animal.  Shaping behaviours – helping the animals to learn. HOW ANIMAL TRAINERS USE LEARNING THEORY


Download ppt "NSCI 5702 PRINCIPLES OF TRAINING. 1.What is training? 2.Why is it useful to train animals? 3.How do animals learn? 4.Training techniques used with animals."

Similar presentations


Ads by Google