1 Learning pp. 194-233 • Non-associative learning – Sensitization – Habituation – Dishabituation • • • • • • 2 Classical conditioning Operant conditioning Latent learning Cognitive maps Insight learning Observational learning Nature Vs. Nurture • “Give me a dozen healthy infants, well-formed, and my own specified world to bring them up in and I'll guarantee to take any one at random and train him to become any type of specialist I might select – doctor, lawyer, artist, merchant-chief and, yes, even beggar-man and thief, regardless of his talents, penchants, tendencies, abilities, vocations, and race of his ancestors. I am going beyond my facts and I admit it, but so have the advocates of the contrary and they have been doing it for many thousands of years.” John Watson (1930) 3 What is learning? • Modification through experience of pre-existing behavior and understanding • This is shared with all species! • We learn all the time whether we want it or not 4 Why is it important? • • • • 5 You need to adapt to a constantly changing environment to survive Avoid making the same mistakes Master skills Predict events Learning About Stimuli • Sensitization • Habituation • Dishabituation 6 How Habituation Works? • Opponent process theory: – – – – – Stimuli that arouse strong emotions These emotions disrupt state of equilibrium Disruption triggers an opponent process to counteract disruption Repeated arousing makes opponent process stronger Equilibrium restored 1 Classical Conditioning 7 • Associative learning • Pairing two stimuli together • Learning stimulus-response associations Phase 1 Phase 2 Phase 3 The office clip 8 9 10 11 • http://www.youtube.com/watch?v=qzXOscWD-Sg Classical Conditioning is… 12 • A neutral stimulus (bell) paired with a reflex triggering stimulus (meat triggers salivation) • The neutral stimulus (bell) triggers the reflex alone (salivation), without the reflex triggering stimulus (meat) For Classical Conditioning You Need… 13 • Unconditioned Stimulus (UCS) – Triggers a reflex response naturally • Unconditioned Response (UCR) – The reflex triggered by the UCS • Conditioned Stimulus (CS) – The originally neutral stimulus (does NOT triggers a response naturally), eventually associated with the UCS • Conditioned Response (CR) – The reflex triggered by the UCS now responding to the CS 14 Extinction and Spontaneous Recovery Write in a Paper 15 16 • Which is the UCS, CS, CR, UCR in The Office clip? Second-order Conditioning 17 • Conditioned stimulus (CS) acts like an Unconditioned stimulus (UCS) • Drug Addiction Generalization and Discrimination 18 1 • Generalization: – Similar stimuli to the CS also elicit the CR – The greater the similarity the stronger the response • Discrimination: – Learn to differentiate among similar stimuli to generate different responses 2 2 • Little Albert Video Factors Affecting Learning Other Factors… 19 20 • Predictability: – Does the CS always predict the UCS and only the UCS? • Signal Strength: – CR is greater with a strong UCS • Attention: – You have to pay attention to the appropriate stimulus Biopreparedness 21 • Certain events are specially suited to form associations with other events • Conditioned Taste Aversion – Does not follow the exact same rules than the classical conditioning Some Applications.. 22 • Systematic Desensitization • Predator Control • Alzheimer’s Disease Prediction Instrumental Conditioning 23 1 • Thorndike’s Puzzle box: – Learn relationships between behavior and its consequences – Law of effect – If a response in presence of a stimulus produces rewards it increases its likelihood the next time the same stimulus is presented 24 Operant Conditioning • What is operant conditioning? • Components of operant conditioning – Reinforcement – Punishment • Primary and secondary reinforcers • Shaping • Schedules of reinforcement 25 Operant Conditioning • Learning to respond to the environment to produce good consequences and avoid bad ones • Do something to the environment (operant) • What is the difference with classical conditioning? 26 Operant Conditioning 3 • Skinner’s box – Thorndike measures whether the animal made the response or not – Skinner measures the number of behavior’s in a given period of time – Allowed a more precise control of the stimuli presented 27 28 Components of Operant Conditioning Components of Operant Conditioning • Reinforcer: – Stimulus that increases probability of behavior • Punishment: – Stimulus that decreases the probability of behavior • Positive: – Add a stimulus as a consequence of a response • Negative: – Remove a stimulus as a consequence of a response 30 Components of Operant Conditioning Components of Operant Conditioning 31 Positive Reinforcement 29 • Give – Candy – Money – Golden stars • Can you think of other examples? 33 Components of Operant Conditioning Components of Operant Conditioning 34 Negative Reinforcement 32 • Remove – Electric shock – Loud noise – Awkward silence • Can you think of other examples? 36 Components of Operant Conditioning Components of Operant Conditioning 37 Positive Punishment 35 • Give – Slap – Bad grade – Scream • Can you think of other examples? 38 Components of Operant Conditioning 4 39 Components of Operant Conditioning 40 Negative Punishment • Remove: – – – – Freedom TV Car Money • Can you think of other examples? 41 42 43 44 45 46 47 Types of Operant Conditioning Types of Operant Conditioning Types of Operant Conditioning Types of Operant Conditioning Types of Operant Conditioning Types of Operant Conditioning Operant Conditioning • Can you interpret The Office clip in terms of operant conditioning? 48 Primary And Secondary Reinforcers • Primary Reinforcer: – Innately rewarding • Secondary Reinforcer – Previously neutral reinforcer now associated with a primary reinforcer 49 How Do You Teach Complex Behaviors? • Shaping: – Reinforcing successive approximations to desired behavior • How can you teach a sheep to do a full turn? 50 Schedules Of Reinforcement • Continuous reinforcement – Reinforced every time • Partial reinforcement – Only reinforced sometimes • Why would you reinforce only sometimes? • What about extinction and predictability? 51 Partial Reinforcement • Fixed Ratio: – Reward after a fixed number of responses – FR6 = reward after 6th response 5 • Variable Ratio: – Reward after a variable number of responses – VE6 = reward after an average of 6 responses Partial Reinforcement 52 • Fixed Interval: – Reward after the first response after a fixed period of time – FI15 = Reward after first response after 15 seconds • Variable Interval: – Reward after the first response after a variable period of time – VI15 = Reward after first response after a period of 15 seconds in average Make Groups of 5! 53 • Come up with examples of FR, VR, FI, VI but with negative reinforcement, positive punishment and negative punishment. Why Reinforcers Work? 54 • Premack Principle – Preferred activity reinforces any less preferred activity – Play videogames if you do your homework • Disequilibrium Hypothesis – Any activity can become a reinforcer if it has been witheld for a while • Biological Approach – “Pleasure centers” in the brain 55 1 2 Are Punishments Useful? • Advantages – – – – • Disadvantages – – – – – 56 Signaling wrong behaviors Works when it is contingent to the wrong behavior It is useful when used in combination with other techniques Works if you tell what was wrong and what is a good alternative behavior Does not erase an undesirable habit Is maintained when it is possible to avoid detection Association between punishment and punisher Physical punishment can become aggression and children may imitate it Does not tell how to improve Some Applications… • • • • • Animal training Cultural accepted norms Improving productivity in the work environment Therapeutic techniques Stimulus control therapy – Helps people sleep 57 Other Ways Of Learning? • Learned Helplessness 6 • Latent Learning – Tolman’s Experiment • Cognitive Maps • Insight Learning • Observational Learning 58 Learned Helplessness • You can learn expectations about the loss of control of the environment 59 Latent Learning • Do you know the lyrics of your favorite song? • What’s on TV on Tuesday nights? • What are some movies in theatres right now? 60 Cognitive Maps • Can you imagine going from the classroom to the REP PS-2 waiting room? 61 Insight Learning • • • • Sudden understanding about how to solve problem First performance usually without error Solution remembered well Principle underlying solution is easily applied to others 63 Insight Learning Insight Learning 64 Requirements For Observational Learning 62 • • • • • 65 Attention Retention Ability to reproduce behavior Motivation Characteristics of the model TV and Violence? • Critical Thinking • Make two teams • One defends TV the other attacks TV 66 Learning in the Classroom • Cultural differences 7 • Active learning • Skill learning 67 Learning in Neural Networks • Distributed knowledge • There is no “grandma neuron” • Neurons that fire together wire together! 68 Examples of Reinforcement Schedules 8