Module 19 Operant Conditioning Big Question: Is the organism learning associations between events that it does not control (classical) OR is it learning associations between its behavior and resulting events (operant) Edward Thorndike (1874-1949) • Introduced the “Law of Effect” • Behaviors with favorable consequences will occur more frequently. • Behaviors with unfavorable consequences will occur less frequently. • Developed into Operant Conditioning • Created puzzle boxes for research on cats Thorndike’s Puzzle Box Operant Conditioning • A type of learning in which the frequency of a behavior depends on the consequence that follows that behavior • The frequency will if the consequence is reinforcing to the subject. • The frequency will if the consequence is not reinforcing to the subject. B.F. Skinner (1904-1990) • Developed the fundamental principles and techniques of operant conditioning. • Devised ways to apply these principles in the real world. • Designed the Skinner Box. (operant box) B.F. SKINNER • http://www.youtube.com/watch?v=AepqpT tKbwo (Skinner discusses pigeons) • http://www.youtube.com/watch?v=vGazyH 6fQQ4&feature=related (ping-pong) • http://www.youtube.com/watch?v=vGazyH 6fQQ4&feature=related (pigeon v. human) Reinforcement v Punishment • Reinforcement - Any consequence that increases the likelihood of the behavior to be repeated. • Punishment - Any consequence that decreases the likelihood of the behavior to be repeated. I. Reinforcement A. Types of Reinforcement 1. Positive Reinforcement • Anything that increases the likelihood of a behavior by following it with a desirable event or state • The subject receives something they want • Will strengthen the behavior Positive Reinforcement Operant Conditioning Activity: Positive Reinforcement Get in groups of three. Choose who will be the recorder, the experimenter, and the subject. Subjects please leave the room for a moment. Directions…… 2. Negative Reinforcement • Anything that increases the likelihood of a behavior by following it with the removal of an undesirable event or state • Something the subject doesn’t like is removed XX OR • Will strengthen the behavior (Definition of Reinforcement) Negative Reinforcement Positive/Negative Reinforcement • Positive Reinforcement-any condition that follows and strengthens a response. • Getting a hug • Receiving a paycheck • Food, money, sex • Attention, praise, smile • Negative Reinforcementsubtraction of the unpleasant stimulus • Fastening a seatbelt to turn off beeping. • Pushing snooze button will silence your annoying alarm. • Use umbrella to avoid getting wet. II. Ways of Reinforcement: A. Primary v Secondary A. 1. Primary Reinforcement • Something that is naturally reinforcing • Examples: food, warmth, water, etc. • The item is reinforcing in and of itself A. 2. Secondary Reinforcement • Something that a person has learned to value or finds rewarding because it is paired with a primary reinforcer • Money is a good example • Cooking utensil II. Ways of Reinforcement B. Shaping • Step by step reinforcement of behaviors that are more and more similar to the one you want to occur. (Progress Reports, etc) •Technique used to establish a new behavior II. Ways of Reinforcement: C. Immediate v Delayed C. Immediate/Delayed Reinforcement • Immediate reinforcement is more effective than delayed reinforcementhowever humans will respond to delayed reinforcement better than animals. • Ability to delay gratification predicts higher achievement II. Ways of Reinforcement D. Schedules of Reinforcement: 1. Continuous Reinforcement D. 1. Continuous reinforcement • A schedule of reinforcement in which a reward follows every correct response • Most useful way to establish a behavior. • The behavior will extinguish quickly once the reinforcement stops. D. 2. Partial Reinforcement • A schedule of reinforcement in which a reward follows only some correct responses-initial learning is slower but there is a greater resistance to extinction. • Includes the following types: – Fixed-interval and variable interval – Fixed-ratio and variable-ratio (a) FixedInterval Schedule • A partial reinforcement schedule that rewards only the first correct response after some defined period of time • i.e. weekly quiz in a class; monthly pay check (a) Variable-Interval Schedule • A partial reinforcement that rewards the first correct response after an unpredictable amount of time • i.e. “pop” quiz in a class; fishing (b) Fixed-Ratio Schedule • A partial reinforcement schedule that rewards a response only after some defined number of correct responses • The faster the subject responds, the more reinforcements they will receive. • Ex. Pay a worker a dollar for every 10 tires they fix (b) Variable-Ratio Schedule • A partial reinforcement schedule that rewards an unpredictable number of correct responses • This schedule is very resistant to extinction. • Sometimes called the “gambler’s schedule”; similar to a slot machine; people who make sales pitches by telephone Schedules of Reinforcement III. Punishment: The Process of Punishment Decrease a behavior from happening again by following it with a negative consequence II. A. Types of Punishment (1) An undesirable event following a behavior (2) A desirable state or event ends following a behavior Module 16: Operant Conditioning III. Punishment: B. Problems With Punishment II. B. Negative Effects of Punishment • Doesn’t prevent the undesirable behavior when away from the punisher • Can lead to fear, anxiety, and lower selfesteem • Children who are punished physically may learn to use aggression as a means to solve problems. II. C. Positive Effects of Punishment • Punishment can effectively control certain behaviors. • Especially useful if teaching a child not to do a dangerous behavior • Most still suggest reinforcing an incompatible behavior rather than using punishment Module 16: Operant Conditioning IV. The Role of Cognition: New Understandings of Operant Conditioning III. A. Latent Learning • Learning that takes place in absence of an apparent reward III. B. Cognitive Map • A mental representation of a place • Experiments showed rats could learn a maze without any reinforcements III. C. Overjustification Effect • The effect of promising a reward for doing what someone already likes to do • The reward may lessen and replace the person’s original, natural motivation, so that the behavior stops if the reward is eliminated The End