Operant Conditioning Module 19 1 Learning Operant Conditioning Overview Skinner’s Experiments Extending Skinner’s Understanding Skinner’s Legacy Contrasting Classical & Operant Conditioning 2 Edward L. Thorndike ( 1874–1949) 3 Thorndike’s Puzzle Box link 4 Early Operant Conditioning • E. L. Thorndike (1898) • Puzzle boxes and cats First Trial Scratch at bars After Many Scratch at bars in Box Push at ceiling Trials in Box Push at ceiling Situation: stimuli inside of puzzle box Dig at floor Howl Etc. Situation: stimuli inside of puzzle box Dig at floor Howl Etc. Etc. Etc. Press lever Press lever 5 B. F. Skinner (1904–1990) 6 B.F. Skinner and Operant Conditioning • Classical conditioning involves an automatic response to a stimulus • Operant conditioning involves learning how to control one’s response to elicit a reward or avoid a punishment (to press a lever for example) 7 Skinner’s Experiments Skinner’s experiments extend Thorndike’s thinking, especially his law of effect. This law states that rewarded behavior is likely to occur again. Yale University Library 8 Operant Conditioning Operant Behavior operates (acts) on environment produces consequences Respondent Behavior occurs as an automatic response to stimulus behavior learned through classical conditioning 9 Operant Chamber Skinner Box chamber with a bar or key that an animal manipulates to obtain a food or water reinforcer contains devices to record responses 10 Operant Chamber Examples. Walter Dawn/ Photo Researchers, Inc. From The Essentials of Conditioning and Learning, 3rd Edition by Michael P. Domjan, 2005. Used with permission by Thomson Learning, Wadsworth Division 11 The “Skinner Box” • Rats placed in “Skinner boxes” • Shaped to get closer and closer to the bar in order to receive food • Eventually required to press the bar to receive food • Food is a reinforcer 12 Shaping Shaping is the operant conditioning procedure in which reinforcers guide behavior towards the desired target behavior through successive approximations. link Fred Bavendam/ Peter Arnold, Inc. Khamis Ramadhan/ Panapress/ Getty Images A rat shaped to sniff mines. A manatee shaped to discriminate objects of different shapes, colors and sizes. 13 14 p. 228 15 p. 228 Types of Reinforcers Reinforcement: Any event that strengthens the behavior it follows. Reuters/ Corbis A heat lamp positively reinforces a meerkat’s behavior in the cold. 16 Types of Reinforcement • Positive reinforcer (+) – Adds something rewarding following a behavior, making that behavior more likely to occur again – Giving a dog a treat for fetching a ball is an example • Negative reinforcer (-) – Removes something unpleasant that was already in the environment following a behavior, making that behavior more likely to occur again – Taking an aspirin to relieve a headache is an example 17 18 19 Escape and Avoidance: Two types of negative reinforcement Escape Conditioning Avoidance Conditioning Adapted from: The Psychology of Memory and Learning by Hintzman. © 1978 by W.H. Freeman and Company. Used with permission. 20 Learned Helplessness • Failure to try to avoid an unpleasant stimulus because in the past it was unavoidable • Possible model for depression in humans 21 Kinds of Reinforcement and Punishment Positive + Negative – (adding stimulus) (removing stimulus) Reinforcement (label afterwards to describe increase in behavior) Punishment (label afterwards to describe decrease in behavior) 22 Kinds of Reinforcement and Punishment Reinforcement (label afterwards to describe increase in behavior) Positive + Negative – (adding stimulus) (removing stimulus) Pos. Reinf. (Adding pleasant consequence) Punishment (label afterwards to describe decrease in behavior) 23 Kinds of Reinforcement and Punishment Reinforcement (label afterwards to describe increase in behavior) Positive + Negative – (adding stimulus) (removing stimulus) Pos. Reinf. Neg. Reinf. (Adding pleasant consequence) (Removing Aversive Stimuli) Punishment (label afterwards to describe decrease in behavior) 24 Kinds of Reinforcement and Punishment Reinforcement (label afterwards to describe increase in behavior) Punishment (label afterwards to describe decrease in behavior) Positive + Negative – (adding stimulus) (removing stimulus) Pos. Reinf. Neg. Reinf. (Adding pleasant consequence) (Removing Aversive Stimuli) Pos. Pun. (Adding aversive stimuli) 25 Kinds of Reinforcement and Punishment Reinforcement (label afterwards to describe increase in behavior) Punishment (label afterwards to describe decrease in behavior) Positive + Negative – (adding stimulus) (removing stimulus) Pos. Reinf. Neg. Reinf. (Adding pleasant consequence) (Removing Aversive Stimuli) Pos. Pun. (Adding aversive stimuli) Neg. Pun. (Removing pleasant stimuli) 26 Examples Link 1 Link 2 27 Negative Reinforcement and Punishment Negative reinforcement: Removing an unpleasant stimulus 1. Unpleasant stimulus Punishment 1. Introducing an unpleasant stimulus = 2. Removal of unpleasant stimulus 2. Withholding a pleasant stimulus = 28 29 Figure 6.18 Positive reinforcement versus negative reinforcement 30 Figure 6.20 Comparison of negative reinforcement and punishment 31 IMPORTANT!! • Negative reinforcement is NOT punishment. • Negative reinforcement is the REMOVAL of unpleasant stimulus when target behavior is observed (a positive consequence of behavior – increases behavior) • Punishment is the introduction of an aversive (unpleasant) stimulus or removal of a pleasant stimulus as a consequence of behavior – ( a negative consequence of behavior - decreases behavior. 32 Punishment An aversive event that decreases the behavior it follows. 33 Primary & Secondary Reinforcers 1. Primary Reinforcer: An innately reinforcing stimulus like food or drink. (satisfies a biological need 2. Conditioned (secondary) Reinforcer: A learned reinforcer that gets its reinforcing power through association with the primary reinforcer. 34 Immediate & Delayed Reinforcers 1. Immediate Reinforcer: A reinforcer that occurs instantly after a behavior. A rat gets a food pellet for a bar press. 2. Delayed Reinforcer: A reinforcer that is delayed in time for a certain behavior. A paycheck that comes at the end of a week. 35 Reinforcement Schedules 1. Continuous Reinforcement: Reinforces the desired response each time it occurs. 2. Partial (intermittent) Reinforcement: Reinforces a response only part of the time. Though this results in slower acquisition in the beginning, it shows greater resistance to extinction later on. 36 Schedules of Reinforcement • Partial reinforcement lies between continuous reinforcement and extinction 37 Schedules of Reinforcement Fixed Ratio (FR) reinforces a response only after a specified number of responses faster you respond the more rewards you get different ratios very high rate of responding like piecework pay 38 Schedules of Reinforcement Variable Ratio (VR) reinforces a response after an unpredictable number of responses like gambling, fishing very hard to extinguish because of unpredictability Skinner link 3:58 SLOT machines show SLOwesT extinction. 39 Schedules of Reinforcement Fixed Interval (FI) reinforces a response only after a specified (fixed) time has elapsed response occurs more frequently as the anticipated time for reward draws near 40 Schedules of Reinforcement Variable Interval (VI) reinforces a response at unpredictable time intervals produces slow steady responding like pop quiz 41 Intermittent Reinforcement Schedules Summary Based on Number of necessary responses Predictable Unpredictable (“On the Average”) Based on Time that must first pass Fixed Ratio (FR) Fixed Interval (FI) Variable Ratio (VR) Variable Interval (VI) 42 Schedules of Reinforcement 43 • You do not have to write down the following examples. 44 FI, VI, FR, or VR? 1. 2. 3. 4. 5. When I bake cookies, I can only put one set in at a time, so after 10 minutes my first set of cookies is done. After another ten minutes, my second set of cookies is done. I get to eat a cookie after each set is done baking. After every 10 math problems that I complete, I allow myself a 5 minute break. I look over my notes every night because I never know how much time will go by before my next pop quiz. When hunting season comes around, sometimes I’ll spend all day sitting in the woods waiting to get a shot at a big buck. It’s worth it though when I get a nice 10 point. Today in Psychology class we were talking about Schedules of Reinforcement and everyone was eagerly raising their hands and participating. Miranda raised her hand a couple of times and was eventually called on. 1. FI 2. FR 3. VI 4. VI 5. VR 45 FI, VI, FR, or VR? 6. Madison spanks her son if she has to ask him three times to clean up his room. 7. Emily has a spelling test every Friday. She usually does well and gets a star sticker. 8. Steve’s a big gambling man. He plays the slot machines all day hoping for a big win. 9. Snakes get hungry at certain times of the day. They might watch any number of prey go by before they decide to strike. 10. Mr. Bertani receives a salary paycheck every 2 weeks. (Miss Suter doesn’t ). 11. Christina works at a tanning salon. For every 2 bottles of lotion she sells, she gets 1 dollar in commission. 12. Mike is trying to study for his upcoming Psychology quiz. He reads five pages, then takes a break. He resumes reading and takes another break after he has completed 5 more pages. 6. FR 7. FI 8. VR 9. VI 10. FI 11. FR 12. FR 46 FI, VI, FR, or VR? 13. Megan is fundraising to try to raise money so she can go on the annual band trip. She goes door to door in her neighborhood trying to sell popcorn tins. She eventually sells some. 14. Kylie is a business girl who works in the big city. Her boss is busy, so he only checks her work periodically. 15. Mark is a lawyer who owns his own practice. His customers makes payments at irregular times. 16. Jessica is a dental assistant and gets a raise every year at the same time and never in between. 17. Andrew works at a GM factory and is in charge of attaching 3 parts. After he gets his parts attached, he gets some free time before the next car moves down the line. 18. Brittany is a telemarketer trying to sell life insurance. After so many calls, someone will eventually buy. 13. VR 14. VI 15. VI 16. FI 17. FR 18. VR 47 Updating Skinner’s Understanding • Skinner’s emphasis on external control of behavior made him an influential, but controversial figure. • Many psychologists criticized Skinner for underestimating the importance of cognitive and biological constraints. 48 Cognitive Approach This approach emphasizes abstract and subtle learning that could not be achieved through conditioning or social learning alone. 49 Cognition & Operant Conditioning Evidence of cognitive processes during operant learning comes from rats during a maze exploration in which they navigate the maze without an obvious reward. Rats seem to develop cognitive maps (E.C. Tolman), or mental representations, of the layout of the maze (environment). 50 Latent Learning 51 Intrinsic Motivation Intrinsic Motivation: The desire to perform a behavior for its own sake. Extrinsic Motivation: The desire to perform a behavior due to promised rewards or threats of punishments. 52 Biological Predisposition Biological constraints predispose organisms to learn associations that are naturally adaptive. Photo: Bob Bailey Marian Breland Bailey 53 Skinner’s Legacy Skinner argued that behaviors were shaped by external influences instead of inner thoughts and feelings. Critics argued that Skinner dehumanized people by neglecting their free will. Falk/ Photo Researchers, Inc . 54 Applications of Operant Conditioning Skinner introduced the concept of teaching machines that shape learning in small steps and provide reinforcements for correct rewards. LWA-JDL/ Corbis In School 55 Applications of Operant Conditioning Reinforcers affect productivity. Many companies now allow employees to share profits and participate in company ownership. At work 56 Applications of Operant Conditioning At Home In children, reinforcing good behavior increases the occurrence of these behaviors. Ignoring unwanted behaviors decreases their occurrence. 57 EXPLORING PSYCHOLOGY (7th Edition in Modules) David Myers PowerPoint Slides Aneeq Ahmad Henderson State University, Amy Jones, Bernstein, Schallhorn with Garber edits Worth Publishers, © 2008 58