Operant conditioning More on reinforcement and beyond extinction Occurs when reinforcement stops following the response If you repeat a previously reinforced behavior but fail to receive reinforcement, eventually you will probably stop performing the action Stimulus generalization Somewhat similar to classical conditioning If a certain response/behavior received reinforcement in a certain situation, you are likely to make the same response in a similar situation discrimination Some responses gain reinforcement, others don’t As you would expect, those that gain reinforcement will be much more likely to be repeated We learn to chose or discriminate between the two Discriminative stimuli Any stimulus that provides an organism with a cue for making a certain response in order to obtain reinforcement Stop signs Warm smiles/ eye contact Primary reinforcers these stand alone and, all things being equal, reinforce the behaviors that precede them because of their importance to the organism For example: water, food, and comfort Secondary reinforcers These must first be paired with unconditional reinforcers to prompt increases in behaviors The most obvious example is money, which can help someone secure water, food, etc. Reinforcement schedules At the most basic level we have two different types of reinforcement schedules: 1) Continuous, in which reinforcement follows every correct response, and 2) Intermittent, in which only some correct responses are followed by reinforcement. Intermittent schedules Fixed ratio – provide reinforcement after only a fixed number of correct responses Marked by a steady response rate If many responses are needed to secure reinforcement, a pause follows reinforcement Variable ratio – reinforcement provided after a variable amount of responses Marked by steady response rates Playing slot machines, lotto, and many other types of gambling Fixed interval – reinforcement delivered after a fixed amount of time Lots of responses right before reinforcement Variable interval – reinforcement delivered after a varying amount of time Responses are made at a slow and steady rate An astronomer looking for comets The allure of intermittent reinforcement While behaviors continuously reinforced quickly succumb to extinction, Behaviors which are intermittently reinforced are quite resistant to extinction Lotto That unpredictable boyfriend/girlfriend Dollard & miller: beyond skinner Remember, Skinner was starving those poor pigeons and rats. Wasn’t he ignoring something? Dollard and Miller, in the 50’s, thought so. Dollard & miller ii A four step process: 1) Drive – propels the organism into action, 2) Cue – the guiding stimuli the “how and where” of the response 3) Response – the actual behavior, and 4) Reinforcement – reduces the drive For example: A long trip on an interstate 1) Drive – eventually, you get hungry 2) Cue – road signs advise of food ahead 3) Response – you get off the highway and pull into the parking lot 4) Reinforcement – you eat, satisfying your hunger Observational learning Classical conditioning and operant conditioning don’t explain it all Operant conditioning implies trial and error Some behaviors are too dangerous to try and then see what happens We learn from the experiences of others Bandura’s contribution Albert Bandura realized that Skinner’s framework was not sufficient He guessed that we learn from watching the behavior of others, not just enduring the consequences of our own imitation Poor Bobo If children watched an adult hammer him they were likely to do the same thing Even repeating the exact words If the adult spared the doll, all the children did also Vicarious consequences However, if the children watched the adult being disciplined for pounding the hapless bag of air, they didn’t rough him up They had somehow vicariously experienced the adult’s punishment and the behavior never emerged Vicarious reinforcement Powerball winners Are they hidden? No, they are celebrated We share in their triumph We can’t wait for our chance How about all those who lost big? Encouraging moral behavior “Catch them when they are being good”? If we wait for it to emerge, and then reinforce it, we might wait a long time Moral behavior rarely occurs spontaneously So, we teach it through ….. modeling We imitate those we view as: 1) Powerful, 2) Successful, 3) Similar to us, and 4) Warm/Kind. Modeling ii We find an individual or character who fits these criteria and then we show them displaying the desired quality It might make an impression