Learning - Plain Local Schools

advertisement
Chapter 2: Learning
There are several types of LEARNING like:
Observational Learning, Operant Conditioning & Classical Conditioning
Classical Conditioning –
Conditioned means LEARNED
*Accidently discovered by Ivan Pavlov and his famous experiment with dogs
*A learning procedure where a natural stimulus-response relationship
is attached to a new stimulus
_______________________________________________________________________________________
Example:
natural stimulus-response= ”Unconditioned”
food presented
= dog salivates
Unconditioned Stimulus
Unconditioned Response
Classical Conditioning happens when a NEUTRAL stimulus gets attached
to the existing natural stimulus response relationship
Example:
neutral stimulus – the tone of the bell
Bell (NS)+ food(UCS) = dog drools(UCR)
Repeat pairing = Result =learning or CONDITIONING takes place (Acquisition)
Dog salivate when hear bell (with no food)
Bell (CS-Conditioned Stimulus) = dog drooling (CR-Conditioned response)
Acquisition - the first stages of learning when a response is established
When the once neutral stimulus evokes the “conditioned” response
& the once neutral stimulus becomes now the “conditioned” stimulus
This NEW learned response is called classical conditioning
UCS= Unconditioned stimulus (natural): An event that elicits a certain
predictable response without previous training (presence of food)
UCR= Unconditioned response (natural):
A reaction that occurs naturally and automatically when the unconditioned
stimulus is presented (salivate)
CS= Conditioned Stimulus (learned)
once-neutral stimulus (tone of bell) that has come to elicit a given response
after a period of training in which it has been paired with an unconditioned
stimulus (UCS)
CR= Conditioned Response(learned)
learned reaction to a conditioned stimulus (salivating with bell &
no food present)
*Increased pairing of NS and UCS = acquisition happens which is
classical conditioning*
GOLDEN RULES TO CLASSICAL CONDITIONING:
1.
The UCR & the CR are ALWAYS the SAME!
2.
NS always becomes the CS!
Let’s break it down:
Lets try some together!
1. You are taking a shower. Your sister flushes the toilet.
The water turns scalding hot and you jump back. Now
when you take a shower and you hear the toilet flush you
automatically jump.
(UCS)__________= __________(UCR)
(NS)_______+ (UCS)__________=___________(UCR)
Acquisition takes place after multiple pairings
(CS)___________=___________(CR)
2. You go to your doctor’s office to receive weekly injections for a
medical condition. The injection’s side effects include a
warming sensation and dizziness. Now when you wait in any
small doctor office like space you feel warm and dizzy.
(UCS)__________= __________(UCR)
(NS)_______+ (UCS)__________=___________(UCR)
Acquisition takes place after multiple pairings
(CS)___________=___________(CR)
3. You were driving to your friend’s house and were stopped by
police for speeding. Once you saw the lights of the police car
your heart started racing. Months later when you drive to
your friend’s, at the appropriate speed limit, on the same
stretch of the road where you were pulled over your heart
races.
(UCS)__________= __________(UCR)
(NS)_______+ (UCS)__________=___________(UCR)
Acquisition takes place after multiple pairings
(CS)___________=___________(CR)
Related Classical Conditioning Terms:
Extinction: The gradual disappearance of a conditioned response because
the reinforcement is withheld or because the conditioned stimulus is
repeatedly presented without the UCS (unconditioned stimulus)
Think about Baby Albert: if Watson would have removed the fear of white fuzzy
things by showing baby fuzzy things repeatedly with no loud scary noise, baby’s
fear would have decreased and eventually become extinct
Spontaneous Recovery –
the reappearance of the conditioned response after a rest period
While the response might disappear, that does not mean
that it has been forgotten or eliminated.
Generalization: Responding similarly to a range of similar stimuli,
think of little Albert & his fear of all white fuzzy things
Discrimination: The ability to respond differently to similar but distinct
stimuli, example being afraid of only white fuzzy items not all fuzzy items
Taste Aversions-
linking your distaste for particular items to a period of illness,
queasiness or nausea
The previously neutral stimulus (the food) is paired with an unconditioned
stimulus (an illness), which leads to an unconditioned response (feeling sick).
After this one-time pairing, the previously neutral stimulus (the food) is
now a conditioned stimulus that elicits a conditioned response (avoiding the
food).
Read pages 117 & 118 from the AP Review Book
& define all bold vocab
Operant Conditioning A form of learning in which a certain action is reinforced or punished, resulting in increases or decreases in the likelihood that action
happening again = TRAINING
* study of how behavior is affected by its consequences
Edward Thorndike(1898) started operant research
Originally called “Instrumental Conditioning”
- a behavior becomes more or less pronounced depending on its consequences
- Research with cat and a puzzle box http://www.simplypsychology.org/edward-thorndike.html
Law of Effect- any behavior that is followed by pleasant consequences is likely to be repeated, and any behavior followed by unpleasant consequences
is likely to be stopped
B.F. Skinner
Skinner Box Experiments
http://www.youtube.com/watch?v=I_ctJqjlrHA
Operant Conditioning Training Procedures
4 Methods to choose from
1.
Positive Reinforcement + something added or given
Rewards given to increase frequency of behavior
Types of Reinforcers:
Primary Reinforcer occur naturally (“primary to my survival” is my mnemonic device)
Examples of primary reinforcers include things that satisfy basic survival needs such as water, food, sleep, air
Secondary Reinforcer
Something that is learned to be rewarding, like gold stars or the behavior color codes system used in elementary schools
Token Economy
Secondary reinforcers are used to reward desirable behaviors, Positive Attitude Awards are a great example –they hold no value in any other situation
Used successfully in prisons and day care programs
Generalized Reinforcer
Money because it can be used to get anything, it can be “generalized”
2.
Negative Reinforcement - subtract something
Removal of something one dislikes, an
aversive/punishment, increases frequency of behavior repeating
*Parent removes your curfew (you don’t like), when you bring home all A’s, to INCREASE the likelihood you get all A’s in future
*Take an aspirin to remove a headache
There are 2 types:
A. Escape – take away aversive stimuli after it already started
B. Avoidance – your behavior takes away the opportunity for the aversive stimuli to ever begin
Let’s Practice together & add a few to your notes
*Put on seatbelt before start car to prevent annoying ding sound
*Before heading out for a day at the beach, you slather on sunscreen so you don’t get sunburned
*Putting on gloves to remove chill from hands, more likely to put on gloves before you even head outside next time
* You decide to clean up your mess in the kitchen so Mom doesn’t yell at you about it
* Mom starts yelling at you about your messy room, so you go clean it to stop the nagging
3. Punishment Aversive consequence follows behavior, used to decrease undesirable behavior
Example: Spray your cat (or kid) with a water bottle
4. Omission Training –
Removal of something of value to decrease the likelihood of the behavior repeating
Example: All time-outs = removal of something good or fun
IF YOU ARE USING REINFORCEMENTS – How often will you give them?
Schedule of Reinforcements: Timing and frequency of reinforcement are important:
1.
Continuous Schedule:
Reward every desired behavior, learning happens quick but so will extinction
2.
Partial Schedule (Intermittent):
Responses are more stable and last longer
4 Types of Partial Schedule Reinforcements:
RATIO = NUMBER
INTERVALS = TIME
See handout
Fixed-ratio schedule: A schedule of reinforcement in which a specific number of correct responses is required before reinforcement can be obtained
Ex: Rewarding every 4th response
Variable-ratio schedule: A schedule of reinforcement in which an ever changing number of responses are required before reinforcement can be obtained each time
EX: Slot machines, set to pay off after a varying number of attempts and you never know if it will be the next pull or not will be the reward
Fixed-interval schedule: A schedule of reinforcement in which a specific amount of time must elapse before a response will elicit reinforcement
Ex. Teacher gives a quiz every Monday
Variable-interval schedule: A schedule of reinforcement in which changing amount of time must elapse before a response will obtain reinforcement each time
Ex: Trying to reach a friend and goes straight to voicemail. Keep trying at random lengths of time you are not sure how long you have to wait to get ahold of them
Begin NOW
Fill out the Graphic Organizer
Practice Schedules of reinforcement on WKST 1-22
More CLASSICAL CONDITIONING terms to learn:
Delayed Conditioning–
* Ideal training*
NS precedes UCS; briefly overlap
Example w/ Pavlov’s dogs: ______________________________________________________________________________________________________
Simultaneous Conditioning –
NS and UCS paired together at the same time
Example w/ Pavlov’s dogs: ______________________________________________________________________________________________________
Trace Conditioning–
NS presented first, removed, then the UCS is presented
Example w/ Pavlov’s dogs: ______________________________________________________________________________________________________
Backwards Conditioning–
UCS presented first and the NS follows
Example w/ Pavlov’s dogs: ______________________________________________________________________________________________________
Dueling Theories on What leads to Conditioning:
Contiguity - (Webster’s Definition - the state of being in indirect contact with)
Pavlov – behavioral perspective
Proximity important to learning occurring
Stimuli that are close to one another in time and space become associated
VERSES
Contingency –
Richard Rescorla –cognitivist perspective
When one stimulus depends on the other, they become associated (correlation)
More OPERANT CONDITIONING terms to learn:
Instinctive Drift – animals lose their conditioned behavior to instead perform more organism specific behavior
See handout
example:_______________________________________________________________________________________________________________________
___________________________________________________________________________________________________________________________________
Preparedness – predisposition to easily learn behaviors related to survival of the species
example: Teaching a chicken to peck at a specific colored disk, card or object
http://www.youtube.com/watch?v=2F5kLv6ErOA
Premack principle – Parents’ favorite weapon!!!
David Premack states that a commonly occurring action (watching TV) can be used effectively as a reinforcer for a less commonly occurring one
(cleaning your room).
example: __IF YOU R CLEAN YOUR ROOM, YOU CAN WATCH TV
an activity that probably does not require reinforcement, is used as a reinforcer for cleaning the room,
Behavioral modification – using conditioning principles to alter behavior, see examples on website
http://www.edpsycinteractive.org/topics/behavior/behmod.html
This is just one of the many ways to develop a new behavior via conditioning
1. Successive Approximation Principle:
To teach a child to act in a manner in which he has seldom or never before behaved, reward successive steps to the final behavior.
Latent Learning – Edward Tolman
Rats in mazes
We make cognitive maps, and we aren’t aware we have them, until we need to rely on them. Like getting around your house in the dark.
Insight – Wolfgang Kohler
Peanut in the tube puzzle
Kohler’s chimps solving puzzles
Learned helplessness – Martin Seligman
One of several explanations for depression
Constantly facing failures, causes one to stop trying!
This person develops an external locus of control- no matter what they do, crap happens to them – they have no control over the
situation
Experiment – Dogs were shocked
Escalator video
Learned laziness - Martin Seligman
The person was handed everything, they never learned to work for things. They literally learn to be “lazy”.
Happens with bright students, up until junior year they never had to study and easily got great grades. But as a junior the curriculum
is more challenging. They have no idea how to study and are frustrated by the idea that they will have to work for a grade.
Shaping vs. Chaining –
Chaining = requires one to complete a sequence or series of tasks before a reinforcement is given (like links in a chain are bound together)
Read your kids a book (reinforcement) after they complete all: use the restroom, take a bath, put on PJs, brush their teeth
Shaping = you reinforce desired behaviors at incremental steps / Skinner training his pigeon to do a 360 turn
Define the following terms
Chapter 2: Learning
Shaping
Chaining
Feedback
Transfer
Learned Helplessness
Learned Laziness
Behavior Modification
Response Chains
Avoidance
Escape
Observational Learning
Modeling
Disinhibition
Download