Schedules of Reinforcement

advertisement
Schedules of Reinforcement
CH 17,18,19
Divers of Nassau
• Diving for coins
• Success does not follow every attempt
• Success means reinforcement
Intermittent Reinforcement
• A reinforcer follows the response only once
in a while.
Continuous Reinforcement
(CRF)
• A reinforcer follows every response.
• Best used for shaping or maintaining
difficult behavior.
Shaping with Reinforcement
Reinforcement
Before:
Andrew
has no gum
Behavior Initial:
Andrew moves lips
After:
Intermed: Andrew
makes croaking sound
Andrew
receives
gum
Terminal: Says words
clearly
Behavior Initial: NA
Intermed: Moves lips
only
Terminal: Says words
unclearly
Extinction
After:
Andrew
receives no
gum
Schedule of Reinforcement
• The way reinforcement occurs because of
the number of responses, time between
responses, and stimulus conditions.
Schedules of Reinforcement
• Ratio
– Fixed Ratio (FR)
– Variable Ratio (VR)
• Interval
– Fixed Interval (FI)
– Variable Interval (VI)
Fixed Ratio (FR) Schedule of
Reinforcement
• A reinforcer follows a fixed number of
responses.
FR Responding
• After a response is reinforced, no
responding occurs for a period of time, then
responding occurs at a high, steady rate
until the next reinforcer is delivered.
Reinforcers
Reinforcement
always immediately
follows the 5th
response!!!!
This is an FR5
schedule of
reinforcement.
Pauses
Postreinforcement Pause (PRP)
• The name for the pause after the
consumption of the reinforcer and before
the next ratio of responses begins.
FR & PRP
• Length of the pause is proportional to the
size of the ratio.
• Small ratio – pausing is short
• Large ratio – pausing is long
General Rule for establishing
intermittently reinforced
behavior:
First use CRF and gradually
increase the intermittency of
reinforcement as responding
stabilizes at a high rate.
FR120
• What are the response requirements?
• When will reinforcement be delivered?
• What will the pattern of responding look
like?
Cumulative Number of Responses
Cumulative Graph for FR
140
120
100
80
Post
reinforcement
pauses
60
40
Reinforcers
20
0
Time
Variable Ratio (VR) Schedule
• A reinforcer follows after a variable number
of responses
VR Responding
• VR schedules produce a high, constant rate
of responding, with almost no
postreinforcement pausing.
VR 50
180
Cumulative Number of Responses
160
140
120
100
80
60
40
20
0
Time
VR 50
• Schedule notation
• Number designates the average number of
responses required for reinforcement
How do typical schedules of
reinforcement differ from
gambling?
• See pages 290-291
Skinner Box vs. 1 Armed Bandit
Typical VR
Many interspersed
learned reinforcers
No
Gambling
Schedule
Yes
Amount of reinforcer
varies from ratio to ratio
No
Yes
Small ratio
No
Yes
Emotional reinforcers
No
Yes
Time Dependent Schedules
• Fixed Interval (FI)
• Variable Interval (VI)
FI Schedule of Reinforcement
• A reinforcer is contingent on the first
response after a fixed interval of time since
the last opportunity for reinforcement.
Cumulative Number of
Responses
Fixed Interval 10s Schedule of Reinforcement
200
150
100
Reinforcer
50
0
0
10
20
Seconds
Example: Fixed Interval (FI) Schedule of
Reinforcement
FI 200-s
Fixed Interval 200-s Schedule of Reinforcement
Cumulative Number of Responses
200
180
160
140
120
100
80
Reinforcer
60
40
20
0
Seconds
200 seconds
400 seconds
600 seconds
Fixed-Interval Scallop
• A FI schedule often produces a scallop – a
gradual increase in the rate of responding
with responding occurring at a high rate just
before reinforcement is available. No
responding occurs for some time after
reinforcement (Post Reinforcement Pause.
Example: Fixed Interval (FI) Schedule of
Reinforcement
FI 200-s
Fixed Interval 200-s Schedule of Reinforcement
Cumulative Number of Responses
200
PRP
180
160
140
Scallop
120
100
80
Reinforcer
60
40
20
0
Seconds
200 seconds
400 seconds
600 seconds
Lever Pressing of Mice after Training
Fixed Interval 200 s
A
Relative Frequency
0.3
+/+
+/rl
rl/rl
0.2
0.1
0
20
40
60
80
100 120 140 160 180 200
Time (sec)
Term paper vs. FI
• Term paper does not have a deadline.
FI vs. Term Paper
FI
Term Paper
Does early responding
effect anything?
No
Yes
Do you get more if you
work harder?
No
Yes
Is the relevant response
class clear?
Yes
No
Are there calendars and
clocks?
No
Yes
Is there a deadline?
No
Yes
Is the reinforcer too
delayed?
No
Yes
Fixed Time & Superstitious
Behavior
• Fixed time schedules of reinforcement
– A reinforcer is delivered after the passage of a
fixed period of time, independently of the
response.
• Superstitious behavior
– Behaving as if the response causes some
specific outcome when it really does not.
Superstitious behavior of the
pigeon:
Experiment by Skinner
Fixed Time Schedule
Variable Interval (VI) Schedule
of Reinforcement
• A reinforcer is contingent on the first
response after a variable interval of time
since the last opportunity for reinforcement.
VI Responding
• VI schedules produce a moderate rate of
responding with almost no
postreinforcement pausing.
Comparing/Contrasting Ratio and
Interval Schedules
• Pg. 305: table
• Pg. 305: cumulative records
Continuous vs. Intermittent
• CRF – every response is reinforced
• Intermittent reinforcement – only some
responses produce a reinforcer.
Intermittent Reinforcement
• Makes the response more resistant to
extinction than does continuous
reinforcement.
Resistance to Extinction
• The number of responses or the amount of
time before a response extinguishes
Resistant to Extinction and
Intermittent Reinforcement
• Intermittent reinforcement makes the response
more resistant to extinction than does continuous
reinforcement.
• Why? Hard for the rat to tell the difference
between only an occasional reinforcement and no
reinforcement
• CRF followed by EXT: rats quickly stop
responding: easy discrimination between
reinforcement & extinction contingencies
4 Types of Concurrent
Contingencies
1. Two compatible responses
2. Compatible contingencies for a single
response
3. Incompatible contingencies for a single
response
4. Two incompatible responses
Concurrent Contingencies
• Contingencies that are available at the same
time.
• More than one contingency of
reinforcement or punishment is available at
the same time.
Concurrent Contingencies
• Two levers in chamber
– VI 300 s on Left lever
– VI 30 s on Right lever
• Will animal allocate more responses to the
left or the right lever?
• Matching Law (pg. 330)
% of Left lever presses = % Left-lever reinforcers
Matching Law
• Relative Frequency of responding on two
concurrent schedules of reinforcement
equals the relative value of reinforcement
on those two schedules
Download