Schedules of Reinforcement CH 17 18 19 Divers

  • Slides: 43
Download presentation
Schedules of Reinforcement CH 17, 18, 19

Schedules of Reinforcement CH 17, 18, 19

Divers of Nassau • Diving for coins • Success does not follow every attempt

Divers of Nassau • Diving for coins • Success does not follow every attempt • Success means reinforcement

Intermittent Reinforcement • A reinforcer follows the response only once in a while.

Intermittent Reinforcement • A reinforcer follows the response only once in a while.

Continuous Reinforcement (CRF) • A reinforcer follows every response. • Best used for shaping

Continuous Reinforcement (CRF) • A reinforcer follows every response. • Best used for shaping or maintaining difficult behavior.

Shaping with Reinforcement Behavior Initial: Andrew moves lips Intermed: Andrew makes croaking sound Before:

Shaping with Reinforcement Behavior Initial: Andrew moves lips Intermed: Andrew makes croaking sound Before: Andrew has no gum Terminal: Says words clearly After: Andrew receives gum Behavior Initial: NA After: Intermed: Moves lips only Andrew receives no gum Terminal: Says words unclearly Extinction

Schedule of Reinforcement • The way reinforcement occurs because of the number of responses,

Schedule of Reinforcement • The way reinforcement occurs because of the number of responses, time between responses, and stimulus conditions.

Schedules of Reinforcement • Ratio – Fixed Ratio (FR) – Variable Ratio (VR) •

Schedules of Reinforcement • Ratio – Fixed Ratio (FR) – Variable Ratio (VR) • Interval – Fixed Interval (FI) – Variable Interval (VI)

Fixed Ratio (FR) Schedule of Reinforcement • A reinforcer follows a fixed number of

Fixed Ratio (FR) Schedule of Reinforcement • A reinforcer follows a fixed number of responses.

FR Responding • After a response is reinforced, no responding occurs for a period

FR Responding • After a response is reinforced, no responding occurs for a period of time, then responding occurs at a high, steady rate until the next reinforcer is delivered.

Reinforcers Reinforcement always immediately follows the 5 th response!!!! This is an FR 5

Reinforcers Reinforcement always immediately follows the 5 th response!!!! This is an FR 5 schedule of reinforcement. Pauses

Postreinforcement Pause (PRP) • The name for the pause after the consumption of the

Postreinforcement Pause (PRP) • The name for the pause after the consumption of the reinforcer and before the next ratio of responses begins.

FR & PRP • Length of the pause is proportional to the size of

FR & PRP • Length of the pause is proportional to the size of the ratio. • Small ratio – pausing is short • Large ratio – pausing is long

General Rule for establishing intermittently reinforced behavior: First use CRF and gradually increase the

General Rule for establishing intermittently reinforced behavior: First use CRF and gradually increase the intermittency of reinforcement as responding stabilizes at a high rate.

FR 120 • What are the response requirements? • When will reinforcement be delivered?

FR 120 • What are the response requirements? • When will reinforcement be delivered? • What will the pattern of responding look like?

Cumulative Graph for FR Post reinforcement pauses Reinforcers

Cumulative Graph for FR Post reinforcement pauses Reinforcers

Variable Ratio (VR) Schedule • A reinforcer follows after a variable number of responses

Variable Ratio (VR) Schedule • A reinforcer follows after a variable number of responses

VR Responding • VR schedules produce a high, constant rate of responding, with almost

VR Responding • VR schedules produce a high, constant rate of responding, with almost no postreinforcement pausing.

VR 50

VR 50

VR 50 • Schedule notation • Number designates the average number of responses required

VR 50 • Schedule notation • Number designates the average number of responses required for reinforcement

How do typical schedules of reinforcement differ from gambling? • See pages 290 -291

How do typical schedules of reinforcement differ from gambling? • See pages 290 -291

Skinner Box vs. 1 Armed Bandit Typical VR Gambling Schedule Many interspersed learned reinforcers

Skinner Box vs. 1 Armed Bandit Typical VR Gambling Schedule Many interspersed learned reinforcers No Yes Amount of reinforcer varies from ratio to ratio No Yes Small ratio No Yes Emotional reinforcers No Yes

Time Dependent Schedules • Fixed Interval (FI) • Variable Interval (VI)

Time Dependent Schedules • Fixed Interval (FI) • Variable Interval (VI)

FI Schedule of Reinforcement • A reinforcer is contingent on the first response after

FI Schedule of Reinforcement • A reinforcer is contingent on the first response after a fixed interval of time since the last opportunity for reinforcement.

Example: Fixed Interval (FI) Schedule of Reinforcement FI 200 -s 200 seconds 400 seconds

Example: Fixed Interval (FI) Schedule of Reinforcement FI 200 -s 200 seconds 400 seconds 600 seconds

Fixed-Interval Scallop • A FI schedule often produces a scallop – a gradual increase

Fixed-Interval Scallop • A FI schedule often produces a scallop – a gradual increase in the rate of responding with responding occurring at a high rate just before reinforcement is available. No responding occurs for some time after reinforcement (Post Reinforcement Pause.

Example: Fixed Interval (FI) Schedule of Reinforcement FI 200 -s PRP Scallop 200 seconds

Example: Fixed Interval (FI) Schedule of Reinforcement FI 200 -s PRP Scallop 200 seconds 400 seconds 600 seconds

Lever Pressing of Mice after Training

Lever Pressing of Mice after Training

Term paper vs. FI • Term paper does not have a deadline.

Term paper vs. FI • Term paper does not have a deadline.

FI vs. Term Paper FI Term Paper Does early responding effect anything? No Yes

FI vs. Term Paper FI Term Paper Does early responding effect anything? No Yes Do you get more if you work harder? No Yes Is the relevant response class clear? Yes No Are there calendars and clocks? No Yes Is there a deadline? No Yes Is the reinforcer too delayed? No Yes

Fixed Time & Superstitious Behavior • Fixed time schedules of reinforcement – A reinforcer

Fixed Time & Superstitious Behavior • Fixed time schedules of reinforcement – A reinforcer is delivered after the passage of a fixed period of time, independently of the response. • Superstitious behavior – Behaving as if the response causes some specific outcome when it really does not.

Superstitious behavior of the pigeon: Experiment by Skinner Fixed Time Schedule

Superstitious behavior of the pigeon: Experiment by Skinner Fixed Time Schedule

Variable Interval (VI) Schedule of Reinforcement • A reinforcer is contingent on the first

Variable Interval (VI) Schedule of Reinforcement • A reinforcer is contingent on the first response after a variable interval of time since the last opportunity for reinforcement.

VI Responding • VI schedules produce a moderate of responding with almost no postreinforcement

VI Responding • VI schedules produce a moderate of responding with almost no postreinforcement pausing.

Comparing/Contrasting Ratio and Interval Schedules • Pg. 305: table • Pg. 305: cumulative records

Comparing/Contrasting Ratio and Interval Schedules • Pg. 305: table • Pg. 305: cumulative records

Continuous vs. Intermittent • CRF – every response is reinforced • Intermittent reinforcement –

Continuous vs. Intermittent • CRF – every response is reinforced • Intermittent reinforcement – only some responses produce a reinforcer.

Intermittent Reinforcement • Makes the response more resistant to extinction than does continuous reinforcement.

Intermittent Reinforcement • Makes the response more resistant to extinction than does continuous reinforcement.

Resistance to Extinction • The number of responses or the amount of time before

Resistance to Extinction • The number of responses or the amount of time before a response extinguishes

Resistant to Extinction and Intermittent Reinforcement • Intermittent reinforcement makes the response more resistant

Resistant to Extinction and Intermittent Reinforcement • Intermittent reinforcement makes the response more resistant to extinction than does continuous reinforcement. • Why? Hard for the rat to tell the difference between only an occasional reinforcement and no reinforcement • CRF followed by EXT: rats quickly stop responding: easy discrimination between reinforcement & extinction contingencies

4 Types of Concurrent Contingencies 1. Two compatible responses 2. Compatible contingencies for a

4 Types of Concurrent Contingencies 1. Two compatible responses 2. Compatible contingencies for a single response 3. Incompatible contingencies for a single response 4. Two incompatible responses

Concurrent Contingencies • Contingencies that are available at the same time. • More than

Concurrent Contingencies • Contingencies that are available at the same time. • More than one contingency of reinforcement or punishment is available at the same time.

Concurrent Contingencies • Two levers in chamber – VI 300 s on Left lever

Concurrent Contingencies • Two levers in chamber – VI 300 s on Left lever – VI 30 s on Right lever • Will animal allocate more responses to the left or the right lever? • Matching Law (pg. 330) % of Left lever presses = % Left-lever reinforcers

Matching Law • Relative Frequency of responding on two concurrent schedules of reinforcement equals

Matching Law • Relative Frequency of responding on two concurrent schedules of reinforcement equals the relative value of reinforcement on those two schedules