Operant Conditioning

I. Operant Conditioning
• A type of learning that occurs when we receive rewards or punishments for our behavior
• Behaviors will increase if they are reinforced
• Behaviors will decrease if they are not reinforced

CLASSICAL VS OPERANT

CLASSICAL
1. There is always a specific stimulus (UCS) that elicits a desired response (INVOLUNTARY)
2. The UCS does not depend on the learner’s response
3. The learner responds to the environment
4. Generalization can occur when the same response occurs to a similar stimulus

OPERANT
Operant conditioning is a process by which the consequences of a response affect the likelihood that the response will occur again.
1. The response is voluntary
2. The subject plays a role in what happens
3. Generalization can occur in that the individual MAY learn to generalize the response

In Class Activity
1. CREATE A VISUAL (ANY KIND): handout, PowerPoint slide…
2. PRESENT THE VISUAL
3. DESIGN, CREATE & IMPLEMENT AN ACTIVITY THAT REINFORCES THE INFORMATION YOU ARE PRESENTING

A. Edward Thorndike: The Puzzle Box

B. B. F. Skinner (1904-1990)
• Founder of operant conditioning
• Devised ways to test his theories in the real world (The Skinner Box)

Components of Operant Conditioning
1. Reinforcement: Any consequence that increases the likelihood that a response (behavior) will be repeated.
a. Positive Reinforcement: Strengthens a behavior because the subject receives something they want

Positive Reinforcement

b. Negative Reinforcement: Also strengthens the behavior, because something the subject doesn’t like is removed
Ex: “If you get all A’s, you won’t have to mow the yard this summer.”
Extrinsic rewards?

Negative Reinforcement

Positive/Negative Reinforcement

2. Primary vs. Secondary Reinforcers
Primary Reinforcement: Something that is naturally reinforcing
Ex: food, warmth, water, etc.
Secondary Reinforcement: Something that a person has learned to value; often paired with a primary reinforcer
Ex: Kind words

Schedules of Reinforcement

Immediate vs. Delayed Reinforcement
1. Immediate: given immediately after the desired behavior (more effective)
2. Delayed: given after a period of time following the desired behavior

3. Continuous Reinforcement: a reward follows every correct response
• Most useful way to establish a behavior
• The behavior will extinguish quickly once the reinforcement stops.

4. Partial Reinforcement: a reward follows only some correct responses

a. Fixed Interval: rewarding a response after some defined period of time (ex: you get paid $9/hour)
b. Variable Interval: rewarding a response after an unpredictable amount of time (ex: your paycheck is sometimes late)

5. Fixed Ratio: rewarding a response after a defined number of correct responses (ex: paid $5 for every 3 pages you write)
• The faster the subject responds, the more reinforcements they will receive.
6. Variable Ratio: rewarding a response after an unpredictable number of correct responses (ex: slot machine)
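
The four partial schedules differ only in what triggers the next reward: a count of responses (ratio) or the passage of time (interval), fixed or unpredictable. The sketch below is one possible way to simulate that rule, not a model from the slides. All function names are hypothetical, and the uniform random draws used for the "variable" schedules are an assumption made for illustration.

```python
import random


def ratio_schedule(n_responses, next_ratio):
    """Return the 1-based positions of the responses that earn a reward."""
    rewarded = []
    needed = next_ratio()          # responses required for the next reward
    count = 0
    for i in range(1, n_responses + 1):
        count += 1
        if count >= needed:
            rewarded.append(i)
            count = 0
            needed = next_ratio()
    return rewarded


def interval_schedule(response_times, next_interval):
    """Return the response times that earn a reward.

    A reward becomes available once the current interval has passed since
    the last reward (the clock starts at 0); the first response made after
    that point earns it.
    """
    rewarded = []
    available_at = next_interval()
    for t in sorted(response_times):
        if t >= available_at:
            rewarded.append(t)
            available_at = t + next_interval()
    return rewarded


def fixed_ratio(n_responses, ratio):
    return ratio_schedule(n_responses, lambda: ratio)


def variable_ratio(n_responses, mean_ratio, seed=0):
    rng = random.Random(seed)
    # Assumption: required count drawn uniformly from 1 .. 2*mean_ratio - 1.
    return ratio_schedule(n_responses,
                          lambda: rng.randint(1, 2 * mean_ratio - 1))


def fixed_interval(response_times, interval):
    return interval_schedule(response_times, lambda: interval)


def variable_interval(response_times, mean_interval, seed=0):
    rng = random.Random(seed)
    # Assumption: wait drawn uniformly between 0 and twice the mean.
    return interval_schedule(response_times,
                             lambda: rng.uniform(0, 2 * mean_interval))


print(fixed_ratio(9, 3))                                # [3, 6, 9]
print(fixed_interval([1, 2, 5, 6, 7, 11, 12, 16], 5))   # [5, 11, 16]
```

Under a ratio schedule, doubling the number of responses doubles the rewards; under an interval schedule, extra responses made before the interval has elapsed earn nothing.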

Strengthening Power →

Variable Ratio
Pros: High levels of response
Cons: May not want to take your chances

Fixed Ratio
Pros: More effort = more reward
Cons: Depends on how much you value the reward

Fixed Interval
Pros: No over- or under-exertion
Cons: No personal control over availability of the reward

Variable Interval
Pros: Keeps you guessing
Cons: May cause you to quit or give up

3. Aversive Control
• Any consequence that decreases the likelihood that a behavior will be repeated
• An undesirable event

Aversive Control
Using an unpleasant stimulus to condition
Negative reinforcement: take away something unpleasant or painful.
• Escape: behave in a way to STOP an unpleasant or painful thing.
• Avoidance: behave in a way to PREVENT an unpleasant or painful thing from starting.
Punishment: inflict something unpleasant or painful to make you STOP
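
The terms in these notes can be told apart by two questions: is the stimulus wanted or unpleasant, and is it added or removed? The lookup below is a rough sketch of that decision, using hypothetical function and label names. It covers only the three cases these notes define; the fourth combination (removing something wanted) is deliberately left unclassified.

```python
def classify_consequence(stimulus, change):
    """Map (stimulus, change) to (term, effect on behavior).

    stimulus: "wanted" or "unpleasant"
    change:   "added" or "removed"
    Returns None for combinations these notes do not define.
    """
    table = {
        ("wanted", "added"): ("positive reinforcement", "behavior increases"),
        ("unpleasant", "removed"): ("negative reinforcement", "behavior increases"),
        ("unpleasant", "added"): ("punishment", "behavior decreases"),
    }
    return table.get((stimulus, change))


def negative_reinforcement_kind(already_started):
    """Escape stops an unpleasant thing; avoidance prevents it from starting."""
    return "escape" if already_started else "avoidance"


print(classify_consequence("unpleasant", "removed"))
print(negative_reinforcement_kind(already_started=False))
```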

Problems with Punishment
• Only works when the punisher is around
• May cause fear, anxiety, or low self-esteem
• Teaches aggression as a means to solve problems.
• Doesn’t teach the appropriate or desired behavior.

D. Operant Conditioning Procedures

1. Shaping
• Reinforcing behaviors that are increasingly similar to the behavior you want
• Is used to establish a new behavior
• (Ex: training an animal to jump through a hoop)
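
Shaping can be pictured as a reward criterion that moves toward the target each time it is met. The following is a minimal sketch under stated assumptions: the jump heights, the 60 cm hoop, the fixed 20 cm step, and the function name `shape` are all made up for illustration and do not come from the slides.

```python
def shape(attempts, target, step):
    """Reward attempts that meet a criterion which moves toward `target`.

    Returns a list of (attempt, criterion_at_the_time, rewarded) tuples.
    """
    criterion = step               # start with an easy requirement
    log = []
    for attempt in attempts:
        rewarded = attempt >= criterion
        log.append((attempt, criterion, rewarded))
        if rewarded and criterion < target:
            # Raise the bar so only closer approximations are rewarded next.
            criterion = min(target, criterion + step)
    return log


# Hypothetical jump heights in cm; the hoop is assumed to sit at 60 cm.
for attempt, criterion, rewarded in shape([10, 25, 20, 45, 60, 65],
                                          target=60, step=20):
    print(attempt, criterion, rewarded)
```

In this run the 20 cm jump goes unrewarded because it comes after the criterion has already risen to 40 cm, which is the sense in which rewarded behaviors must be increasingly similar to the target.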

Chaining
Response chain: linking individual parts of something together to complete the final action
Ex: swimming