PSY 402 Theories of Learning Chapter 10 Stimulus

  • Slides: 56
Download presentation
PSY 402 Theories of Learning Chapter 10 – Stimulus Control of Behavior

PSY 402 Theories of Learning Chapter 10 – Stimulus Control of Behavior

The Role of Environmental Stimuli o In operant conditioning, the stimulus becomes associated with

The Role of Environmental Stimuli o In operant conditioning, the stimulus becomes associated with the reinforcer or punishment. n n o Reward or punishment is the UCS. The stimulus signaling reward or punishment is the CS. The CR then motivates operant behavior. n Operant responding can be used as a measure of the strength of a CR.

Definitions of Terms o o o Stimulus control -- Environmental stimuli signal the opportunity

Definitions of Terms o o o Stimulus control -- Environmental stimuli signal the opportunity for reward or punishment. Generalization – responding in the same way to similar stimuli. Discrimination – responding to some stimuli but not to others.

Generalization Gradient o Degrees of generalization occur. n n o In some situations, the

Generalization Gradient o Degrees of generalization occur. n n o In some situations, the same response occurs to similar stimuli. In other situations, the amount of response varies along with the similarity. Generalization gradient – a graph showing how the strength of response changes with similarity. n Steep gradients mean narrow response (stimuli must be very similar).

Kinds of Gradients o Excitatory conditioning (S+) – a CS-UCS response to a stimulus

Kinds of Gradients o Excitatory conditioning (S+) – a CS-UCS response to a stimulus is learned. n o Excitatory gradient – the S+ is varied and the CR is measured. Inhibitory conditioning (S-) – a CS signals absence of the UCS and thus inhibits the CR. n Inhibitory gradient – the S- is varied and the CR is measured.

Wavelengths of Light

Wavelengths of Light

Visible Color Spectrum

Visible Color Spectrum

yellow-orange yellow-green bluegreen orangeyellow orange-red orange red

yellow-orange yellow-green bluegreen orangeyellow orange-red orange red

Gradients Using Four Wavelengths 580 = yellow 550 = green

Gradients Using Four Wavelengths 580 = yellow 550 = green

Gradient Using Tone & Shock The less the tone sounds like the original stimulus,

Gradient Using Tone & Shock The less the tone sounds like the original stimulus, the less fear (measured in galvanic skin response, GSR)

Discrimination o o The shape of the gradient can be changed by training. When

Discrimination o o The shape of the gradient can be changed by training. When birds are exposed to two different tones (S+ or S-), they must discriminate between them. n n Responding is less generalized because the competing tone produces no reward. The shape of the gradient becomes steeper and more narrow at the top.

With no discrimination, subjects Respond to every tone. With two or more tones, requiring

With no discrimination, subjects Respond to every tone. With two or more tones, requiring discrimination, only the rewarded tone elicits responses, depending on the S- tone used during training.

The sharpness of the generalization gradient depends on the type of training

The sharpness of the generalization gradient depends on the type of training

Flat Gradients o o A flat gradient means all stimuli are being responded to

Flat Gradients o o A flat gradient means all stimuli are being responded to as if they were the same. Responding with a gradient to a tone occurred only when the tone signaled reward during training.

Tone vs No-Tone During Training no tone Flat gradient Experimental subjects were trained to

Tone vs No-Tone During Training no tone Flat gradient Experimental subjects were trained to attend to the tone whereas control subjects were not.

Generalization of Inhibition o Inhibition example: fear of dating. n o A good experience

Generalization of Inhibition o Inhibition example: fear of dating. n o A good experience with one person leads to less fear of dating the next person. Inhibition gradients are similar to excitatory gradients – the more the stimulus varies, the less inhibition.

Excitatory and Inhibitory Generalization with Line Tilt Stimuli

Excitatory and Inhibitory Generalization with Line Tilt Stimuli

Inhibitory Gradients – Line Tilt

Inhibitory Gradients – Line Tilt

Explanation o Lashley-Wade theory – people and animals generalize because they are unable to

Explanation o Lashley-Wade theory – people and animals generalize because they are unable to discriminate. n n o o Can’t tell the difference between stimuli A contrast is needed during training to enable discrimination. Discrimination training leads to steeper generalization gradients (see Fig 10. 3). Perceptual experience matters (Fig 10. 5).

Ducks Raised in Monochromatic Light Cannot Discriminate Based on Color Ducks in monochromatic light

Ducks Raised in Monochromatic Light Cannot Discriminate Based on Color Ducks in monochromatic light Ducks in white light (with all wavelengths)

Discrimination Learning o o In survival terms, it is important to recognize when reinforcement

Discrimination Learning o o In survival terms, it is important to recognize when reinforcement is not available so that responding can be withheld. Discriminative stimulus: n n o SD – reinforcement is available (S+) S – reinforcement is unavailable (S-) Conditioned stimuli always produce a response. Discriminative stimuli signal the opportunity to respond.

Two-Choice Discrimination Tasks o The discriminative stimuli are on the same dimension: n o

Two-Choice Discrimination Tasks o The discriminative stimuli are on the same dimension: n o o o Red vs green light. Dimension = hue. Need not be presented simultaneously. Two-choice discrimination includes one SD and one S . Other tasks can use multiple SD or multiple S.

Categorization and Discrimination o Animals respond to stimuli in ways that suggest they form

Categorization and Discrimination o Animals respond to stimuli in ways that suggest they form categories. n n n Pigeons can classify a variety of items, including new images not seen before. The items to be learned as members of a category are SD and signal opportunity for food. The items that are not members of the category are S and signal that pecking will not be rewarded.

Test Slides – Tree Category

Test Slides – Tree Category

Test Slides – Water Category

Test Slides – Water Category

Test Slides -- Margaret Category

Test Slides -- Margaret Category

More Complex Tasks o Later pigeons were asked to place images into four categories

More Complex Tasks o Later pigeons were asked to place images into four categories by pressing one of four buttons (rewarded by food if correct). n o o They are “naming” the object shown. Pigeons do equally well with natural and manufactured objects (cars, chairs). Transfer to new stimuli is worse but above chance.

Apparatus (Part 1)

Apparatus (Part 1)

Examples of positive images

Examples of positive images

Examples of positive images

Examples of positive images

Three Phases o o o Subjects begin by responding equally to both stimuli –

Three Phases o o o Subjects begin by responding equally to both stimuli – prediscrimination phase. Discrimination phase -- with training, response to SD increases and response to S declines. Shift back to non-differential reinforcement to show that behavior was caused by reinforcement.

As Reinforcement Changes, so Does Responding

As Reinforcement Changes, so Does Responding

Conditional Discrimination o Availability of reinforcement depends on the condition of a stimulus. n

Conditional Discrimination o Availability of reinforcement depends on the condition of a stimulus. n o o The stimulus does not always signal the same thing. More difficult to learn. Nissen’s chimpanzees: n n Large, small squares, white or black. SD = large when white but small when black.

Behavioral Contrast o o Behavioral contrast – the increased responding to the differential stimulus,

Behavioral Contrast o o Behavioral contrast – the increased responding to the differential stimulus, decreased response to S Contrast also occurs with changes in the duration of reinforcement. n o o VI-10 to VI-3 Local contrast – may be emotional, fades Sustained contrast – related to the differential reinforcement.

Occasion Setting o o A conditioned stimulus (CS 1) can create the conditions for

Occasion Setting o o A conditioned stimulus (CS 1) can create the conditions for operant responding to a seconditioned stimulus (CS 2). Occasion setting – ability of one stimulus to enhance the response to another stimulus. n The facilitating stimulus does not produce a CR by itself – so this is not higher order conditioning.

SD as an Occasion Setter o o A Pavlovian occasion-setter can increase operant responding.

SD as an Occasion Setter o o A Pavlovian occasion-setter can increase operant responding. Example: n n o A meal elicits CR craving for cigarette. Requesting a cigarette after a meal – an operant behavior caused by CR. Conditional occasion-setting: n Second stimulus modifies meaning of first discriminative stimulus.

How it Works

How it Works

Conclusions o o o An occasion-setter can increase operant responding. A discriminative stimulus (SD)

Conclusions o o o An occasion-setter can increase operant responding. A discriminative stimulus (SD) can increase response to a CS (Pavlovian conditioning). This implies interchangeability of Pavlovian occasion-setters and discriminative stimuli.

Occasion Setters Increase Responding

Occasion Setters Increase Responding

Peak Shift o When both inhibitory and excitatory stimuli are conditioned, inhibition changes the

Peak Shift o When both inhibitory and excitatory stimuli are conditioned, inhibition changes the shape of the gradient. n n o Peak shift – maximum responding occurs to a stimulus not previously trained as the S+. The peak shifts away from the S- stimulus. The amount of response is the difference between inhibitory and excitatory conditioning.

Hypothetical Excitatory and Inhibitory gradients Spence subtracts the inhibition on the next slide from

Hypothetical Excitatory and Inhibitory gradients Spence subtracts the inhibition on the next slide from this excitation

Hypothetical Excitatory and Inhibitory Gradients Overall predicted response is less because this amount of

Hypothetical Excitatory and Inhibitory Gradients Overall predicted response is less because this amount of inhibition is subtracted from it.

Peak Shift When the inhibitory stimulus S- is to the right, the peak shifts

Peak Shift When the inhibitory stimulus S- is to the right, the peak shifts left

Errorless Discrimination Learning o o When an S is gradually introduced the pigeon learns

Errorless Discrimination Learning o o When an S is gradually introduced the pigeon learns to inhibit response without making mistakes. Three fading steps are involved: n n n Brief introduction of S for 5 sec-30 sec Slowly change color of S from dark to green Slowly increase duration of S from 30 sec to 3 minutes

Errorless Discrimination Training

Errorless Discrimination Training

Implications of Errorless Training o Errorless learning seems to condition response to SD without

Implications of Errorless Training o Errorless learning seems to condition response to SD without inhibition to S. n o o This means that errorless learning is not aversive. As a result, no peak shift occurs. Errorless learning is harder to condition to some stimuli than others (e. g. , colors but not lines).

Application of Errorless Training o Examples with humans: n n n o Preschool children

Application of Errorless Training o Examples with humans: n n n o Preschool children recognizing shapes using a fading technique. Oral reading. Dorry & Zeaman taught mentally handicapped children to identify vocabulary words (pictures faded out). Not all training works – problems with transfer and with reversed consequences.

Is Learning Relational? o o Are animals learning the relationships between stimuli rather than

Is Learning Relational? o o Are animals learning the relationships between stimuli rather than an absolute response? Transposition occurs when stimuli are changed: n o The brighter of two lights, louder of two tones is responded to. Different results support both views of learning: Hull-Spence & Kohler.

Absolute vs Relational View

Absolute vs Relational View

Predictive Value of SD

Predictive Value of SD

Mackintosh’s Attentional View o Stimuli with multiple dimensions arouse the relevant dimension analyzer. n

Mackintosh’s Attentional View o Stimuli with multiple dimensions arouse the relevant dimension analyzer. n n o This depends on the salience and intensity of the dimension. The predictive value of the dimension determines arousal. Discrimination learning depends on predictiveness.

8. 17 Examples of computer stimuli presented to pigeons by Cook (Part 1)

8. 17 Examples of computer stimuli presented to pigeons by Cook (Part 1)

8. 17 Examples of computer stimuli presented to pigeons by Cook (Part 2)

8. 17 Examples of computer stimuli presented to pigeons by Cook (Part 2)

8. 17 Examples of computer stimuli presented to pigeons by Cook (Part 3) Less

8. 17 Examples of computer stimuli presented to pigeons by Cook (Part 3) Less popout with conjoined features.

8. 18 “Same” and “different” displays used in the experiment by Wasserman et al

8. 18 “Same” and “different” displays used in the experiment by Wasserman et al

Continuity Theory o Hull-Spence suggest that excitation and inhibition gradually increase with trials. n

Continuity Theory o Hull-Spence suggest that excitation and inhibition gradually increase with trials. n o Non-continuity theory suggests that a hypothesis is formed & tested. n o Excitation to SD, inhibition to S. Learning occurs rapidly with attention to the right dimension. There is support for both theories.