Use LEFT and RIGHT arrow keys to navigate between flashcards;
Use UP and DOWN arrow keys to flip the card;
H to show hint;
A reads text to speech;
43 Cards in this Set
- Front
- Back
operant conditioning
|
learning from the consequences of ones actions
|
|
positive/negative reinforcement/punishment
|
positive=added
negative=taken away reinforcement=more likely to do again |
|
negative reinforcement subtypes
|
escape-experience, then action
avoidance-action before experienced anything |
|
reinforcer
|
consequence that strengthens behavior
|
|
primary reinforcer
|
positive: tied to biological need: food water etc.
negative: escape |
|
secondary reinforcer
|
reinforcer (sR) with acquired value by being paired with an established reinforcer (sR) or exchanged with an established reinforcer (sR)
ex: money or praise |
|
secondary reinforcer subtypes
|
standard: exchange for one type of primary reinforcer
generalized conditioned reinforcer: can be exchanged for multiple reinforcers |
|
extrinsic reinforcer
|
getting an external reinforcer for performing a behavior
ex: reading book for good grade |
|
intrinsic reinforcer
|
reinforcement simply by performing the behavior
ex: reading book for pleasure |
|
shaping
|
using behavior subject already does and shape it into behaviors you want
|
|
continuous reinforcement (CRF)
|
every response is reinforced
|
|
ratio schedule
|
reinforcement depends on the number of times a response is repeated
|
|
interval schedule
|
reinforcement depends on the passage of time plus a response
|
|
fixed schedule
|
reinforcer is predictable
fixed ratio (FR)= get reinforcer after x responses, graphed steps fixed interval (FI)= get reinforcer after x seconds plus one response, graphed scallops |
|
variable schedule
|
reinforcer is unpredictable
variable ratio (VR):get reinforcer after about every x responses, graphed steep slope variable interval (VI): get reinforcer after about every x seconds, plus one response, graphed moderate slope |
|
skinner's view of reinforcement
|
if something acts to change B, then it is a reinforcer; a reinforcer is a stimulus that reinforces the operant response
|
|
two types of reinforcement theories
|
reinforcers are stimuli (Hull) or reinforcers are activities (sr not food but is eating the food)
|
|
Hulls drive reduction theory
|
a reinforcer as stimuli theory
stimuli that reduce a biological need or reduce strong aversive stimuli will become reinforcers problem was found stimuli with high reinforcement that didnt reduce any drives. |
|
premack's principle
|
reinforcement is a contingency between two behaviors, instrumental reinforcer and contingent reinforcer.
getting to do the second behavior is contingent upon first performing the instrumental behavior. measure using free baseline, what organism does with no input. more probable behaviors reinforce less probable. significant effect is performing the instrumental behavior above baseline level |
|
shortfalls of premack's principle
|
assumes activities preformed most are preferred.
make subject perform x longer than it normally would to allow it to perform y as long as it usually does. |
|
response deprivation theory
|
a reinforcement effect will occur only when the reinforcement contingency deprives the subject of the contingent activity.
performing baseline amount of instrumental response earns you less than baseline amount of contingent response. make subject perform instrumental behavior at a higher level than its baseline level of the contingent behavior. can increase time required for instrumental or decrease time allowed for contingent. contingent ratio > baseline ratio then contingency should work |
|
problem with response deprivation
|
subjects sometimes compromise instead of performing as expected (extreme conditions)
|
|
bliss point theory
|
baseline= behavioral ideal
any contingency disrupts the 'bliss point' so subject increases frequency of instrumental behavior as a compromise. |
|
operant extinction
|
no longer reinforcing the operant behavioral response
|
|
effects of extinction
|
extinction burst
emotional behaviors (frustration, aggression) increase behavior variability resurgence depression spontaneous recovery |
|
discriminative stimuli
|
stimulus where rewarded when present and when absent not reinforced.
only reinforce a lighted key peck, not dark key. |
|
stimulus control
|
presence of a discriminative stimulus reliably affects the probability of the behavior.
more likely to pack a lighted key if that is reinforced. |
|
stimulus generalization
|
tendency for an operant response to be emitted in presence of stimulus similar to discriminative stim.
'generalization gradient' |
|
stimulus discrimination
|
tendency for an operant response to be emitted more in the presence of one stimulus than another.
'picky pigeons' with steep bell curve |
|
discrimination training/ discriminative stimulus for extinction
|
reinforcement of responding in the presence of one stimulus but not in the presence of another
the stimulus that signals the absence of reinforcement |
|
intradimensional discriminative training
|
both discrim. stimulus and discrim stim for extinction from the same dimension
|
|
interdimensional discriminative training
|
discrim. stimulus and discrim stim for extinction from the different dimensions
having a stimulus being on or off counts as this |
|
peak shift
|
only occurs with intradimensional training
peak moves away from Sdelta and no longer centered over discriminatory stimulus. |
|
signaled avoidance
|
signal stimulus preceded aversive stimulus
|
|
non signaled avoidance
|
no indication of aversive stimulus
|
|
two factory theory of avoidance
|
factor one: classical conditioning of fear to a signal for the aversive stimulus
factor two: operant conditioning of the avoidance response which decreases the CR of fear two factor only works with fixed time intervals, variable time is always afraid |
|
one factor theory
|
reduction of aversive stim in long term reinforces behavior
|
|
one factor/two factor compound
|
two factor = 'molecular', what happens immediately after the behavior. signaled conditions
one factor = 'molar' concentrate on larger time scales. non signaled/variable |
|
basic punishment rule
|
punisher must be more intense than positive reinforcer
|
|
effectiveness of punisher
|
reduces the behavior quickly, reduces the behavior significantly
|
|
factors influencing effective punishment
|
intensity
method of introduction (ineffective if starts low and builds up) immediacy contingency frequency discrimination (best if not discriminative stim. signaling the punisher is in effect) |
|
problems with punishment
|
aggression
agent delivering punishment becomes aversive doesnt teach new behavior encourage unwanted avoidance behavior negatively reinforce agents behavior (lead to abusive behavior example) |
|
ideal punishment combination
|
punish inappropriate b with positive reinforcement of alternate incompatible new behavior
|