Use LEFT and RIGHT arrow keys to navigate between flashcards;
Use UP and DOWN arrow keys to flip the card;
H to show hint;
A reads text to speech;
136 Cards in this Set
- Front
- Back
Thorndike's law of effect |
behaviors leading to a satisfactory outcome are strengthened, and behaviors leading to an unsatisfactory outcome are weakened or eliminated (Ex: reaching through cage= no food, so don't do) |
|
puzzle box |
a tool developed by Thorndike to study the learning capabilities of cats |
|
operant behavior |
behaviors that are more voluntary (chosen) in nature which seem to be controlled by their consequences (contrast with elicited behaviors) ...any behavior impacted by its consequences |
|
*****elicited behavior***** |
to draw out,
those behaviors that are typically involuntary, AUTOMATIC,rely on a stimulus to occur (Ex: salivation, blinking, sneezing, jumping at noises...) |
|
skinner tried to test/observe |
learning by consequence |
|
skinner box/operant chamber |
a device developed by skinner that can control those environments in which an organism can operate |
|
operant conditioning |
teaching through consequence |
|
response |
any behavior exhibited by the organism |
|
consequence |
the result of the behavior |
|
discriminative stimulus (S sub D) |
some aspect of environment that impacts the way a consequence is applied (Ex: "go easy on mommy because she is frustrated/tired") |
|
reinforcement |
an event that follows a behavior that INCREASES the likelihood of that behavior to occur again in the future (Ex: at camp= deposits) |
|
punisher |
an event that follows a behavior that DECREASES the likelihood of that behavior to occur again in the future (Ex: at camp= withdrawals) |
|
antecedents |
what happens before |
|
discriminative stimulus for reinforcement |
any stimulus that signals a behavior will be reinforced |
|
discriminative stimulus for punishment |
any stimulus that signals a behavior will be punished |
|
three term contingency |
all three conditions predict the future likelihood of a performed behavior. they are: discriminative stimulus, response/behavior, and consequence (reinforcer or punisher) |
|
4 types of contingencies |
pos./neg. reinforcement, pos./neg. punishment |
|
reinforcement is like... |
math! (positive means add, negative means subtract) |
|
positive reinforcement |
PRESENTATION (adding) of a stimulus following a response that INCREASES the likelihood of that response in the future |
|
negative reinforcement |
REMOVAL (subtracting) of a stimulus following a response that INCREASES the likelihood of that response in the future |
|
escape behavior |
terminates an aversive stimulus (think: it is happening now, get away!) |
|
avoidance behavior |
prevents delivery of aversive stimulus (think: it hasn't happened yet, so I'll avoid it early) |
|
positive punishment |
PRESENTATION (adding) of a stimulus following a response that DECREASES the likelihood of that response in the future |
|
negative punishment |
REMOVAL (subtracting) of a stimulus following a response that DECREASES the likelihood of that response in the future (Ex: no TV) |
|
hint: if behavior is increasing then it is |
reinforcement |
|
hint: if behavior is decreasing then it is |
punishment |
|
immediate positive reinforcement |
tend to be the most effective (want to reinforce as quickly as possible) |
|
delayed positive reinforcement |
may result in no association between behavior and reinforcement
|
|
primary positive reinforcement |
properties (the actual thing(s)) reinforce. these may run out. (Ex: food, alcohol, sex)
|
|
secondary positive reinforcement |
associations reinforce. these have infinite use. (Ex: clicker-trained dogs)
|
|
generalized secondary positive reinforcement |
associate with many other reinforcements (Ex: CA$H!)
|
|
intrinsic positive reinforcement |
behavior is reinforced in and of itself
|
|
extrinsic positive reinforcement |
behavior must have a rewarding consequence to be repeated. this decreases motivation (Ex: getting paid for good grades)
|
|
shaping (AKA: neat tricks) with positive reinforcement |
a process used to teach an organism to engage in a particular behavior
|
|
remember: you can't train away an |
instinct |
|
successive approximations are needed to achieve |
shaping (AKA: neat tricks) |
|
successive approximations complete procedure |
select behavior break-down into tiny pieces selectively reinforce each tiny action (Ex: progressions for sit, lay-down, and shake) |
|
schedules of reinforcement |
the response requirement that must be met before reinforcement is delivered |
|
continuous reinforcement |
reinforcement after each separate performance of a given behavior |
|
intermittent reinforcement |
reinforcement following a set requirement of number of behaviors performed or amount of time passed since prior reinforcement. not reinforcing behavior every time. more special= more of a reaction due to depravation |
|
4 basic schedules of reinforcement |
fixed ratio, variable ratio, fixed interval, variable interval |
|
ratio schedule |
set number of responses |
|
*****fixed ratio (FR)***** |
a set number of responses must be performed before reinforcement will be delivered. CERTAIN and PREDICTABLE. (Ex: FR(5)= press bar 1, 2, 3, 4, 5... #5 gets reward like how Liza gets a prize for every 5 pieces of paper... FR(1)= reward for every behavior present) |
|
variable ratio (VR) |
a varying unpredictable number of responses that must be made before reinforcement will be delivered (Ex: VR(5)= average number of reinforcements)... think: like fishing and slot machines. |
|
fixed ratio vs. variable ratio shows... |
fixed ratio= lower motivation because you know where you are going variable ratio= more response to get most possible rewards, so more motivation |
|
*****fixed interval (FI)***** |
a set period of time must pass before reinforcement will be delivered; after that period has passed, the next time that behavior is performed it will be reinforced. CUMULATIVE RESPONSE PATTERN THAT IS A REPEATED, UPWARDLY SCALLOPED PATTERN. (Ex: FI(30 sec)= press bar, 30 sec pass, reinforcement) think: 10:45- 10:50= check clock over and over again to see if class is over |
|
variable interval (VI) |
a varying, unpredictable period of time must pass before reinforcement will be delivered; after that period has passed, the next time that behavior is performed it will be reinforced (this is about the slowest rate of responding) |
|
response-rate schedule of reinforcement |
reinforcement of rates of responding, rather than number or time period |
|
differential reinforcement HIGH (DRH) |
reinforcement of HIGH rates of responding (Ex: buy a new car at close enough to the end of the month will mean it will be cheaper. the cheapest is before january) |
|
differential reinforcement LOW (DRL) |
reinforcement of LOW rates of responding (Ex: with autistic kids, if they make eye contact for a certain amount of time, they get a reinforcement) |
|
differential reinforcement PACED (DRP) |
reinforcement of a consistent rate of responding (Ex: like pacing in a race) |
|
non-contingent schedules of reinforcement |
schedule of reinforcement not contingent on responses |
|
fixed time schedule of reinforcement |
reinforcement is delivered based on time interval instead of following behavior |
|
fixed interval vs. fixed time difference is... |
FI= have to do behavior to get reinforcement FT= get reinforcement after time |
|
variable time schedule of reinforcement |
reinforcement is delivered on an unpredictable time interval instead of following behaviors |
|
superstitious behavior |
strange, RANDOM behavior performed in hopes of controlling reinforcement. more prevalent in low reinforcement activities (Ex: baseball) |
|
chained schedule of reinforcement |
sequence of 2 or more simple schedules, each resulting in reinforcement (Ex: FR(10) --> FR(5) --> FR(1)... this is a homogenous chain) |
|
homogenous chain of reinforcement |
same type of response (like pecking a key) to gain reinforcement (same behavior). more like real life. |
|
goal gradient effect |
increased activity as an organism nears a goal (Ex: when writing a paper, the first paragraph is the worst to start, but as you finish the body, the speed of completion increases (different story for the conclusion paragraph, though)) |
|
heterogenous chain of reinforcement |
different types of responses (running, jumping, pecking a key) to gain reinforcement (different behaviors) (Ex: college- each semester/class is different... obstacle course) |
|
drive reduction theory |
an event is reinforced because it REDUCES a physiological drive or need (Ex: food as a reinforcer) |
|
incentive motivation |
motivation derived from a property of a reinforcement (Ex: money gets you food, water, shelter...) |
|
Premack principle |
a high-probability behavior (what you want to do) can reinforce a low probability behavior (what you don't want to do) (Ex: work, then play... play will reinforce the work) |
|
response deprivation hypothesis |
any behavior that has a baseline of occurrence that is most pleasing to the organism. any behavior can be a reinforcer when access to the behavior is restricted and/or frequency falls below its desired level of occurrence. (Ex: not being able to walk/ hold kids) |
|
behavioral bliss point approach |
when an organism can act freely in its environment, it will choose activities that will maximize reinforcement. assumes rationality. sounds good, but not in real life. (Ex: win lotto... what will you eat tomorrow?... it gets boring) variety can be reinforcing! |
|
extinction |
the process by which an operant behavior is eliminated or greatly reduced in occurence. non-reinforcement following the exhibition of the target behavior. trying to get a behavior to go away without punishment. |
|
extinction burst |
temporary increase in frequency and intensity of responding |
|
increase in variablility |
behavior will become variable, organism will try to adjust for reinforcement |
|
emotional behavior |
organism may become agitated or violent |
|
aggressive behavior |
organism may become violent or unusually agressive |
|
resurgence |
reappearance of other behaviors that once produced a reinforcement |
|
depression |
organism may reduce activity levels to an extreme degree, which is usually associated with this label |
|
*****resistance to extinction***** |
the extent to which responding persists during extinction. will likely be strongest following plenty of reinforcement and experience on an FR100 schedule |
|
schedule of reinforcement |
behavior maintained on a partial reinforcement schedule will be more resistant to extinction |
|
history of reinforcement |
more reinforcers an organism has earned for a behavior, the more resistant to extinction it will be (Ex: MB having a history of changing spark plugs; when the next appliance didn't get fixed because of that, freaked) |
|
magnitude of reinforcer |
large-magnitude reinforcers for a behavior are more resistant to extinction than small-magnitude reinforcers |
|
degree of deprivation |
the more deprived state of an animal, the more resistant to extinction it will be (deprived from the reinforcer) |
|
experience with extinction |
the more an organism has been exposed to extinction, the less resistant to extinction it becomes |
|
signal for extinction |
if a discriminative stimulus for extinction is present, the organism becomes less resistant to extinction |
|
*****spontaneous recovery***** |
the reappearance of extinguished behaviors following a rest period (Ex: Reid crying the next day for phone, but shorter) |
|
differential reinforcement of other behavior (DRO) |
the reinforcement of all behavior other than the target behavior can facilitate extinction (Ex: animal training- install the behavior with reinforcement and then stop reinforcing it to make it go away... doesn't work for instincts) |
|
stimulus control |
creating a circumstance where you make a behavior happen. most useful to animal and child training. reliable stimulus-response |
|
stimulus generalization |
tendency to perform an operant response if a stimulus similar to a discriminative stimulus (S sub D) is present (Ex: Annie is more receptive to heightened emotions and is more obedient, varied tones/pitches lead to reacting differently) |
|
generalization gradient |
graphic representation of a range of discriminative stimuli |
|
stimulus discrimination |
learned differences among discriminative stimuli. the closer to the discriminative stimuli, the more bar pressing |
|
discriminative stimulus for extinction (S sub delta) |
a stimulus that signals that reinforcement will not be delivered |
|
*****the peak shift effect***** |
the peak of the generalization gradient will shift from the original (S sub D) to a point further removed from the (S sub delta), following discrimination training. when attempting this, "train with 2, test with many" is a good rule of thumb. |
|
multiple schedules |
2 or more independent schedules presented in sequence, each resulting in reinforcement and each having a separate (S sub D) |
|
*****behavioral contrast***** |
a change in a rate of reinforcement on one schedule equals an opposite change in the rate of responding on another schedule |
|
negative behavioral contrast |
an INcrease in the rate of reinforcement on one component produces a DEcrease in the rate of responding on the other component (behavior is DEcreasing, so rate of reinforcement is INcreasing) the first component changes to become more attractive, so the second component seems less attractive, hence, less responding. |
|
positive behavioral contrast |
a DEcrease in the rate of reinforcement on one component results in an INcrease of responding on the other component. as the first becomes less attractive, the unchanged appears more attractive |
|
anticipatory contrast |
rate of response varies inversely with an upcoming change in the rate of reinforcement. animal must have an (S sub D... discriminative stimulus) that signals impending change once signal is given, animal will begin vigorously obtaining reinforcement |
|
fading |
gradually altering the entensity of a (S sub D... discriminative stimulus) |
|
errorless training |
training that utilizes fading that usually results in no responding to the (S sub delta... discriminative stimulus for extinction) |
|
process for errorless training |
train with (S sub D... discriminative stimulus) nomally introduce brief, faint (S sub delta... discriminative stimulus for extinction) gradually increase stregnth of (S sub delta... discriminative stimulus for extinction) |
|
targeting |
differentially reinforce looking at a particular target, then using this target as an (S sub delta... discriminative stimulus for extinction) (Ex: lazer pointer at the zoo) |
|
placing behavior on "cue" |
give an undesirable behavior an (S sub D... discriminative stimulus) or cue, such that it comes to only be performed on that cue (pipe dream... not easy) |
|
insomnia |
make certain behaviors or items cues for sleep |
|
escape behaviors |
behaviors that alleviate or terminate an aversive stimulus |
|
avoidance behaviors |
behaviors that prevent an aversive stimulus |
|
2-process theory of avoidance |
avoidance is made up of 2 distinct processes... 1- classical conditioning of a fear to a CS (a discriminative stimulus in the operant chamber) 2- operant conditioning in the negatively-reinforced reduction in fear (Ex: light causes fear, avoidance reduces this fear) |
|
anxiety conservation hypothesis |
explains why fear conditioning won't extinguish; the organism immediately "copes" with anxiety, not giving it exposure enough to extinguish |
|
one-process theory |
only the lower rate of shock is reinforced, maintaining the behavior |
|
species-specific defense reaction |
avoidance behavior actually elicited rather than operant |
|
similarities between phobias and avoidance behaviors |
development of a fear response, failure of fear response to extinguish, |
|
differences between phobias and avoidance behaviors |
in experimental avoidance, animals avoid the US, not the CS as in phobias avoidance response tends to occur after a number of trials, not just one trial as in phobias |
|
phobias in a lab setting example |
creation of a track with a conveyor belt, belt not moving, rat could explore, a dark-walled section led to a dark compartment, shock delivered in the dark, the rat hung out in total light, to avoid the darkness= more phobic than avoidance behavior. |
|
obsessive-compulsive disorder is comprised of |
obsessions and compulsions |
|
obsessions are |
intrusive thoughts, increase anxiety |
|
compulsions are |
actions that address the thoughts, decrease anxiety |
|
anal retentive means that you |
hold your poop in |
|
*****OCD as avoidance behavior, similarities***** |
disease reported as avoidance- behaviors prevent an unfavorable outcome, obsessions increase anxiety, compulsions decrease anxiety: very reinforcing through NEGATIVE REINFORCEMENT |
|
exposure and response prevention (ERP) |
treatment for OCD that involves exposing them to their anxiety- causing event (obsession) and preventing them from the response (compulsion)... can be quite effective because it is like in-vivo flooding... it leads to extinction |
|
time out |
loss of access to positive reinforcement for a set period of time |
|
response cost |
removal of a specific reinforcer |
|
intrinsic punishment |
the behavior itself is punishing (Ex: putting hand on hot stove or sparkler) |
|
extrinsic punishment |
not inherent to the behavior, instead applied following the behavior |
|
primary punishers |
events that are naturally punishing |
|
secondary punishers |
those events that are punishable through association (Ex: scowling or glaring) |
|
generalized secondary punishers |
associated with many other punishers (Ex: how are you? "FINE"= maybe not) |
|
problems in the application of punishment |
doesn't strenghten good behavior, only eliminates target behavior (you can't punish a kid into being good), deliverer/punisher could become a discriminative stimulus for punishment (Ex: Burns's Dad), may only teach avoidance of deliverer (Ex: avoid Burns's Dad), likely to elicit strong emotional responses (Ex: anger or fear)... this is why you talk it out before punishment, through observational learning, may teach punished that punishing is acceptable means of controlling behavior, use of punishment itself is extremely reinforced (they will comply) |
|
appropriate use of punishment |
should be immediate rather than delayed (especially to things that don't speak- dogs), should ideally follow each occurence of behavior (stay predictable and consistent), intense enough from outset to suppress the target behavior (may change over time), punishment should be accompanied by a verbal explanation (for humans) since it takes a while for impulse control to develop in children |
|
punishment is more |
efficient than reinforcement, but be careful! |
|
conditioned suppression theory |
punishment doesn't weaken a behavior, but really produces an emotional response that interferes with the occurrence of the behavior. an organism is so upset by being punished that it loses interest in the original punishing behavior |
|
avoidance theory of punishment |
punishment actually involves avoidance conditioning because anything but the target behavior is reinforced by a no-punishment contingency, bad behavior is punished, if avoided, everything is negatively reinforced by decreasing fear |
|
premack approach to punishment |
a low-probability behavior can be used to punish a more probable behavior. something an organism doesn't like can be used as a punishment for engaging in a behavior an organism does like |
|
learned helplessness |
dogs were placed in a shock box. some could escape, some couldn't. once dogs that thought they couldn't escape were given control, they didn't know they had control. this study tried to mimic depression. led to withdrawal, assuming bad things would happen, and external locus of control (Ex: passive behavior in abused people) |
|
noncontingent |
willy-nilly, for no apparent reason |
|
experimental neurosis |
exposed cats to noncontingent shocks or blasts of air while eating lead to cats stopping breathing for a second and aggrivation. calm cats= agitated and non-trusting. feisty cats= withdrawn and passive. |
|
what are the implications for inappropriately applied punishments to animals? |
they may not ascribe blame to where they should, noncontingent punishment can change personality, want punishment to be predictable, emotionless, and immediate. |
|
concurrent schedules- choices |
simultaneous presentation of 2 or more independent schedules, each of which leads to a reinforcement. the organism must choose the behavior between 2 or more options |
|
the matching law
|
the proportion of responses emitted on a particular schedule matches the proportion of reinforcement obtained on that schedule. thus, the proportion of responses will vary directly with the proportion of reinforcements for an option. (Ex: more time playing basketball, should you go for the 2-point shot vs. the 3-point shot?) |
|
deviations from matching law |
undermatching |
|
undermatching |
proportion of responses between the 2 alternatives is less different than would be predicted. occurs when there is a low penalty for switching |
|
ways to induce resistance to extinction |
schedule of reinforcement, history of reinforcement, magnitude of reinforcer, degree of deprivation, experience with extinction, signal for extinction |
|
2 studies that demonstrated the effects of noncontingent punishment were |
experimental neurosis in cats, and learned helplessness in dogs. |