• Shuffle
    Toggle On
    Toggle Off
  • Alphabetize
    Toggle On
    Toggle Off
  • Front First
    Toggle On
    Toggle Off
  • Both Sides
    Toggle On
    Toggle Off
  • Read
    Toggle On
    Toggle Off
Reading...
Front

Card Range To Study

through

image

Play button

image

Play button

image

Progress

1/115

Click to flip

Use LEFT and RIGHT arrow keys to navigate between flashcards;

Use UP and DOWN arrow keys to flip the card;

H to show hint;

A reads text to speech;

115 Cards in this Set

  • Front
  • Back
Independent variable
"X"
Treatment/intervention
must have 2+ levels to have comparison point
Dependent Variable
"Y"
Observed
outcome of treatment
Protocoal Analysis
a type of content analysis
"heeded cognitions" that underlie problem-solving
person asked to think aloud and their verbalizations are recorded
verbalizations are categorized (intention, cognition, planning,evaluation)
Interval recording
obsere a behavior for a period of equal-interval time periods
record if specific behavior occured during that period.
Good for complex interactions and behaviors without concrete beginning and ending (laughing, playing, talking)
Event Sampling
observe and record behaviors each time they occur
Good for behaviors that occur infrequently or have a long duration
Situational Sampling
Observe behaviors in a number of settings
increases generalizability of findings
Sequential Analysis
code behavior sequences (not just specific behaviors) to study complex social behaviors
True Experimental Research
random assignment to groups
enough control of variables to determine that a change in the DV is caused by the IV
Random Assignment
randomly assigning participants to the various groups
Quasi-Experimental Research
experimenter does not have control over group assignment and must use preexisting groups
Simple Random Sampling
Every member of the population has an equal chance at being a part of the sample
Selection of one member of population has no effect on the liklihood of another members selection
reduces probability that sample will be biased, especially if it's a large sample
Stratified Random Sampling
Used when population varies in characteristics and to ensure each characteristic is represented in the sample, the population is split into "strata" and subjects are randomly selected from each strata
(i.e. gender, age, SES, race, culture)
Cluster Sampling
Select units of individuals from population and either include all people in unit or randomly select from the unit
Useful when it is not possible to identify or gain access to entire population
Causes of variability
1) independent variable (experimental variance - you want to maximize)
2)systematic error (you want to control)
3)random error (you want to minimize)
extraneous variable
source of systematic error
-irrelevant to purpose of study, but confounds results because it correlates with the DV
Random Assignment and variability
used to control equalize the effects of variability because all groups are the same on that extraneous variable
Ways to Control Variability from extraneous variables
-randomly assign to equalize
-hold extraneous variable constant by making all participants the same on this variable (limits generalizability)
-match subjects on EV - matching is used to put in pairs and then randomly assign pairs, especially in small samples
-Blocking: build variable into study as an extra IV, participants are "blocked" into groups based on status of EV and then randomly assigned
-Statistically control it, especially in quasi-experiments, ANCOVA
Blocking
when an extraneous variable is identified prior to starting, you can group participants into "blocks" based on the EV and then randomly assign to include the EV as another IV so that it can be analyzed.
Random Error
goal: minimize
-make sure participants dont get tired, experiment site is free from distractions and all measures are reliable
Internal Validity
1) Is there a relationship between IV and DV?
2) Is the relationship causal?

Threats to internal validity:
maturation
history
testing
instrumentation
statistical regression
selection
attrition
interaction with selection
Maturation and internal validity
any biological change that occurs to participants during course of study that is not relevant for research

CONTROL: include more than one group & randomly assign, single-group time-series design so that DV is measured multiple times to gain information on maturation
History and internal validity
when an external event effects DV
Most problematic if only have 1 group and the event occurs at the same time as IV.
CONTROL: have more than one group
Testing and internal validity
exposure to the test may effect performance when the test is readministered
CONTROL: administer DV measure only once, design a measure that minimizes memory and practice effects
Instrumentation
changes in the accuracy or sensitivity of your device
CONTROL: more than one group, use same measures on all participants, ensure measuring devices do not change during study
Statistical Regression and internal validity
extreme scores will likely regress toward the mean
CONTROL: don't include only extreme performers when selecting participants
Selection and internal validity
If there are systematic differences between the groups because of selection process there will be low internal validity
CONTROL: administer pretest to determine initial similarity between groups, randomly assing
Attrition and internal validity
threat to internal validity if drop-outs signficantly differ than those who remain in the study
CONTROL: use a pretest to be able to assess differences
Interactions with Selection and validity
when groups are nonequivalant (i.e. one group exposed to historical event and the other was not)
External Validity
Can these findings be generalized?
*always limited by internal validity, however high internal validity doesn't guarantee external validity*
Threats:
interaction between testing and treatment
interaction between selection and treatment
reactivity/demand characteristics
Multiple treatment interference
population validity
can results be generalized to other people
Ecological Validity
can results be generalized to other settings
-especially important for lab studies
Interaction between testing and treatment
threat to external validity
-administering a pre-test may "sensitize" subjects to the topic area adn therefore interfere with their reaction to IV

limits generalizability to only those who were pretested

CONTROL: don't administer pretest, SOLOMON 4 GROUP DESIGN (to assess impact of pretesting on internal and external validity by using pretest as another IV)
Interaction between selection and treatment
threat to external validity

those who included in research responding to IV in a particular way based on their specific characteristics (i.e. those who volunteer are different than those who didn't -maybe more motivated)

CONTROL: ensure sample is representative of population
Reactivity
threat to external validity
-respond to IV in certain way because they know they are being watched (experimental conditions influence bx)

EVALUATION APPREHENSION: behavior influenced by avoidance of negative evaluations

DEMAND CHARACTERISTICS: cues in experimental setting that inform participants of the purpose and what's expected of them

EXPERIMENTER EXPECTANCY: unintentionally bias results by demand characteristics or in their interpretation of data (i.e. researchers errors are liekly to support their hypothesis)

CONTROL: use deception, unoptrusive/nonreactive measures, single/double-blind study
Multiple Treatment Itnerference
Threat to external validity
Order Effects: effects of one level of IV are influenced by previous exposure to different level of IV

COUNTERBALANCED DESIGN: different subjects receive IV in different order
LATIN SQUARE is a type of counterbalanced desing
Between-Group Design
different levels of IV are administered to different groups and status on DV is compared betwen groups
Factorial Design
Between-Group Design or WIthin-subjects

when study has 2+ IV
advantage: more thorough information about the relationship among variables (main effects and interaction effects)
main effect
effect of one IV on DV

IV-->DV

interpret main effects with caution if there is an interaction effect
Interaction effect
effects of 2+ IV considered together
i.e.: effects of IV are different for different levels of other IV

IV+IV-->DV
Within-Subjects Design
all levels of IV are administered to all participants sequentially
single-group time-series design
within-subjects design

measure DV several times at regular intervals before and after IV is administered

subjects act as their own no-treatment control

(-) susceptable to carryover effects, control through counterbalancing, confounded by autocorrelation (pre-post test correlation inflated because they are correlated b/c same person) - increase Type I error, use special stat to correct
Mixed Designs
combines between-groups and within-subjects

common when measuring DV over time or across trials (time/trial is considered IV of within-subjects)
single-subject design
CAN BE USED WITH GROUPS
1)there is a baseline (no tx) phase
2) then there is a tx phase
each participant acts as own control
3)DV measured repeatedly during both phases

AB design, ABA, ABAB

REVERSAL/WITHDRAWAL DESIGN: anytime treatment is administered and then withdrawan. Provides additional information, but may be inappropriate when unethical to remove effective tx

MULTIPLE BASELINE: no withdrawal required because you adminsiter tx in different settings (across settings) or to the same bx in different participants (across subjects)
Scales of Measurement
N: Nominal: Categories

O: Ordinal: ordered numbers, not equal intervals (Likert Scales, race rankings)

I: Interval: Ordered and at equal intervals. You can add and subtract to calculate M an SD (IQ, temperature)

R: Ratio: ordered,equal intervals and absolute 0, can multiply and divide to determine how much more/less of a characteristic one has over the other (claories, correct answers, response time)
mesokurtic
normal distribution
leptokurtic
more peaked than normal distribution
platykurtic
less peaked/flatter than normal distribution
postively skewed distribution
most scores are on the low side (tail is on positive side)

Mean>Median>Mode
negatively skewed distribution
more scores are on the high side (tail is no negative side)

Mode>Median>Mean
Measures of central tendency
Mode: most frequently occuring number, can be multimodal, easy to identify, but changes with sample fluctuations, not useful for statistical purposes

Median: score that divides distribution in half, if even, it's the number between middle scores, not influenced by outliers, more useful than mode, but only a descriptive stat

Mean: average of socres, least susceptable to sample fluctuations, unbiased estimate of population mean (mu), misleading if outliers
Range
lowest score-highest score=range

can be misleading when there are outliers
Variance
Mean Square
More thorough measrue of variability
includes ALL scores in distribution (not just high and low)

sum of (x-M)squared / N-1

average amount of variability in a distribution, indicating the degree scores are dispersed around the mean

If calculating population variance, denominator is N

If calculating sample variance, denominator is N-1
Standard Deviation
square root of the variance

in the same unit as the measurement (variance changes the unit of measurement, making it hard to interpret)

The larger the SD, the greater the dispersion of scores around the mean
Central Limit Theorum
as the size the sample size increases, the sampling distribution of the mean approaches normal

the mean of the sampling distribution of the mean is equal to the population mean

the SD of the sampling distribution mean is equal to population SD divided by square root of sample size (standard error of the mean)
Sampling distribution of the mean
used to determine the probability of a sample having a particulat mean that can be drawn from population with known parameter

the mean of infinite number of samples in the population

foundation of inferential statistics
Standard error of the mean
an estimate of variability that any particular sample may differ from the population

measure of variability due to random error

the larger the population SD and smaller the sample size, the larger the standard error
null hypothesis
stated to imply that the IV has no effect
alternative hypothesis
stated to indicate that IV has an effect

may be directional (one-tailed) or nondirectional (two-tailed)...Use Two-tailed unless theoretical grounds to use one-tailed

stated in population parameters
Rejection Region
the values not likely to occur by chance or sampling error.

values lie in the "tail" or the alpha value

if value is in this region - null hypothesis is rejected
Retention Region
value falls in area that is consequence of sampling error only

retain null hypothesis
alpha
level of signficance

whena two-tailed test, this is divided between the two tails
Type I error
say there are results when there aren't (rejects true null)

probability of occuring = alpha
increased likelihood when small sample size or observations are dependent
Type II error
retain false null (baby out with bathwater)

probability of occuring = Beta

more likely with low alpha level, small sample size, IV not administered at sufficient intensity
Power
when a test enables experimenter to reject false null (make a correct decision)

maximize by:
increase alpha (.05 not .01)
increase sample size
increase effect size (IV strength and length)
minimize error (reliable DV measure, reduce variability within groups, control extraneous)
use one-tailed when appropriate
Use parametric test (t-test, ANOVA are more powerful)
Confidence
the certaininty ar esearcher has that the decision they made regarding the null hypothesis is accurate
Statistical test to use with nominal data
Chi-Square
-single sample (only 1 variable)
-multiple sample (2+ variables)
*IV and DV aren't differentiated, both count toward variables*
Statistical test to use with Ordinal Data
Mann-Whitney (2 independent groups)

Wilcoxon matched-pairs test (2 correlated groups)

Kruskal-Wallis test (2+ independent groups)
Statistical test to use with Interval/Ratio Data
t-test for single sample (sample v. population mean)

t-test for independent samples (2+ independent groups)

t-test for correlated samples (2 correlated groups)

one-way ANOVA (1 IV, 2+ independent groups

factorial ANOVA (2+IVs)

repeated measures ANOVA (2+ correlated groups)

ANCOVA (removes EV)

mixed ANOVA (independent and correlated groups)

Randomized block ANOVA (extraneous variable)

trend analysis (quantitative IV)

MANOVA (2+ DVs)
Parametric Tests
Interval or Ratio

Normal Distribution

homoscedasticity - variance of the popoulations represented are about equal
Robustness
equal numbers in group

large sample size

low alpha level
Nonparametric Test
nominal or ordinal data

"distribution-free" tests

less powerful
critical value
number that corresponds to the boundary between rejection and retention region.

determined by alpha and degrees of freedom
degrees of freedom
number of values that are "free to vary" based on the numbers that are already known

calculations:

t-test: df=N-1
chi-square df= ("columns"-1)(rows-1)
Chi-Square Test
analyzes frequency of observation in each category

determine if observed frequency is equivalent to expected frequency (expected is null hypothesis i.e. no difference)

expected frequency for each category must be more than 5 and each observation can only appear in 1 category
Single Sample Chi Square
"goodness of fit model"

statistic: x squared
df: (c-1)
Multiple sample chi square
2+ variables
df= (c-1)(r-1)
Mann-Whitney U Test
1 IV, 2 independent groups
1 DV, ordinal
Wilcoxon Matched-Pairs Signed-Ranks Test
1 IV, 2 correlated groups
1DV, ordinal
Kruskal-Wallis Test
1 IV, 2+ independet groups
1 DV, Ordinal
t-test for single sample
1 IV, single group
1 DV, interval/ratio
df: (N-1)
t-test for independent samples
1 IV, 2 independent groups
1 DV, interval/ratio
df= (N-2) where N = number of subjects
t-test for correlated samples
1 IV, 2 correlated groups
1 DV, interval/ratio
df = (N-1) where N=number of PAIRS of scores
one-way ANOVA
1 IV, 2+ independent groups
1 DV, interval/ratio
df = (C-1), (N-1) where C = number of IV levels and N = number of subjects

F ratio: treatment + error / error, the larger F the more likely to have Tx effect. Post hoc needed to determine which means were significantly different
Factorial ANOVA
2+IV, independent groups
1 DV, interval/ratio
F statistic
Randomized block ANOVA
use extraneous variable as IV

reduces within group variability
increases statistical power
ANCOVA
control extraneous variable by statistically removing it

reduces within-group variability
increases power
repeated measures ANOVA
for within-subjects design
mixed (split-plot) ANOVA
used formized design
Trend Analysis
1+ quantitative variablea nd wants to know shape of relationship on DV (linear or nonlinear)
MANOVA
1+ IV, independent groups
2+ DV, interval ratio

simultatneously assessing effects of IV on all DVs (increases statistical power and control experimentwise error)
what measures effect size?
Cohen's d

r squared and eta square
Cohen's d
measure of difference between 2 groups

Mean(group A)- Mean(group B) and difvide by SD for both groups

gives you a difference in SD units

small = .2
medium = .5
large = .8
r square and eta square
effect size measurement

indicate percent of variance in outcome variable that's accounted for by tx
correlation coefficient
summarizes the degree of associateion between variables with one number
pearson r *
most common with interval/ratio
Spearman rank-order (Rho)
2 ordinal variables
Phi
2 nominal variables that are both tru dichotomies (i.e. male/female vs. fiction/nonfiction)
Tetrachoric
2 nominal variables, BUT artificial (favorable/unfavorable, successfull/unsuccessful)
Contingency
2 nominal variables
Point Biserial *
one true dichotomy (nominal)
one interval/ratio
Biserial *
Artificial dichotomy (nominal)
Interval/ratio
Eta *
interval/ratio vs. interval/ratio

DOES NOT NEED TO BE LINEAR RELATIONSHIP
Assumptions of correlation coefficient
Linearity

Unrestricted Range for both variables (i.e. heterogenous groups) - if restricted range, pearson r is underestimate

Homoscedasticity: the range of Y scores is about the same as range of X scores - will result in correlation that does not represent a full range of scores
coefficient of determination
square the correlation coefficient to determine shared variability

ONLY square a correlation if it correlated to different variables
Regression Analysis
allows predictions to be made
ASSUMPTIONS: linear relationship

degree of predictive ability is directly related to magnitude of correlation coefficient

standard error of estimate is used to construct confidence interval of predicted score
Multiple Regression
2+ continuous or discrete predictors used to predict single continues criterion

R (multple correlation coefficient) indicates degree of association between criterion and predictors

ideal to have high correlation with criterion and low correlation wtih other predictors

multicolinearity: when predictors are highly correlated

simple or stepwise
multicolinearity
when predictors (multple regression) are highly correlated
Types of multiple regression
simple: analyze all predictors on criterion at once

stepwise: explain greatest variability with fewest predictors
-forward stepwise: one predictor added to analysis
-backward stepwise: start with all predictors and remove one at a time
Multiple Regression vs. ANOVA
Multple Regression: when groups are unequal size, IV is continuous scale, allows researcher to add or subtract IV to determine the best model
Cross-Validation
try it out on another sample

likely to experience "shrinkage" because doesn't fit exactly like original sample
canonical correlation
2+ continuous predictors to predict 2+ continuous criteria

Identifyt he number and nature of underlying dimensinos that account for the correlation between 2 sets of variables
Discriminant Function analysis
2+ continusou predictors to estmate status on 1 nominal criteria

accuracy measured by "hit rate" or accuratly classified cases
Logistic Regression
predict status on discrete criteiorn using 2+ continuous or discrete predictors

REQUIRES LINEAR RELATIONSHIP (unlike discriminant function analysis)
Path Analysis
extension of multiple regression
translate theory about causal relationships into diagram indicating direction and strength between variables
LISREL
when causal model includes recursive (one-way) and non-recursive (two-way) paths

observed variables AND latent traits variables are believed to measure