Set the Language

We weren't able to detect the audio language on your flashcards. Please select the correct language below.

Front

Back

Related Flashcards

Flashcards
»
EPPP - Statistics & Research Design

Eppp - Statistics & Research Design

by katechristenson9, Feb. 2011

Subjects: eppp

Favorite

Add to folder

Flag

Shuffle
Toggle On

Toggle Off
Alphabetize
Toggle On

Toggle Off
Front First
Toggle On

Toggle Off
Both Sides
Toggle On

Toggle Off
Read
Toggle On

Toggle Off

Reading...

Front

Card Range To Study

through

Play button

Progress

1/115

Click to flip

Use LEFT and RIGHT arrow keys to navigate between flashcards;

Use UP and DOWN arrow keys to flip the card;

H to show hint;

A reads text to speech;

115 Cards in this Set

Front
Back

	Independent variable	"X" Treatment/intervention must have 2+ levels to have comparison point
	Dependent Variable	"Y" Observed outcome of treatment
	Protocoal Analysis	a type of content analysis "heeded cognitions" that underlie problem-solving person asked to think aloud and their verbalizations are recorded verbalizations are categorized (intention, cognition, planning,evaluation)
	Interval recording	obsere a behavior for a period of equal-interval time periods record if specific behavior occured during that period. Good for complex interactions and behaviors without concrete beginning and ending (laughing, playing, talking)
	Event Sampling	observe and record behaviors each time they occur Good for behaviors that occur infrequently or have a long duration
	Situational Sampling	Observe behaviors in a number of settings increases generalizability of findings
	Sequential Analysis	code behavior sequences (not just specific behaviors) to study complex social behaviors
	True Experimental Research	random assignment to groups enough control of variables to determine that a change in the DV is caused by the IV
	Random Assignment	randomly assigning participants to the various groups
	Quasi-Experimental Research	experimenter does not have control over group assignment and must use preexisting groups
	Simple Random Sampling	Every member of the population has an equal chance at being a part of the sample Selection of one member of population has no effect on the liklihood of another members selection reduces probability that sample will be biased, especially if it's a large sample
	Stratified Random Sampling	Used when population varies in characteristics and to ensure each characteristic is represented in the sample, the population is split into "strata" and subjects are randomly selected from each strata (i.e. gender, age, SES, race, culture)
	Cluster Sampling	Select units of individuals from population and either include all people in unit or randomly select from the unit Useful when it is not possible to identify or gain access to entire population
	Causes of variability	1) independent variable (experimental variance - you want to maximize) 2)systematic error (you want to control) 3)random error (you want to minimize)
	extraneous variable	source of systematic error -irrelevant to purpose of study, but confounds results because it correlates with the DV
	Random Assignment and variability	used to control equalize the effects of variability because all groups are the same on that extraneous variable
	Ways to Control Variability from extraneous variables	-randomly assign to equalize -hold extraneous variable constant by making all participants the same on this variable (limits generalizability) -match subjects on EV - matching is used to put in pairs and then randomly assign pairs, especially in small samples -Blocking: build variable into study as an extra IV, participants are "blocked" into groups based on status of EV and then randomly assigned -Statistically control it, especially in quasi-experiments, ANCOVA
	Blocking	when an extraneous variable is identified prior to starting, you can group participants into "blocks" based on the EV and then randomly assign to include the EV as another IV so that it can be analyzed.
	Random Error	goal: minimize -make sure participants dont get tired, experiment site is free from distractions and all measures are reliable
	Internal Validity	1) Is there a relationship between IV and DV? 2) Is the relationship causal? Threats to internal validity: maturation history testing instrumentation statistical regression selection attrition interaction with selection
	Maturation and internal validity	any biological change that occurs to participants during course of study that is not relevant for research CONTROL: include more than one group & randomly assign, single-group time-series design so that DV is measured multiple times to gain information on maturation
	History and internal validity	when an external event effects DV Most problematic if only have 1 group and the event occurs at the same time as IV. CONTROL: have more than one group
	Testing and internal validity	exposure to the test may effect performance when the test is readministered CONTROL: administer DV measure only once, design a measure that minimizes memory and practice effects
	Instrumentation	changes in the accuracy or sensitivity of your device CONTROL: more than one group, use same measures on all participants, ensure measuring devices do not change during study
	Statistical Regression and internal validity	extreme scores will likely regress toward the mean CONTROL: don't include only extreme performers when selecting participants
	Selection and internal validity	If there are systematic differences between the groups because of selection process there will be low internal validity CONTROL: administer pretest to determine initial similarity between groups, randomly assing
	Attrition and internal validity	threat to internal validity if drop-outs signficantly differ than those who remain in the study CONTROL: use a pretest to be able to assess differences
	Interactions with Selection and validity	when groups are nonequivalant (i.e. one group exposed to historical event and the other was not)
	External Validity	Can these findings be generalized? always limited by internal validity, however high internal validity doesn't guarantee external validity Threats: interaction between testing and treatment interaction between selection and treatment reactivity/demand characteristics Multiple treatment interference
	population validity	can results be generalized to other people
	Ecological Validity	can results be generalized to other settings -especially important for lab studies
	Interaction between testing and treatment	threat to external validity -administering a pre-test may "sensitize" subjects to the topic area adn therefore interfere with their reaction to IV limits generalizability to only those who were pretested CONTROL: don't administer pretest, SOLOMON 4 GROUP DESIGN (to assess impact of pretesting on internal and external validity by using pretest as another IV)
	Interaction between selection and treatment	threat to external validity those who included in research responding to IV in a particular way based on their specific characteristics (i.e. those who volunteer are different than those who didn't -maybe more motivated) CONTROL: ensure sample is representative of population
	Reactivity	threat to external validity -respond to IV in certain way because they know they are being watched (experimental conditions influence bx) EVALUATION APPREHENSION: behavior influenced by avoidance of negative evaluations DEMAND CHARACTERISTICS: cues in experimental setting that inform participants of the purpose and what's expected of them EXPERIMENTER EXPECTANCY: unintentionally bias results by demand characteristics or in their interpretation of data (i.e. researchers errors are liekly to support their hypothesis) CONTROL: use deception, unoptrusive/nonreactive measures, single/double-blind study
	Multiple Treatment Itnerference	Threat to external validity Order Effects: effects of one level of IV are influenced by previous exposure to different level of IV COUNTERBALANCED DESIGN: different subjects receive IV in different order LATIN SQUARE is a type of counterbalanced desing
	Between-Group Design	different levels of IV are administered to different groups and status on DV is compared betwen groups
	Factorial Design	Between-Group Design or WIthin-subjects when study has 2+ IV advantage: more thorough information about the relationship among variables (main effects and interaction effects)
	main effect	effect of one IV on DV IV-->DV interpret main effects with caution if there is an interaction effect
	Interaction effect	effects of 2+ IV considered together i.e.: effects of IV are different for different levels of other IV IV+IV-->DV
	Within-Subjects Design	all levels of IV are administered to all participants sequentially
	single-group time-series design	within-subjects design measure DV several times at regular intervals before and after IV is administered subjects act as their own no-treatment control (-) susceptable to carryover effects, control through counterbalancing, confounded by autocorrelation (pre-post test correlation inflated because they are correlated b/c same person) - increase Type I error, use special stat to correct
	Mixed Designs	combines between-groups and within-subjects common when measuring DV over time or across trials (time/trial is considered IV of within-subjects)
	single-subject design	CAN BE USED WITH GROUPS 1)there is a baseline (no tx) phase 2) then there is a tx phase each participant acts as own control 3)DV measured repeatedly during both phases AB design, ABA, ABAB REVERSAL/WITHDRAWAL DESIGN: anytime treatment is administered and then withdrawan. Provides additional information, but may be inappropriate when unethical to remove effective tx MULTIPLE BASELINE: no withdrawal required because you adminsiter tx in different settings (across settings) or to the same bx in different participants (across subjects)
	Scales of Measurement	N: Nominal: Categories O: Ordinal: ordered numbers, not equal intervals (Likert Scales, race rankings) I: Interval: Ordered and at equal intervals. You can add and subtract to calculate M an SD (IQ, temperature) R: Ratio: ordered,equal intervals and absolute 0, can multiply and divide to determine how much more/less of a characteristic one has over the other (claories, correct answers, response time)
	mesokurtic	normal distribution
	leptokurtic	more peaked than normal distribution
	platykurtic	less peaked/flatter than normal distribution
	postively skewed distribution	most scores are on the low side (tail is on positive side) Mean>Median>Mode
	negatively skewed distribution	more scores are on the high side (tail is no negative side) Mode>Median>Mean
	Measures of central tendency	Mode: most frequently occuring number, can be multimodal, easy to identify, but changes with sample fluctuations, not useful for statistical purposes Median: score that divides distribution in half, if even, it's the number between middle scores, not influenced by outliers, more useful than mode, but only a descriptive stat Mean: average of socres, least susceptable to sample fluctuations, unbiased estimate of population mean (mu), misleading if outliers
	Range	lowest score-highest score=range can be misleading when there are outliers
	Variance	Mean Square More thorough measrue of variability includes ALL scores in distribution (not just high and low) sum of (x-M)squared / N-1 average amount of variability in a distribution, indicating the degree scores are dispersed around the mean If calculating population variance, denominator is N If calculating sample variance, denominator is N-1
	Standard Deviation	square root of the variance in the same unit as the measurement (variance changes the unit of measurement, making it hard to interpret) The larger the SD, the greater the dispersion of scores around the mean
	Central Limit Theorum	as the size the sample size increases, the sampling distribution of the mean approaches normal the mean of the sampling distribution of the mean is equal to the population mean the SD of the sampling distribution mean is equal to population SD divided by square root of sample size (standard error of the mean)
	Sampling distribution of the mean	used to determine the probability of a sample having a particulat mean that can be drawn from population with known parameter the mean of infinite number of samples in the population foundation of inferential statistics
	Standard error of the mean	an estimate of variability that any particular sample may differ from the population measure of variability due to random error the larger the population SD and smaller the sample size, the larger the standard error
	null hypothesis	stated to imply that the IV has no effect
	alternative hypothesis	stated to indicate that IV has an effect may be directional (one-tailed) or nondirectional (two-tailed)...Use Two-tailed unless theoretical grounds to use one-tailed stated in population parameters
	Rejection Region	the values not likely to occur by chance or sampling error. values lie in the "tail" or the alpha value if value is in this region - null hypothesis is rejected
	Retention Region	value falls in area that is consequence of sampling error only retain null hypothesis
	alpha	level of signficance whena two-tailed test, this is divided between the two tails
	Type I error	say there are results when there aren't (rejects true null) probability of occuring = alpha increased likelihood when small sample size or observations are dependent
	Type II error	retain false null (baby out with bathwater) probability of occuring = Beta more likely with low alpha level, small sample size, IV not administered at sufficient intensity
	Power	when a test enables experimenter to reject false null (make a correct decision) maximize by: increase alpha (.05 not .01) increase sample size increase effect size (IV strength and length) minimize error (reliable DV measure, reduce variability within groups, control extraneous) use one-tailed when appropriate Use parametric test (t-test, ANOVA are more powerful)
	Confidence	the certaininty ar esearcher has that the decision they made regarding the null hypothesis is accurate
	Statistical test to use with nominal data	Chi-Square -single sample (only 1 variable) -multiple sample (2+ variables) IV and DV aren't differentiated, both count toward variables
	Statistical test to use with Ordinal Data	Mann-Whitney (2 independent groups) Wilcoxon matched-pairs test (2 correlated groups) Kruskal-Wallis test (2+ independent groups)
	Statistical test to use with Interval/Ratio Data	t-test for single sample (sample v. population mean) t-test for independent samples (2+ independent groups) t-test for correlated samples (2 correlated groups) one-way ANOVA (1 IV, 2+ independent groups factorial ANOVA (2+IVs) repeated measures ANOVA (2+ correlated groups) ANCOVA (removes EV) mixed ANOVA (independent and correlated groups) Randomized block ANOVA (extraneous variable) trend analysis (quantitative IV) MANOVA (2+ DVs)
	Parametric Tests	Interval or Ratio Normal Distribution homoscedasticity - variance of the popoulations represented are about equal
	Robustness	equal numbers in group large sample size low alpha level
	Nonparametric Test	nominal or ordinal data "distribution-free" tests less powerful
	critical value	number that corresponds to the boundary between rejection and retention region. determined by alpha and degrees of freedom
	degrees of freedom	number of values that are "free to vary" based on the numbers that are already known calculations: t-test: df=N-1 chi-square df= ("columns"-1)(rows-1)
	Chi-Square Test	analyzes frequency of observation in each category determine if observed frequency is equivalent to expected frequency (expected is null hypothesis i.e. no difference) expected frequency for each category must be more than 5 and each observation can only appear in 1 category
	Single Sample Chi Square	"goodness of fit model" statistic: x squared df: (c-1)
	Multiple sample chi square	2+ variables df= (c-1)(r-1)
	Mann-Whitney U Test	1 IV, 2 independent groups 1 DV, ordinal
	Wilcoxon Matched-Pairs Signed-Ranks Test	1 IV, 2 correlated groups 1DV, ordinal
	Kruskal-Wallis Test	1 IV, 2+ independet groups 1 DV, Ordinal
	t-test for single sample	1 IV, single group 1 DV, interval/ratio df: (N-1)
	t-test for independent samples	1 IV, 2 independent groups 1 DV, interval/ratio df= (N-2) where N = number of subjects
	t-test for correlated samples	1 IV, 2 correlated groups 1 DV, interval/ratio df = (N-1) where N=number of PAIRS of scores
	one-way ANOVA	1 IV, 2+ independent groups 1 DV, interval/ratio df = (C-1), (N-1) where C = number of IV levels and N = number of subjects F ratio: treatment + error / error, the larger F the more likely to have Tx effect. Post hoc needed to determine which means were significantly different
	Factorial ANOVA	2+IV, independent groups 1 DV, interval/ratio F statistic
	Randomized block ANOVA	use extraneous variable as IV reduces within group variability increases statistical power
	ANCOVA	control extraneous variable by statistically removing it reduces within-group variability increases power
	repeated measures ANOVA	for within-subjects design
	mixed (split-plot) ANOVA	used formized design
	Trend Analysis	1+ quantitative variablea nd wants to know shape of relationship on DV (linear or nonlinear)
	MANOVA	1+ IV, independent groups 2+ DV, interval ratio simultatneously assessing effects of IV on all DVs (increases statistical power and control experimentwise error)
	what measures effect size?	Cohen's d r squared and eta square
	Cohen's d	measure of difference between 2 groups Mean(group A)- Mean(group B) and difvide by SD for both groups gives you a difference in SD units small = .2 medium = .5 large = .8
	r square and eta square	effect size measurement indicate percent of variance in outcome variable that's accounted for by tx
	correlation coefficient	summarizes the degree of associateion between variables with one number
	pearson r *	most common with interval/ratio
	Spearman rank-order (Rho)	2 ordinal variables
	Phi	2 nominal variables that are both tru dichotomies (i.e. male/female vs. fiction/nonfiction)
	Tetrachoric	2 nominal variables, BUT artificial (favorable/unfavorable, successfull/unsuccessful)
	Contingency	2 nominal variables
	Point Biserial *	one true dichotomy (nominal) one interval/ratio
	Biserial *	Artificial dichotomy (nominal) Interval/ratio
	Eta *	interval/ratio vs. interval/ratio DOES NOT NEED TO BE LINEAR RELATIONSHIP
	Assumptions of correlation coefficient	Linearity Unrestricted Range for both variables (i.e. heterogenous groups) - if restricted range, pearson r is underestimate Homoscedasticity: the range of Y scores is about the same as range of X scores - will result in correlation that does not represent a full range of scores
	coefficient of determination	square the correlation coefficient to determine shared variability ONLY square a correlation if it correlated to different variables
	Regression Analysis	allows predictions to be made ASSUMPTIONS: linear relationship degree of predictive ability is directly related to magnitude of correlation coefficient standard error of estimate is used to construct confidence interval of predicted score
	Multiple Regression	2+ continuous or discrete predictors used to predict single continues criterion R (multple correlation coefficient) indicates degree of association between criterion and predictors ideal to have high correlation with criterion and low correlation wtih other predictors multicolinearity: when predictors are highly correlated simple or stepwise
	multicolinearity	when predictors (multple regression) are highly correlated
	Types of multiple regression	simple: analyze all predictors on criterion at once stepwise: explain greatest variability with fewest predictors -forward stepwise: one predictor added to analysis -backward stepwise: start with all predictors and remove one at a time
	Multiple Regression vs. ANOVA	Multple Regression: when groups are unequal size, IV is continuous scale, allows researcher to add or subtract IV to determine the best model
	Cross-Validation	try it out on another sample likely to experience "shrinkage" because doesn't fit exactly like original sample
	canonical correlation	2+ continuous predictors to predict 2+ continuous criteria Identifyt he number and nature of underlying dimensinos that account for the correlation between 2 sets of variables
	Discriminant Function analysis	2+ continusou predictors to estmate status on 1 nominal criteria accuracy measured by "hit rate" or accuratly classified cases
	Logistic Regression	predict status on discrete criteiorn using 2+ continuous or discrete predictors REQUIRES LINEAR RELATIONSHIP (unlike discriminant function analysis)
	Path Analysis	extension of multiple regression translate theory about causal relationships into diagram indicating direction and strength between variables
	LISREL	when causal model includes recursive (one-way) and non-recursive (two-way) paths observed variables AND latent traits variables are believed to measure

Share This Flashcard Set

Set the Language

Related Flashcards

Eppp - Statistics & Research Design

Add to Folders

Upgrade to Cram Premium

Card Range To Study

115 Cards in this Set