Set the Language

We weren't able to detect the audio language on your flashcards. Please select the correct language below.

Front

Back

Related Flashcards

Flashcards
»
EPPP Statistics

Eppp Statistics

by mayjungkim11, Feb. 2005

Subjects: statistics

Favorite

Add to folder

Flag

Shuffle
Toggle On

Toggle Off
Alphabetize
Toggle On

Toggle Off
Front First
Toggle On

Toggle Off
Both Sides
Toggle On

Toggle Off
Read
Toggle On

Toggle Off

Reading...

Front

Card Range To Study

through

Play button

Progress

1/107

Click to flip

Use LEFT and RIGHT arrow keys to navigate between flashcards;

Use UP and DOWN arrow keys to flip the card;

H to show hint;

A reads text to speech;

107 Cards in this Set

Front
Back

	AB Design	A single-subject research design that includes a single baseline (A) phase and a single treatment (B) phase. Shortcoming: not adequately control "history", which can threaten internal validity
	Alpha (level of significance)	probability of rejecting the null hypothesis when it is true (i.e., probability of making a Type I error). Set by experimenter prior to collecting or analyzing data
	ANCOVA (Analysis of Covariance)	A verson of ANOVA used to increase the efficiency of the analysis by statistically removing variability in the DV that is due to an extraneous variable. Each person's score on the DV is adjusted on the basis of score on the extraneous variable
	AREAS UNDER THE NORMAL CURVE	when scores are normally distributed,possible to conclude that specific number of observations fall within certain areas of the distribution that are defined by SD. 60% fall plus and minus one SD, 95% plus and minus 2 SD, 99% plus and minus 3 SD
	BETWEEN GROUPS DESIGN	studies in which the effects of the different levels of one or more IVs are compared by administering each level or combination of levels to a different group of subjects
	BISERIAL CORRELATION COEFFICIENT	a bivariate correlation coefficent used when one viarble is an aritifical dichotomy (i.e., a continuous variable that has been artificially dichotomized) and the other is a continuous variable
	BLOCKING	used to control an extraneous variable when an investigator wants to statistically analyze its main and interaction effects on the DV. Involves grouping (blocking) subjects with regard to their status on the extraneous variable and then randomly assigning subjects in each block to one of the treatment groups
	CENTRAL LIMIT THEOREM	derived from probability theory that predicts that the sampling distribtuion of the mean (1) will approach a normal shape as the sample size increases, regardless of the shape of the population distribution of scores; (2) has a mean equal to the population mean, and (3) has a SD equal to the pop. SD divided by square root of the sample size
	CLUSTER SAMPLING	entails selecting units or groups (clusters) of individuals from the population (vs. involves selecting indvds from pop)
	COEFFICIENT OF DETERMINATION	name given tothe r when it is squared, indicate amt of variability in Y that is accounted for by the variability of X (amt of variability shared by the two variables). r is squared only when it is the correlation between two different variables
	CORRELATION COEFFICIENT	index of the relationship between two or more variables. indicates strength of the relationship; can be positive or negative
	COUNTERBALANCED DESIGN	research design used to control carryover (order) effects; involves administering the differetn levels fo IV to different subjects or gorups of subjects in a different order. ex. Latin square design
	CROSS-VALIDATION	validating a correlation coeffience (e.g.,criterion related validity coefficient) on a new sample. B/c the same chance factors operating in the original sample are not operating in the subsequent sample, the correlation coefficient tends to "shrink" on cross validation. In terms of multiple correlation coefficient (R) shrinkage is greatest when the original sample is small and the number of predictors is large
	DEMAND CHARACTERISTICS	cues in the experimental situation that inform research participants of how they are expected to behave during the course of the study. threaten internal and external validity
	DEPENDENT VARIABLE	variable that is observed and measured in a study and is believes to be affected by the IV
	DISCRIMINANT FUNCTION ANALYSIS	the multivariate technique used when there are 2 or more continuous predictors and one discrete (nominal) criterion. Multiple discriminant function when the criterion has more than 2 categories
	ETA	bivariate correlction coefficent used when both variables are continuous and the relationship between them is nonlinear
	EVENT SAMPLING	method of behavioral sampling that is useful for behaviors that are rare or that leave a permanent product. involves recording each occurrence of a behaivor during a predefined or preselected event
	EXPERIMENTALWISE ERROR RATE	probability of amking Type I error; as number of statistical comparisons in a study increases, the experimentwise error rate also increases
	EXTERNAL VALIDITY	degree to which a study's results can be generalized to other people, settings, conditions, etc
	F-RATIO	test statistic yielded by the analysis of variance; represents a mesure of treatment effects plus error divided by a measure of error only (MSB/MSW); when treatment has an effect, the ratio is larger than 1.0
	FACTORIAL ANOVA	type of ANOVA used when a study includes two or more IV's (i.e., when the study has used a factorial design) Also referred to as 2-way ANOVA, etc...referring to the number of IVs
	INTERACTION	occurs when the impact of one IV differs at different levels of another variable
	FACTORIAL DESIGN	name given to any research design that includes two or more "factors" (IVs); permit analysis of main and interaction effects
	HISTORY	event that is external to a research study and that is not relevant to the research hypothesis but that affects subjects' performance ont he DV in a systematic way and thereby confounnds the results of the study; threatens internal validity
	INDEPENDENT VARIABLE	variable that is manipulated in research for the purpose of determing its effects on the DV; each IV must have at least 2 levels
	INTERNAL VALIDITY	the degree which a research study allows investigator to conclude that observed variability in a dependent variable is due to the IV rather than other factors
	INTERVAL RECORDING	method of behavioral sampling that involves dividing a period of time into discrete intervals and recording whether the behavior occurs in each interval; useful in behaviors that have no clear beginning or end
	LISREL	a casual (structural equation) modeling technique used to verify a predefined causal model or theory. More ocmplex than path analysis; allows 2-way (non-recursive) paths and takes into account observed variables, the latent traits they are believed to measure, and the effects of measurement error
	MANOVA (MULTIVARIATE ANALYSIS OF VARIANCE)	form of ANOVA used when a study includes one or more IVs and 2 or more DVs, each of which is measured on an interval or ratio scale; helps reduce the experimentalwise error rate and increases power by analyzing the effects of the IV(s) on all DVs simultaneously
	MATCHING	method of controlling an extraneous variable or other source of systematic error; involves pairing or grouping subjects on the basis of their status on the extraneous variable and randomly assigning members of each pair or group to a different treatment group so that groups are initially equivalent with regard to the extraneous variable
	MATURATION	any physical or psychological process or even that occurs as the result of the passage of time (e.g., fatigue, motivation) and that has a systematic effect on subjects' status on the DV. Acts as a threat to internal validity
	MEAN	measure of central tendency that is the arithmetic average of a set of scores; used when scores are measured on an interval or ratio scale
	MEDIAN	measure of central tendency that is the middle score in a distribution of scores when scores have been ordered from lowest to highest; used with ordinal data (and often with interval and ratio data when the distribution is skewed or contains one or few outliers)
	MIXED DESIGN	research design in which both between-groups and within subjects comparisons can be made
	MIXED DESIGN	research designs in which both between-groups and within-subjects comparisons are made
	MODE	measure of central tendency that represents the most frequently occurring category or score in a distribution
	MULTICOLLINEARITY	an undesirable condition in multiple regression and other multivariate (prediction) techniques; occuring when predictors are highly correlated with one another
	MULTIPLE BASELINE DESIGN	a single-subject design that involves sequentially applying a treatment to different "baselines" (e.g., to different behaviors, settings, or subjects); useful when a reversal design would be impractical or unethical
	MULTIPLE CORRELATION COEFFICIENT (R)	correlation coefficient that indicates the degree of association between 3 or more variables. R can be squared to obtain a mesure of shared variability
	MULTIPLE REGRESSION	multivariate technique used for predicting a score on a continuous criterion based on performance on 2 or more continous and/or discrete predictors; predictors will have low correlations with each other and high correlations with criterion; output is mutiple correlation coefficient and multiple regression equation
	MULTIPLE SAMPLE CHI SQUARE TEST	nonparametric inferential statistical test used when a study includes 2 or more variables and the data to be analyzed are reported in terms of frequencies in each category; involves comparing observed frequencies to expected frequencies to determine if the 2 distributions of frequencies differ (Df = (C-1)(R-1)
	NONPARAMETRIC TESTS	inferential statistical tests used when the data to be analyzed represent either an ordinal or nominal scale or when the assumptions for a parametric test have not been met; does not make the sampe assumptions about the population distribution as the parametric tests and therefore, are also known as "distribution free tests"; includes chi-square, Mann-Whitney U, Wilcoxon matched pairs test
	NORMAL DISTRIBUTION (CURVE)	symmetrical bell shaped distribtuion that is defined by a specific mathematical formula; "Gaussian distribution"
	ONE WAY ANOVA (ANALYSIS OF VARIANCE)	parametric statistical test used to compare the means of 2 or more groups when a study includes 1 IV and 1 DV that is measured on an interval or ratio scale; yields F ratio that indicates if any group means are significantly different; preferable to multiple t-tests when a study involves more than 3 groups b/c helps control the experimentwise error rate
	PARAMETRIC TESTS	inferential statistical tests that are used when the data to be analyzed represent an interval or ratio scale and when certain assumptions about the pop distributions have been met (ie., when scores on the variable of interest are normally distributed and when there is homoscedasticity); more powerful
	PATH ANALYSIS	a causal modeling technique used to verify a pre-defined causal model or theory; involves translating the theory into a path diagram, collecting data on the variables of interest and calculating and interpreting path coefficients
	PEARSON R	correlation coefficient that can be used when both variables have been measured on an interval or ratio scale; requires linearity, unrestricted range of scores, and homoscedasticity
	POINT BISERIAL CORRELATION COEFFICIENT	bivariate correlation coefficient used when one variable is a true dichotomy (male/female) and the other variable is continuous
	POWER	probability of rejecting a false null hypothesis; cannot be directly controlled but can be increased by including a large sample, maximizing the effects of the IV, increasing the size of alpha, and reducing error
	PROTOCOL ANALYSIS	technique used by cognitive psychologists to identify cognitions underlying problem-solving and decision making; thinking aloud while working and then analyzing the record (protocol) of the individual's verbalization
	QUASI-EXPERIMENTAL RESEARCH	experimental research in which experimental control is limited, especially to assign subjects to groups b/c intact groups must be used, the variable of interest is an organismic variable or the study includes only one group; limitation: not allow to conclude causal relationship
	RANDOM ASSIGNMENT	method of assigning subjects to treatmetn groups using random method; hallmark of trus experimental research and can conclude that any observed effect of an DV due to IV rather than error
	RANDOM BLOCK FACTORIAL ANOVA	version of ANOVA that appropriate when blocking has been used as a method of controlling extraneous variable; can statistically analyze main and interaction effects of the extraneous variable (which is being treated as an additional IV)
	REGRESSION ANALYSIS	statistical technique used to predict a score on a criterion based on the person's obtained score on a predictor; involves identification of a regression line (line of best fit) and use of equation for that line
	REJECTION REGION	region of sampling distribtuion that contains those sample values (e.g., means) that are unlikely to be obtained simply as the result of sampling error. when an inferential statistical test indicates that the obtained sample value falls in the rejection region, the null hypothesis is rejected and the alternative hypothesis is retained; size of the region determined by alpha
	RETENTION REGION	region of sampling distribution that contains those values that are likely to be obtained simply as the result of sampling error; when inferential statistical test indicates that an obtained sample value is in the retention region, the null hypothesis is retained an the laternative hypothesis is rejected; region is equal to one minus alpha
	REVERSAL (WITHDRAWAL) DESIGN	type of single-subject design that includes at a minimum, two baseline phases and one treatment phase (e.g., an ABA or ABAB design); treatmetn is withdrawn during the second and subsequent baseline phases
	SAMPLING DISTRIBUTION OF THE MEAN	distribution of sample mean that would be obtained if an infinite number of equal-size samples were randomly selected from the population and the mean for each sample calculated; normally shaped, mean is equal to the pop mean, SD (standard error of the mean) is = to pop SD divided by square root of the sampel size; used in inferential statistics to determine how unlikely it is to obtain a particular sample mean given the pop mean, the pop SD, sample size, and level of significance
	SAMPLING ERROR	type of random error that is due to uncontrolled factors and that is responsible for the fluctuations found between sample values and the corresponding value for the population from which the samples were randomly drawn
	SCALES OF MEASUREMENT	method of categorizing the various ways to measure variables; four types that differ in terms of mathematical sophistication; nominal, ordinal, interval, iand ratio; nominal yields frequency of observations; ordinal, interval and ratio yields scale values or scores
	SCATTERGRAM (SCATTERPLOT)	summary in graphic (pictorial) form of the degree of association between two variables; wide scatter in a scattergram indicates a low correlation between variables
	SELECTION	a potential threat to both the internal and external validty of research when subjects are not randomly assigned or selected; threatens internal validty when subjects in different treatmetn groups are initially different and would differ at the end of the study; threatens external validity when the characteristics of the subjects in different groups cuases them to react in an idiosyncratic way to the treatment
	SINGLE-SAMPLE CHI-SQUARE TEST	a nonparemetric inferential statistical test sued when a study includes 1 variable and the data therefore are reported in terms of frequencies in each category (level) of that variable; involves comparing the observed frequencies to expected frequencies to determine if the two distributions of frequencies differ (Df = C-1)
	SKEWED DISTRIBUTION	asymmetrical distributions in which the majority of scores are located on one side of the distribution; positively skewed most scores are in the low side but a few scores are in the high (positive) side; negatively skewed distribution the majority of scores are in the high side of the distribution, but a few on the low side
	STANDARD DEVIATION	a measure of dispersion (variability) of scores around the mean of the distribution; calculated by dividing the sum of the squared deviation by N (or N-1) and taking the square root of the result; the square root of the variance
	STANDARD ERROR OF THE MEAN	SD of the sampling distribution of the mean; calculated by dividing the pop SD by the square root of the sample size
	T-TEST FOR A SINGLE SAMPLE	version of t-test used to compare a single obtained sample mean to a known or hypothesized pop mean (Df = N-1)
	T-TEST FOR CORRELATED SAMPLES	version of t-test used to compare two sample means when subjects in the 2 groups are related in some way (e.g., b/c they were matched on an extraneous variable or b/c a single-group pretest/posttest design was used) Df= number of pairs of scores -1
	T-TEST FOR INDEPENDENT SAMPLES	version of t-test used to compare two samples means when subjects in the two groups are independent (unrelated) Df = N-2
	TREND ANALYSIS	type of analysis of variance used to assess linear and nonlinear trends when the ID is quantitative
	TRUE EXPERIMENTAL RESEARCH	experimental research that provides the investigator with maximal experimental control; randomly assign subjects
	TYPE I ERROR	decision error that occurs when a true null hypothesis is rejected; equal to alpha
	TYPE II ERROR	decision error that occurs when a false null hypothesis is retained; equal to beta
	WITHIN-SUBJECT DESIGN	experimental design in which each subject receives at different times, each level of the IV ( or combination of IVs) so that comparisons on the DV are made within subjects rather than between groups
	CLASSICAL TEST THEORY	theory of measurement that regards observed variability in test scores as reflecting two components; true differences between examinees on the attribute measured by the test and the effects of measurement (random) erro
	COEFFICIENT ALPHA	method of assessing internal consistency reliability that provides an index of average inter-item consistency; primary sources of error are content (item) sampling differences and heterogeniety of the content domian
	KUDER-RICHARDSON FORMULA 20 (KR-20)	used to substitute for coefficient alpha when test items are scored dichotomously
	CONSTRUCT VALIDITY	extent to which a test measures the hypothetical trait (construct) it is intended to measure
	CONTENT VALIDITY	extent to which test adequately samples the domain of information, knowledge, skill that it purports to measure
	CORRECTION FOR GUESSING	method used to ensure that examinees do not benefit from "wild" guessing; involves using formula to adjust scores; the resulting distribution has lower mean and larger SD than original distribution
	CRITERION CONTAMINATION	bias introduced into a person's criterion score as a result of the knowledge of the scorer about his/her performance on the predictor; tends ot artificially inflate the relationship between the predictor and criterion
	CRITERION-REFERENCED INTERPRETATION	interpretation of a test score in terms of a prespecified standard (i.e., terms of % of content correct or predicted performance on an external criterion (e.g., regression equation, expectancy table)
	CRITERION-RELATED VALIDITY	validity involving determining the relationship (correlation) between the predictor and the criterion; can be either concurrent or predictive
	CROSS VALIDATION	process of re-assessing a test's criterion-related validity on a new sample to check the generalizability of the original validity coefficient; ordinarily the validity coefficient "shrinks" on cross validation b/c the chance factors operating in the original sample are not all present in the sample
	FACTOR ANALYSIS	multivariate statistical technique used to determine how many factors (constructs) are needed to account for the intercorrelations among a set of tests, subtests, or test items; used to assess test's construct validity by indicating the extent to which the test correlates with factors that it would and owuld not be expected to correlate with
	INCREMENTAL VALIDITY	extent to which predictor increases decision-making accuracy; calculated by subtracting the base rate from the positive hit rate; terms to have linked with this are predictor and criterion cutoff scores
	INTER-RATER RELIABILITY	measure of degree of consistency of scores for 2 or more rates; can be measured either by correlation coefficient or percent agreement
	ITEM DIFFICULTY INDEX	measure of an item's difficulty level; calculated by dividing number of individuals who answered the item correctly by total number of individual; ranges in valie from 0 (very difficult) to 1.0 (very easy); .50 is preferred
	ITEM RESPONSE THEORY (IRT)	approach to test construction that involves identifying an item characteristic curve for each test items; vs. classical test theory and has advantage identified item parameters are sample invariant; scores from different tests or different sets of test items can be equated
	MULTITRAIT MULTI-METHOD MATRIX	systematic way to organize the correlation coefficient obtained when assessing a measure's convergent and discriminant validity; requires measuring at least 2 different traits using two different methods for each trait
	NORM-REFERENCED INTERPRETATION	interpretation of an examinee's test performance relative to the performance of examinees in a normative (standardization) sample; %tile rank, standard score, age and grade equivalent scores are examples
	ORTHOGONAL ROTATION	factor analysis; produces uncorrelated factors; rotation done to simplify the interpretation of identified factors
	OBLIQUE ROTATION	factor analysis; produces correlated factors
	PERCENTAGE SCORE	type of criterion-referenced (aka content referenced) score that indicates the percent of test content (items) endorsed or answered correctly;
	PERCENTILE RANK	type of norm-referenced score that indicates the percent of examinees in the normative group who obtained lower score; 60% of the examinees in the norm group obtained lower raw scores
	PRINCIPAL COMPONENT ANALYSIS	multivariate technique similar to factor analysis that is used to identify the fewest dimensions (components) that can explain the variability in a set of tests; amount of variability eplained by each identified component is provided by an eigenvalue
	RELATIONSHIP BETWEEN RELIABILITY AND VALIDITY	reliability is necessary but not sufficient for validity; in terms of criterion related validity-validity coeffficent can be no greater than the square root of the product of the reliabilities of the predictor and criterion
	RELIABILITY	consistency of test scores (i.e., extent to which test measures an attribute without being affected by random fluctuations -measurement errors-that produce inconsistencies over time;
	TYPES OF RELIABILITY	test-retest, alternative form, split-half, coefficient alpha, inter-rater
	SPEARMAN-BROWN FORMULA	used to estimate the reliability of a test if it were lengthended or shortened, with items from the same content domain or measuring the same construct; used with split-half reliablity to dtermine what reliability coefficient would have been had it been on full length test
	SPLIT-HALF RELIABILITY	method of assessing internal consistency relability that involves "splitting" the test half; contet sampling is the primary source of error; using corrected with Spearman Brown formula
	STANDARD ERROR OF ESTIMATE	an index of error when predicting criterion scores from predictor scores; used to construct a confidence interval around an examinee's predicted criterion score; magnitude depends on: 1. criterion's SD, 2. predictors validity coefficient
	STANDARD ERROR OF MEASUREMENT	an index of measurement error; used to construct confidence interval around an examinee's obtained test score; magnitude depends on 1. test's SD, 2. test's reliability coefficient
	STANDARD SCORE	transformed score that reports test performance in terms of SD unites from the mean achieved by normative sample; t-scores, z-scores, deviation IQs
	TEST-RETEST RELIABILITY	method for assessing reliability that invovles administering the same test to the same group on 2 different occasions and correlating the 2 sets of scores; time sampling factors are primary sources of error; yields a coefficient stability
	VALIDITY	extent to which test accurately measures what it is intended to measure; three types: content, construct, and criterion-related

Share This Flashcard Set

Set the Language

Related Flashcards

Eppp Statistics

Add to Folders

Upgrade to Cram Premium

Card Range To Study

107 Cards in this Set