• Shuffle
    Toggle On
    Toggle Off
  • Alphabetize
    Toggle On
    Toggle Off
  • Front First
    Toggle On
    Toggle Off
  • Both Sides
    Toggle On
    Toggle Off
  • Read
    Toggle On
    Toggle Off
Reading...
Front

Card Range To Study

through

image

Play button

image

Play button

image

Progress

1/107

Click to flip

Use LEFT and RIGHT arrow keys to navigate between flashcards;

Use UP and DOWN arrow keys to flip the card;

H to show hint;

A reads text to speech;

107 Cards in this Set

  • Front
  • Back
AB Design
A single-subject research design that includes a single baseline (A) phase and a single treatment (B) phase. Shortcoming: not adequately control "history", which can threaten internal validity
Alpha (level of significance)
probability of rejecting the null hypothesis when it is true (i.e., probability of making a Type I error). Set by experimenter prior to collecting or analyzing data
ANCOVA (Analysis of Covariance)
A verson of ANOVA used to increase the efficiency of the analysis by statistically removing variability in the DV that is due to an extraneous variable. Each person's score on the DV is adjusted on the basis of score on the extraneous variable
AREAS UNDER THE NORMAL CURVE
when scores are normally distributed,possible to conclude that specific number of observations fall within certain areas of the distribution that are defined by SD. 60% fall plus and minus one SD, 95% plus and minus 2 SD, 99% plus and minus 3 SD
BETWEEN GROUPS DESIGN
studies in which the effects of the different levels of one or more IVs are compared by administering each level or combination of levels to a different group of subjects
BISERIAL CORRELATION COEFFICIENT
a bivariate correlation coefficent used when one viarble is an aritifical dichotomy (i.e., a continuous variable that has been artificially dichotomized) and the other is a continuous variable
BLOCKING
used to control an extraneous variable when an investigator wants to statistically analyze its main and interaction effects on the DV. Involves grouping (blocking) subjects with regard to their status on the extraneous variable and then randomly assigning subjects in each block to one of the treatment groups
CENTRAL LIMIT THEOREM
derived from probability theory that predicts that the sampling distribtuion of the mean (1) will approach a normal shape as the sample size increases, regardless of the shape of the population distribution of scores; (2) has a mean equal to the population mean, and (3) has a SD equal to the pop. SD divided by square root of the sample size
CLUSTER SAMPLING
entails selecting units or groups (clusters) of individuals from the population (vs. involves selecting indvds from pop)
COEFFICIENT OF DETERMINATION
name given tothe r when it is squared, indicate amt of variability in Y that is accounted for by the variability of X (amt of variability shared by the two variables). r is squared only when it is the correlation between two different variables
CORRELATION COEFFICIENT
index of the relationship between two or more variables. indicates strength of the relationship; can be positive or negative
COUNTERBALANCED DESIGN
research design used to control carryover (order) effects; involves administering the differetn levels fo IV to different subjects or gorups of subjects in a different order. ex. Latin square design
CROSS-VALIDATION
validating a correlation coeffience (e.g.,criterion related validity coefficient) on a new sample. B/c the same chance factors operating in the original sample are not operating in the subsequent sample, the correlation coefficient tends to "shrink" on cross validation. In terms of multiple correlation coefficient (R) shrinkage is greatest when the original sample is small and the number of predictors is large
DEMAND CHARACTERISTICS
cues in the experimental situation that inform research participants of how they are expected to behave during the course of the study. threaten internal and external validity
DEPENDENT VARIABLE
variable that is observed and measured in a study and is believes to be affected by the IV
DISCRIMINANT FUNCTION ANALYSIS
the multivariate technique used when there are 2 or more continuous predictors and one discrete (nominal) criterion. Multiple discriminant function when the criterion has more than 2 categories
ETA
bivariate correlction coefficent used when both variables are continuous and the relationship between them is nonlinear
EVENT SAMPLING
method of behavioral sampling that is useful for behaviors that are rare or that leave a permanent product. involves recording each occurrence of a behaivor during a predefined or preselected event
EXPERIMENTALWISE ERROR RATE
probability of amking Type I error; as number of statistical comparisons in a study increases, the experimentwise error rate also increases
EXTERNAL VALIDITY
degree to which a study's results can be generalized to other people, settings, conditions, etc
F-RATIO
test statistic yielded by the analysis of variance; represents a mesure of treatment effects plus error divided by a measure of error only (MSB/MSW); when treatment has an effect, the ratio is larger than 1.0
FACTORIAL ANOVA
type of ANOVA used when a study includes two or more IV's (i.e., when the study has used a factorial design) Also referred to as 2-way ANOVA, etc...referring to the number of IVs
INTERACTION
occurs when the impact of one IV differs at different levels of another variable
FACTORIAL DESIGN
name given to any research design that includes two or more "factors" (IVs); permit analysis of main and interaction effects
HISTORY
event that is external to a research study and that is not relevant to the research hypothesis but that affects subjects' performance ont he DV in a systematic way and thereby confounnds the results of the study; threatens internal validity
INDEPENDENT VARIABLE
variable that is manipulated in research for the purpose of determing its effects on the DV; each IV must have at least 2 levels
INTERNAL VALIDITY
the degree which a research study allows investigator to conclude that observed variability in a dependent variable is due to the IV rather than other factors
INTERVAL RECORDING
method of behavioral sampling that involves dividing a period of time into discrete intervals and recording whether the behavior occurs in each interval; useful in behaviors that have no clear beginning or end
LISREL
a casual (structural equation) modeling technique used to verify a predefined causal model or theory. More ocmplex than path analysis; allows 2-way (non-recursive) paths and takes into account observed variables, the latent traits they are believed to measure, and the effects of measurement error
MANOVA (MULTIVARIATE ANALYSIS OF VARIANCE)
form of ANOVA used when a study includes one or more IVs and 2 or more DVs, each of which is measured on an interval or ratio scale; helps reduce the experimentalwise error rate and increases power by analyzing the effects of the IV(s) on all DVs simultaneously
MATCHING
method of controlling an extraneous variable or other source of systematic error; involves pairing or grouping subjects on the basis of their status on the extraneous variable and randomly assigning members of each pair or group to a different treatment group so that groups are initially equivalent with regard to the extraneous variable
MATURATION
any physical or psychological process or even that occurs as the result of the passage of time (e.g., fatigue, motivation) and that has a systematic effect on subjects' status on the DV. Acts as a threat to internal validity
MEAN
measure of central tendency that is the arithmetic average of a set of scores; used when scores are measured on an interval or ratio scale
MEDIAN
measure of central tendency that is the middle score in a distribution of scores when scores have been ordered from lowest to highest; used with ordinal data (and often with interval and ratio data when the distribution is skewed or contains one or few outliers)
MIXED DESIGN
research design in which both between-groups and within subjects comparisons can be made
MIXED DESIGN
research designs in which both between-groups and within-subjects comparisons are made
MODE
measure of central tendency that represents the most frequently occurring category or score in a distribution
MULTICOLLINEARITY
an undesirable condition in multiple regression and other multivariate (prediction) techniques; occuring when predictors are highly correlated with one another
MULTIPLE BASELINE DESIGN
a single-subject design that involves sequentially applying a treatment to different "baselines" (e.g., to different behaviors, settings, or subjects); useful when a reversal design would be impractical or unethical
MULTIPLE CORRELATION COEFFICIENT (R)
correlation coefficient that indicates the degree of association between 3 or more variables. R can be squared to obtain a mesure of shared variability
MULTIPLE REGRESSION
multivariate technique used for predicting a score on a continuous criterion based on performance on 2 or more continous and/or discrete predictors; predictors will have low correlations with each other and high correlations with criterion; output is mutiple correlation coefficient and multiple regression equation
MULTIPLE SAMPLE CHI SQUARE TEST
nonparametric inferential statistical test used when a study includes 2 or more variables and the data to be analyzed are reported in terms of frequencies in each category; involves comparing observed frequencies to expected frequencies to determine if the 2 distributions of frequencies differ (Df = (C-1)(R-1)
NONPARAMETRIC TESTS
inferential statistical tests used when the data to be analyzed represent either an ordinal or nominal scale or when the assumptions for a parametric test have not been met; does not make the sampe assumptions about the population distribution as the parametric tests and therefore, are also known as "distribution free tests"; includes chi-square, Mann-Whitney U, Wilcoxon matched pairs test
NORMAL DISTRIBUTION (CURVE)
symmetrical bell shaped distribtuion that is defined by a specific mathematical formula; "Gaussian distribution"
ONE WAY ANOVA (ANALYSIS OF VARIANCE)
parametric statistical test used to compare the means of 2 or more groups when a study includes 1 IV and 1 DV that is measured on an interval or ratio scale; yields F ratio that indicates if any group means are significantly different; preferable to multiple t-tests when a study involves more than 3 groups b/c helps control the experimentwise error rate
PARAMETRIC TESTS
inferential statistical tests that are used when the data to be analyzed represent an interval or ratio scale and when certain assumptions about the pop distributions have been met (ie., when scores on the variable of interest are normally distributed and when there is homoscedasticity); more powerful
PATH ANALYSIS
a causal modeling technique used to verify a pre-defined causal model or theory; involves translating the theory into a path diagram, collecting data on the variables of interest and calculating and interpreting path coefficients
PEARSON R
correlation coefficient that can be used when both variables have been measured on an interval or ratio scale; requires linearity, unrestricted range of scores, and homoscedasticity
POINT BISERIAL CORRELATION COEFFICIENT
bivariate correlation coefficient used when one variable is a true dichotomy (male/female) and the other variable is continuous
POWER
probability of rejecting a false null hypothesis; cannot be directly controlled but can be increased by including a large sample, maximizing the effects of the IV, increasing the size of alpha, and reducing error
PROTOCOL ANALYSIS
technique used by cognitive psychologists to identify cognitions underlying problem-solving and decision making; thinking aloud while working and then analyzing the record (protocol) of the individual's verbalization
QUASI-EXPERIMENTAL RESEARCH
experimental research in which experimental control is limited, especially to assign subjects to groups b/c intact groups must be used, the variable of interest is an organismic variable or the study includes only one group; limitation: not allow to conclude causal relationship
RANDOM ASSIGNMENT
method of assigning subjects to treatmetn groups using random method; hallmark of trus experimental research and can conclude that any observed effect of an DV due to IV rather than error
RANDOM BLOCK FACTORIAL ANOVA
version of ANOVA that appropriate when blocking has been used as a method of controlling extraneous variable; can statistically analyze main and interaction effects of the extraneous variable (which is being treated as an additional IV)
REGRESSION ANALYSIS
statistical technique used to predict a score on a criterion based on the person's obtained score on a predictor; involves identification of a regression line (line of best fit) and use of equation for that line
REJECTION REGION
region of sampling distribtuion that contains those sample values (e.g., means) that are unlikely to be obtained simply as the result of sampling error. when an inferential statistical test indicates that the obtained sample value falls in the rejection region, the null hypothesis is rejected and the alternative hypothesis is retained; size of the region determined by alpha
RETENTION REGION
region of sampling distribution that contains those values that are likely to be obtained simply as the result of sampling error; when inferential statistical test indicates that an obtained sample value is in the retention region, the null hypothesis is retained an the laternative hypothesis is rejected; region is equal to one minus alpha
REVERSAL (WITHDRAWAL) DESIGN
type of single-subject design that includes at a minimum, two baseline phases and one treatment phase (e.g., an ABA or ABAB design); treatmetn is withdrawn during the second and subsequent baseline phases
SAMPLING DISTRIBUTION OF THE MEAN
distribution of sample mean that would be obtained if an infinite number of equal-size samples were randomly selected from the population and the mean for each sample calculated; normally shaped, mean is equal to the pop mean, SD (standard error of the mean) is = to pop SD divided by square root of the sampel size; used in inferential statistics to determine how unlikely it is to obtain a particular sample mean given the pop mean, the pop SD, sample size, and level of significance
SAMPLING ERROR
type of random error that is due to uncontrolled factors and that is responsible for the fluctuations found between sample values and the corresponding value for the population from which the samples were randomly drawn
SCALES OF MEASUREMENT
method of categorizing the various ways to measure variables; four types that differ in terms of mathematical sophistication; nominal, ordinal, interval, iand ratio; nominal yields frequency of observations; ordinal, interval and ratio yields scale values or scores
SCATTERGRAM (SCATTERPLOT)
summary in graphic (pictorial) form of the degree of association between two variables; wide scatter in a scattergram indicates a low correlation between variables
SELECTION
a potential threat to both the internal and external validty of research when subjects are not randomly assigned or selected; threatens internal validty when subjects in different treatmetn groups are initially different and would differ at the end of the study; threatens external validity when the characteristics of the subjects in different groups cuases them to react in an idiosyncratic way to the treatment
SINGLE-SAMPLE CHI-SQUARE TEST
a nonparemetric inferential statistical test sued when a study includes 1 variable and the data therefore are reported in terms of frequencies in each category (level) of that variable; involves comparing the observed frequencies to expected frequencies to determine if the two distributions of frequencies differ (Df = C-1)
SKEWED DISTRIBUTION
asymmetrical distributions in which the majority of scores are located on one side of the distribution; positively skewed most scores are in the low side but a few scores are in the high (positive) side; negatively skewed distribution the majority of scores are in the high side of the distribution, but a few on the low side
STANDARD DEVIATION
a measure of dispersion (variability) of scores around the mean of the distribution; calculated by dividing the sum of the squared deviation by N (or N-1) and taking the square root of the result; the square root of the variance
STANDARD ERROR OF THE MEAN
SD of the sampling distribution of the mean; calculated by dividing the pop SD by the square root of the sample size
T-TEST FOR A SINGLE SAMPLE
version of t-test used to compare a single obtained sample mean to a known or hypothesized pop mean (Df = N-1)
T-TEST FOR CORRELATED SAMPLES
version of t-test used to compare two sample means when subjects in the 2 groups are related in some way (e.g., b/c they were matched on an extraneous variable or b/c a single-group pretest/posttest design was used) Df= number of pairs of scores -1
T-TEST FOR INDEPENDENT SAMPLES
version of t-test used to compare two samples means when subjects in the two groups are independent (unrelated) Df = N-2
TREND ANALYSIS
type of analysis of variance used to assess linear and nonlinear trends when the ID is quantitative
TRUE EXPERIMENTAL RESEARCH
experimental research that provides the investigator with maximal experimental control; randomly assign subjects
TYPE I ERROR
decision error that occurs when a true null hypothesis is rejected; equal to alpha
TYPE II ERROR
decision error that occurs when a false null hypothesis is retained; equal to beta
WITHIN-SUBJECT DESIGN
experimental design in which each subject receives at different times, each level of the IV ( or combination of IVs) so that comparisons on the DV are made within subjects rather than between groups
CLASSICAL TEST THEORY
theory of measurement that regards observed variability in test scores as reflecting two components; true differences between examinees on the attribute measured by the test and the effects of measurement (random) erro
COEFFICIENT ALPHA
method of assessing internal consistency reliability that provides an index of average inter-item consistency; primary sources of error are content (item) sampling differences and heterogeniety of the content domian
KUDER-RICHARDSON FORMULA 20 (KR-20)
used to substitute for coefficient alpha when test items are scored dichotomously
CONSTRUCT VALIDITY
extent to which a test measures the hypothetical trait (construct) it is intended to measure
CONTENT VALIDITY
extent to which test adequately samples the domain of information, knowledge, skill that it purports to measure
CORRECTION FOR GUESSING
method used to ensure that examinees do not benefit from "wild" guessing; involves using formula to adjust scores; the resulting distribution has lower mean and larger SD than original distribution
CRITERION CONTAMINATION
bias introduced into a person's criterion score as a result of the knowledge of the scorer about his/her performance on the predictor; tends ot artificially inflate the relationship between the predictor and criterion
CRITERION-REFERENCED INTERPRETATION
interpretation of a test score in terms of a prespecified standard (i.e., terms of % of content correct or predicted performance on an external criterion (e.g., regression equation, expectancy table)
CRITERION-RELATED VALIDITY
validity involving determining the relationship (correlation) between the predictor and the criterion; can be either concurrent or predictive
CROSS VALIDATION
process of re-assessing a test's criterion-related validity on a new sample to check the generalizability of the original validity coefficient; ordinarily the validity coefficient "shrinks" on cross validation b/c the chance factors operating in the original sample are not all present in the sample
FACTOR ANALYSIS
multivariate statistical technique used to determine how many factors (constructs) are needed to account for the intercorrelations among a set of tests, subtests, or test items; used to assess test's construct validity by indicating the extent to which the test correlates with factors that it would and owuld not be expected to correlate with
INCREMENTAL VALIDITY
extent to which predictor increases decision-making accuracy; calculated by subtracting the base rate from the positive hit rate; terms to have linked with this are predictor and criterion cutoff scores
INTER-RATER RELIABILITY
measure of degree of consistency of scores for 2 or more rates; can be measured either by correlation coefficient or percent agreement
ITEM DIFFICULTY INDEX
measure of an item's difficulty level; calculated by dividing number of individuals who answered the item correctly by total number of individual; ranges in valie from 0 (very difficult) to 1.0 (very easy); .50 is preferred
ITEM RESPONSE THEORY (IRT)
approach to test construction that involves identifying an item characteristic curve for each test items; vs. classical test theory and has advantage identified item parameters are sample invariant; scores from different tests or different sets of test items can be equated
MULTITRAIT MULTI-METHOD MATRIX
systematic way to organize the correlation coefficient obtained when assessing a measure's convergent and discriminant validity; requires measuring at least 2 different traits using two different methods for each trait
NORM-REFERENCED INTERPRETATION
interpretation of an examinee's test performance relative to the performance of examinees in a normative (standardization) sample; %tile rank, standard score, age and grade equivalent scores are examples
ORTHOGONAL ROTATION
factor analysis; produces uncorrelated factors; rotation done to simplify the interpretation of identified factors
OBLIQUE ROTATION
factor analysis; produces correlated factors
PERCENTAGE SCORE
type of criterion-referenced (aka content referenced) score that indicates the percent of test content (items) endorsed or answered correctly;
PERCENTILE RANK
type of norm-referenced score that indicates the percent of examinees in the normative group who obtained lower score; 60% of the examinees in the norm group obtained lower raw scores
PRINCIPAL COMPONENT ANALYSIS
multivariate technique similar to factor analysis that is used to identify the fewest dimensions (components) that can explain the variability in a set of tests; amount of variability eplained by each identified component is provided by an eigenvalue
RELATIONSHIP BETWEEN RELIABILITY AND VALIDITY
reliability is necessary but not sufficient for validity; in terms of criterion related validity-validity coeffficent can be no greater than the square root of the product of the reliabilities of the predictor and criterion
RELIABILITY
consistency of test scores (i.e., extent to which test measures an attribute without being affected by random fluctuations -measurement errors-that produce inconsistencies over time;
TYPES OF RELIABILITY
test-retest, alternative form, split-half, coefficient alpha, inter-rater
SPEARMAN-BROWN FORMULA
used to estimate the reliability of a test if it were lengthended or shortened, with items from the same content domain or measuring the same construct; used with split-half reliablity to dtermine what reliability coefficient would have been had it been on full length test
SPLIT-HALF RELIABILITY
method of assessing internal consistency relability that involves "splitting" the test half; contet sampling is the primary source of error; using corrected with Spearman Brown formula
STANDARD ERROR OF ESTIMATE
an index of error when predicting criterion scores from predictor scores; used to construct a confidence interval around an examinee's predicted criterion score; magnitude depends on: 1. criterion's SD, 2. predictors validity coefficient
STANDARD ERROR OF MEASUREMENT
an index of measurement error; used to construct confidence interval around an examinee's obtained test score; magnitude depends on 1. test's SD, 2. test's reliability coefficient
STANDARD SCORE
transformed score that reports test performance in terms of SD unites from the mean achieved by normative sample; t-scores, z-scores, deviation IQs
TEST-RETEST RELIABILITY
method for assessing reliability that invovles administering the same test to the same group on 2 different occasions and correlating the 2 sets of scores; time sampling factors are primary sources of error; yields a coefficient stability
VALIDITY
extent to which test accurately measures what it is intended to measure; three types: content, construct, and criterion-related