Use LEFT and RIGHT arrow keys to navigate between flashcards;
Use UP and DOWN arrow keys to flip the card;
H to show hint;
A reads text to speech;
71 Cards in this Set
- Front
- Back
reliability coefficient |
a proportion that indicates the ratio between the true score variance on a test and a total variance |
|
variance |
statistic useful in describing sources of test score variability |
|
true variance |
variance from true differences |
|
error variance |
variance from irrelevant, random sources |
|
reliability |
proportion of the total variance attributed to true variance |
|
measurement error |
all of the factors associated with the process of measuring some variable, except the one being measured |
|
random error |
error in measuring a targeted variable caused bu unpredictable fluctuations and inconsistencies of other variables |
|
systematic error |
error that is typically constant or proportionate to what is presumed to be the true value of the variable |
|
test construction test administration test scoring and interpretation |
sources of error variance |
|
item sampling |
(source of error variance) variation among items within a test and between tests |
|
test environment |
(source of error variance) room temperature, level of lighting, amount of ventilation, noise |
|
testtaker variables |
(source of error variance) pressing emotional problems, physical discomfort |
|
examiner-related variables |
(source of error variance) examiner's physical appearance and demeanor |
|
- test-retest method - parallel forms / alternate forms method - split half method - KR20 - KR21 - coefficient alpha |
models of reliability |
|
test-retest method |
used to evaluate error associated with administrring a test on two different times |
|
coefficient of stability |
test-retest coefficient of reliability |
|
coefficient of equivalence |
parallel/alternate forms coefficient of reliability |
|
parallel forms |
when the means and variances of observed test forms are equal |
|
parallel forms method |
compares 2 equivalent forms of a test that measure the same attribute sometimes the two forms are given at different times |
|
alternate forms |
simply differebt versions of a test |
|
alternate forms reliability |
estimate to which different forms of the same tests have been affected |
|
internal consistency estimate of reliability |
evaluation of the internal consistency of the test items |
|
split-half method |
test is given and divided into halves that are scored separately, then results of the halves will be compared |
|
odd-even reliability |
assigning odd numbered items to one half of the test and even numbered items to the other half |
|
Spearman-Brown formula |
estimate of what the correlation between two halves would have been if each half had been the length of the whole test |
|
inter item consistency |
degree of correlation among all the items on the scale |
|
homogeneity |
degree to which a test measures a single item |
|
heterogeneity |
degree to which a test measures different factors |
|
Kuder-Richardson formulas |
formula used on tests with dichotomous items (scored 0 or 1) |
|
coefficient alpha |
used only in tests with non-dichotomous items (not scored 0 or 1) |
|
average proportional distance |
measure used to evaluate the internal consistency of a test that focuses on the degree of difference that exists between item scores |
|
inter-scorer reliability |
sample of tests is independently scored by two or more examiners then correlated |
|
homogenous - high degree of internal consistency heterogenous - low degree of internal consistency |
homogenous - ____ degree of internal consistency heterogenous - ____ degree of internal consistency |
|
dynamic characteristics |
trait, state or ability presumed to be ever-changing as a function of situational and cognitive experiences |
|
static characteristics |
trait, state or ability presumed to be relatively unchanging |
|
low |
if the variance if either variable is restricted by the sampling procedure used, then the resulting correlation coefficient tends to be _____ |
|
high |
if the variance if either variable is inflated by the sampling procedure used, then the resulting correlation coefficient tends to be _____ |
|
power test |
test when a time limit is long enough and some items are so difficult |
|
speed test |
test that contains items if uniform level of difficulty and generous time limit |
|
criterion-referenced test |
test designed to provide an indication of where a testtaker stands with respect to some criterion |
|
domain sampling theory |
considers the problems created by using a limited number of items to represent a larger, more complicated construct |
|
generalizability theory |
alternative model of measurement based on the idea that a person's test scores vary from testing to testing because of variable in the testing situation |
|
generalizability study |
examines how generalizable scores from a particular test are if the test is administered in different situations |
|
coefficients of generalizability |
influence of particular facets on the test score |
|
decision study |
examine the usefulness of test scores in helping the tests user make decisions |
|
item response theory (latent trait theory) |
computer focuses on the range of difficulty that helps assess the individual's ability level |
|
dichotomous test items |
test items answered with only one of two alternative responses |
|
polychotomous test items |
test items answered with three or more alternative responses |
|
standard error of measurement |
tool used to estimate or infer the extent to which an observed score deviates from a true score |
|
standard error of a score |
an index if the extent to which one individual's scores vary over tests presumed to be parallel |
|
confidence interval |
a range or band of test scores that is likely to contain the true score |
|
standard error of difference |
determines how large a difference should be before it is considered statistically significant |
|
test-retest method |
only of value when measuring traits that do not change over time |
|
carryover effect |
first session influences scores from the second session |
|
- careyover effect - some skills improve with prwctice - time interval between testing sessions |
points to consider in test-retest method |
|
parallel forms method |
one of the most rigorous assessments of reliability |
|
split-half method |
test scores gain reliability as the number of items increases |
|
coefficient alpha |
most general reliability coefficient |
|
KR20 formula |
considers all possible ways of splitting the items |
|
content sampling and heterogeneity |
two sources of error variance |
|
KR20 formula |
20th formula presented in the famous article of Kuder and Richardson |
|
KR21 formula |
formula that does not require the calculation of percentage who got every item right and wrong |
|
KR21 formula |
uses an approximation of the sum of the pq products (mean test score) |
|
all items must be of equal difficulty or that the average difficulty level is 50% |
points to consider in KR21 formula |
|
larger |
as the sample gets _____, it represents the domain more and more accurately |
|
higher |
greater number of items means ______ reliability |
|
charles spearman (1904) |
responsible for the advancement of reliability assessment |
|
abraham de moivre (1733) |
introduced the basic notion of sampling error |
|
karl pearson (1846) |
developed the product moment correlation |
|
classical test score theory |
assumes that each person has a true score thay would be obtained if there were no errors in the measurement |
|
classical test score theory |
uses the SD of errors and is usually called standard error of measurement |