Use LEFT and RIGHT arrow keys to navigate between flashcards;
Use UP and DOWN arrow keys to flip the card;
H to show hint;
A reads text to speech;
43 Cards in this Set
- Front
- Back
Simple random sample (SRS)
|
size n consists of n individuals from the population chosen in such a way that every set of n individuals has an equal chance to be the sample actually selected.
|
|
probability sample
|
a sample chosen by chance. We must know what samples are possible and what chance, or probability, each possible sample has.
|
|
confounding
|
Two variables are confounded when their effects on a response variable cannot be distinguished from each other.
|
|
z-score
|
the standardized value of x
|
|
interquartile range (IQR)
|
the distance between the first and third quartiles
|
|
mean
|
Arithmetic average of the observations in a data set.
|
|
median
|
Midpoint of the values in a data set.
|
|
Q1
|
the median of the observations whose position in the ordered list is to the left of the location of the overall median.
|
|
Q3
|
the median of the observations whose position in the ordered list is to the right of the location of the overall median.
|
|
standard deviation
|
Measure of the spread about the mean of a distribution. It is an average of the squares of the deviations of the observations from their mean, also equal to the square root of the variance.
|
|
range
|
all numbers in a set of given numbers
|
|
distribution
|
The values and the frequency of values taken by a variable
|
|
five number summary
|
Description of a distribution consisting of the smallest observation, the first quartile, the median, the third quartile, and the largest observation, written in order from smallest to largest.
|
|
resistant measure
|
cannot resist the influence of extreme observations
|
|
statistically significant
|
An observed effect so large that it would rarely occur by chance is called
|
|
explanatory variable
|
may explain or influence changes in a response variable
|
|
response variable
|
measures an outcome of a study
|
|
population
|
The entire group of individuals that we want information about.
|
|
sample
|
A part of the population that we actually examine in order to gather information
|
|
individual (or basic unit)
|
Object described by a set of data. Individuals may be people, but they may also be animals or things
|
|
stratified sample
|
A sample composed of separate SRSs chosen for different strata of a population, groups of individuals that are similar in some way that is important to the response
|
|
multistage sample
|
a form of stratified sampling where you take large groups and break them down into smaller groups before SRSs are chosen
|
|
convenience sample
|
A sample selected by taking the members of the population that are easiest to reach. This often produces unrepresentative data
|
|
voluntary response sample
|
consists of people who choose themselves by responding to a broad appeal. Voluntary response samples are biased because people with strong opinions are most likely to respond
|
|
completely random
|
all the subjects are allocated at random among all the treatments.
|
|
randomized block design
|
the random assignment of individuals to treatments is carried out separately within each block.
|
|
matched pairs
|
A type of experiment in which each subject receives both treatments in a random order, or the subjects are matched in pairs as closely as possible, and one subject in each pair receives each treatment
|
|
lurking variable
|
A variable that has an important effect on the relationship among the variables in a study but is not included among the variables studied.
|
|
response bias
|
individuals in the sample will sometimes forget or lie
|
|
non-response bias
|
occurs when an individual chosen for the sample can’t be contacted or refuses to participate
|
|
question wording bias
|
Confusing or leading questions can introduce strong bias, and even minor changes in wording can change a survey’s outcome
|
|
interviewer bias
|
Careful training of interviewers and careful supervision to avoid variation among the interviewers can reduce response bias
|
|
undercoverage bias
|
occurs when some groups in the population are left out of the process of choosing the sample
|
|
placebo effect
|
The response to a dummy treatment
|
|
lack of realism
|
the subjects or treatments or setting of an experiment may not realistically duplicate the conditions we really want to study
|
|
double blind
|
An experiment in which neither the subjects nor the people who work with them know which treatment each subject is receiving.
|
|
observational study
|
A study to observe individuals and measure variables of interest, but do not attempt to influence the responses
|
|
experiment
|
deliberately imposes some treatment on individuals in order to observe their responses. The purpose of an experiment is to study whether the treatment causes a change in the response.
|
|
quantitative
|
A variable that takes numerical values that measure some characteristic of each individual, like height in centimeters or salary in dollars per year. Uses histograms and stemplots. For small data sets, a stemplot is quicker to make and presents more detailed information.
|
|
categorical
|
A variable that places each individual into a category. Uses bar graphs and pie charts. Use a pie chart only when you want to emphasize each category’s relation to the whole
|
|
steps of statistical problem solving
|
STATE: What is the practical question, in the context of the real-world setting?
FORMULATE: What specific statistical operations does this problem call for? SOLVE: Make the graphs and carry out the calculations needed for this problem. CONCLUDE: Give your practical conclusion in the setting of the real-world problem. |
|
density curves
|
An idealized description of the overall pattern of a distribution that smooths out the irregularities in the actual data. A density curve has total area 1 underneath it.
|
|
normal distributions
|
Distribution described by a special family of bell-shaped, symmetric density curves, called Normal curves. The mean μ and standard deviation σ completely specify a Normal distribution N(μ, σ).
|