Use LEFT and RIGHT arrow keys to navigate between flashcards;
Use UP and DOWN arrow keys to flip the card;
H to show hint;
A reads text to speech;
43 Cards in this Set
- Front
- Back
individuals |
objects described by a set of data |
|
variable |
any characteristic of an individual |
|
categorical variable |
places an individual into one of several groups or categories (pie and bar) |
|
quantitative variable |
takes a numerical value (histogram and stem plot) |
|
unit of measurement |
seconds or kilograms |
|
distribution (of a variable) |
what values it takes and how often |
|
distribution of a categorical variable |
lists categories and gives either the count or the percent in each category |
|
time plot |
plots each observation against the time that it was measured |
|
what best plots quantitative variables? |
histograms and stemplots |
|
what kind of variable is age? |
quantitative |
|
Q1 and Q3 |
medians on each side of the median |
|
five number summary |
minimum, Q1, M, Q3, maximum |
|
boxplot |
central box spans Q1-Q3 line in box is M lines extend from box to min and max |
|
IQR |
= Q3-Q1 |
|
identifying outliers |
1.5 x IQR |
|
resistant measures |
median and quartiles |
|
non-resistant measures |
mean, correlation and SD |
|
N(mean, SD) |
mean and SD of a normal distribution (0,1) |
|
z= |
x-mean/SD |
|
standardized value |
Z |
|
response variable |
measures an outcome of a study (altered by the explanatory variable) |
|
explanatory variable |
explain or influence change in a response variable |
|
scatterplot |
displays the relationship between two quantitative variables measured on the same individual |
|
correlation (r) |
measures the direction and strength of the linear association (positive and negative associations) |
|
regression line |
straight line that describes how a response variable changes as an explanatory variable changes |
|
least-squares regression line |
the line that minimizes the sum of the squares of the vertical distances of the observed points to the lines |
|
influential observations |
individual points that substantially change the correlation or regression line |
|
ecological correlation |
the tendency for correlations based on averages to be stronger than correlations based on individuals |
|
extrapolation |
use of a regression line to predict values far outside the range of what was actually observed |
|
lurking variable |
may explain the relationship between the explanatory and response variable |
|
conditional distribution |
distributions of a row variable for each fixed value of the column variable, and vis versa (each value in the column as a percent of the column total) |
|
bar graphs |
flexible means of presenting categorical data |
|
sample survey |
selects a sample from the population of all individuals about which we desire information |
|
random sampling |
uses chance to select a sample |
|
stratified random sampling |
classify the population into strata, and then choose SRS from each stratum |
|
bias |
systematic error in the way the sample represents the population |
|
voluntary response samples are particularly prone to
|
bias |
|
forms of bias |
-undercoverage -nonresponse -response bias -poorly worded questions |
|
experiments |
actively impose treatment (vs observational studies) |
|
confounded variable |
effects on a response cannot be distinguished |
|
randomization |
uses chance to assign subjects to the treatments |
|
blocks |
individuals who are similar in some way |
|
matched pairs |
subject receives both treatments, or subjects are paired as similarly as possible to compare treatments |