Use LEFT and RIGHT arrow keys to navigate between flashcards;
Use UP and DOWN arrow keys to flip the card;
H to show hint;
A reads text to speech;
22 Cards in this Set
- Front
- Back
- 3rd side (hint)
variable relationships
|
variables are associated if knowing the value of one variable tells you something about the other variable
|
ex: if it's a boy, it's not a girl
|
|
response variable
|
measures an outcome of a study
|
dependent variable (y)
|
|
explanatory variable
|
explains or causes changes in a response variable
|
independent variable (x)
|
|
scatterplots
|
show the relationship between the two quantitative variables for the same individuals
|
plot values of one variable on x and other on y
|
|
direction - positive association
|
on avg, an increase in one variable increases another variable
|
|
|
direction - negative association
|
on avg, a decrease in one variable decreases another variable
|
|
|
correlation (r)
|
measures the direction and strength of a linear relationship between two quantitative variables
|
|
|
correlation properties - variables
|
doesn't matter what variable is x or y, doesn't tell which variables are expl. or resp., both must be quantitative, correlation has no units
|
|
|
correlation properties - association
|
positive (r) indicates positive association, negative (r) indicates negative association
|
|
|
correlation properties - range
|
ranges from -1≼r≼1; close to 0 - weak relationship, close to 1 - strong + trend, close to -1 - strong - trend, if r=∓1, then data lies on a straight line
|
|
|
regression line
|
straight line that describes how response (y) changes as explanatory (x) changes
|
|
|
prediction
|
use regression line to predict outside range of data
|
|
|
extrapolation
|
using regression line to predict outside the range of data
|
may or may not be a good prediction based on accuracy of regression
|
|
Least Square Regression line ("y on x")
|
the line that minimizes the sum of squares of the vertical distances between line and data points
|
|
|
interpreting the regression line
|
always passes through (x̄, y bar)
|
|
|
square of the correlation
|
r²=variance of predicted values ŷ/variance of actual values y
|
|
|
residual
|
error between the real value and estimated value
|
y=ŷ-e
|
|
residual plot
|
scatterplot of the residuals (y-axis) vs explanatory variable (x-axis)
|
help us determine if LSR is good fit...if data is randomly placed around line y=0, LSR is good fit
|
|
outlier
|
observation that lies outside the overall pattern
|
outliers in (y) direction have large residuals but other outliers may not have large residuals
|
|
influential observation
|
observation that drastically changes the statistical calculations if it is removed
|
outliers in (x) direction are influential
|
|
lurking variable
|
variable that is neither explanatory or response in your study, but influences those variables
|
can falsely suggest a relationship
|
|
restricted range problem
|
data set doesn't contain the full set of values that can happen
|
can produce misleading results
|