Use LEFT and RIGHT arrow keys to navigate between flashcards;
Use UP and DOWN arrow keys to flip the card;
H to show hint;
A reads text to speech;
43 Cards in this Set
- Front
- Back
Association
|
Values of one variable tend to occur with certain values of another. Conditional distributions differ from marginal
|
|
Bivariate data
|
data collected on two variables for each individual in a study
|
|
CLT
|
When sampling from a non-Normal population, the sampling distribution of is approximately Normal whenever the sample is large and random.
|
|
conditional distribution:
|
The distribution of the values in a single row (or a single column) of a two-way table.
|
|
correlation coefficient
|
A measure of the strength of the linear relationship between two quantitative variables.
|
|
law of large numbers:
|
The fact that the average ( ) of observed values in a sample will get closer and closer
to μ as the sample size increases. |
|
least squares:
|
A method for finding the equation of a line that minimizes the sum of squared residuals.
|
|
least squares regression line:
|
The line with the smallest sum of squared residuals.
|
|
marginal distribution:
|
The distribution of the values in the “total” row (or the “total” column) of a two-way
table. |
|
parameter
|
A characteristic of a population that is usually unknown; this could be mean, median, proportion,
standard deviation computed on all the data from the population.; a parameter does not have variability. |
|
parameter symbols:
|
μ, σ, and p (mean of population, standard deviation of population, proportion of a
population, respectively) |
|
probability of an outcome:
|
A measure of the proportion of times an outcome occurs in a very long series of
repetitions that gives us an indication of the likelihood of the outcome. |
|
process:
|
Sequence of operations used in production, manufacturing, etc.
|
|
process in statistical control:
|
A process whose inputs and outputs exhibit natural variation when observed over
time. |
|
quality control chart:
|
A chart plotting the means of regular samples of size n against time; this chart is used
to access whether the process is in control. |
|
quantitative bivariate needs what type of analysis?
|
The type of data required for regression analysis.
|
|
r
|
The symbol for correlation coefficient.
|
|
r2:
|
The percentage of total variation in the response variable, Y, that is explained by the regression equation; in
other words, the percentage of total variation in the response variable, Y, that is explained by the explanatory variable, X. |
|
random:
|
A phenomenon that describes the uncertainty of individual outcomes in the short run, but gives a regular distribution of the outcomes in the long run.
|
|
random variable:
|
A variable whose value is a numerical outcome of a random phenomenon.
|
|
regression equation
|
A formula for a line that models a linear relationship between two quantitative variables.
|
|
residual
|
The observed y minus the predicted y; denoted: y ; prediction error
|
|
sample mean, x bar :
|
The random variable of the sampling distribution of xbar .
|
|
sampling distribution
|
A distribution of a statistic; a list of all the possible values of a statistic together with
the frequency (or probability) of each value. |
|
sampling distribution of xbar
|
A list of all the possible values for xbar together with the frequency (or probability) of each value; in other words, the distribution of all xbar’s from all possible samples.
|
|
sampling variability:
|
The variability of sample results from one sample to the next; something we must
measure in order to effectively do inference. |
|
scatterplot:
|
A two dimensional plot used to examine strength of relationship between two variables as well as
direction and type of relationship. |
|
simulation:
|
Using random numbers to imitate chance behavior.
|
|
slope:
|
A measure of the average change in the response variable for every one unit increase in the explanatory
or independent variable. |
|
standard deviation of xbar (also called the standard deviation of the sampling distribution of xbar)
|
A measure of the variability of the values of the statistic xbar about μ; a measure of the variability of the sampling distribution of xbar; in other words, the Aaverage@ amount that the statistic, xbar , deviates from its
mean μ. Computed as sigma/square root n. |
|
statistic
|
A number computed from sample data (without any knowledge of the value of a parameter) used to estimate the value of the parameter.
|
|
statistic symbols
|
xbar , s, p-hat (mean of sample, standard deviation of sample, proportion of sample, respectively)
|
|
statistical process control:
|
A procedure used to check a process at regular intervals to detect problems and correct them before they become serious.
|
|
sum of squared residuals (or error):
|
the residuals are squared and added.
|
|
total variation in Y
|
The sum of the squared deviations of the Y observations about their mean, y-hat.
|
|
two-way table
|
A table containing counts for two categorical variables. It has r rows and c columns.
|
|
unexplained variation
|
The sum of squared residuals
|
|
fun facts about r
|
gives direction and strength of linear relationship between x & y
affected by outliers association NOT causation Need quantitative range from -1 to 1 0=not linear x predict y, y predict x needs to be quantitative variables no units |
|
Theoretical Sampling distribution of xbar
|
All x bar values from ALL possible sample of the same size from the population, and miu=xbar
|
|
As n increases, what happens to sd?
|
it decreases.
|
|
What does this notation mean? N(1.0875, 0.015)
|
N=Normal
1.0875 is mean .015 is SDm |
|
mean of sampling distribution ALWAYS equals miu
|
always!
|
|
Remember CLT does not compute probablilities on DATA Rather sample MEANS
|
yup
|