• Shuffle
    Toggle On
    Toggle Off
  • Alphabetize
    Toggle On
    Toggle Off
  • Front First
    Toggle On
    Toggle Off
  • Both Sides
    Toggle On
    Toggle Off
  • Read
    Toggle On
    Toggle Off
Reading...
Front

Card Range To Study

through

image

Play button

image

Play button

image

Progress

1/39

Click to flip

Use LEFT and RIGHT arrow keys to navigate between flashcards;

Use UP and DOWN arrow keys to flip the card;

H to show hint;

A reads text to speech;

39 Cards in this Set

  • Front
  • Back
Individual
the objects described by a set of data.
variable
any characteristic of an individual
categorical variable
places an individual into one of several groups or categories
quantitative variable
takes numerical values for which arithmetic operations such as adding or subtracting make sense
distribution
tells us what values it takes and how often it takes these values
rules for a histogram
1. The bars of a histogram should cover the entire range of values of a variable. When the possible values of a variable have gaps between them, extend the bases of the bars to meet halfway between the two adjacent possible values
describing overall pattern
shape, center, spread and check for outliers
how to make a stemplot
1. separate each observation into stem/leaf 2. Write the stems in a vertical column with smallest at top 3. Write each leaf in the row to the right of its stem in increasing order out 4. For double stems, 0-4 goes on the upper stem and 5-9 goes on the lower stem
mean
average (x1+x2+x3….)/n
resistant measure
because the mean cannot resist the influence of extreme observations, it is not a resistant measure. Outliers may pull the mean toward its long tail and skew the mean. The median is resistant. Standard deviation is not resistant either.
median
midpoint
how to find the median
1. Arrange all observations in order of size, from smallest to largest 2. If the number of observations, n, is odd, the median is the center. Find the location by counting (n+1)/2. If n is even, average the two center observations
relationship between the mean and the median graphically
in a symmetric distribution, they are close together. If the distribution is exactly symmetric, the mean and median are exactly the same. In a skewed distribution, the mean is farther out n the long tail than the median
How to measure spread:
look at the spread of the middle half of the data where there are not outliers, that is look at the quartiles
first quartile
lies one quarter of the way up the list
third quartile
larger than 75% of the observations
how to make a boxplot
take the 5 number summary. Draw horizontal lines at the 3rd quartile, the median and the first quartile. Then draw vertical lines to the maximum and minimum.
variance
(s squared) of a set of observations is an average of the squares of the deviations of the observations from their mean
standard deviation
(s) measures spread by looking at how far the observations are from their mean
density curve rules
1. Always on or above the horizontal axis 2. Has area exactly 1 underneath it… describes the overall pattern of a distribution. The area under the curve and above any range of values is the proportion of all observations that fall in that range
median on a density curve
equal areas point with half the area under the curve to the left and the remaining to the right
mean on a density curve
the point at which the curve would balance if it was made of solid material. The mean and median of a symmetric density curve are equal
mu
the notation for the mean of an idealized distribution
Normal distributions
normal curves that are symmetric, single-peaked, and bell-shaped.
68-95-99.7 rule
68% of the observations fall within 1 standard deviation on either side of the mean, 95% of the observations fall within 2 standard deviations on either side of the mean, 99.7% of the observations fall within 3 standard deviations on either side of the mean
standardized value
z= (x-mu)/alpha also known as Z score
z score
tells us how many standard deviations the original observation falls away from the mean and in which direction. Observations larger than the mean are positive when standardized and vice versa
the standard normal distribution
N(0,1) with mean 0 and standard deviation 1. If a variable x has any Normal distribution N(mu, alpha) with mean mu and standard deviation alpha, then the standardized variable is Z
response variable
measures an outcome of a study aka dependent variables
explanatory variable
explains or influences changes in a response variable also known as independent variables
scatterplot
shows the relationship between two quantitative variables measured on the same individuals. The values of one variable appear on the horizontal axis, and the values of the other variable appear on the vertical axis. Each indvidual in the data appears as the point in the plot fixed by the values of both variables for that individual. Always plot the explanatory variable, if there is one, on the horizontal axis of a scatterplot. Explanatory variable = x and response variable = y.
how to examine a scatterplot
chech for overall pattern, striking deviation, direction, strength, outlier
positive association
above average values of one tend to accompany above-average values of the other, and below-average values also tend to occur together
negative association
above average values of one tend to accompany below-average values of the other, and vice versa
strength of a stemplot
determined by how closely the points follow a clear form
correlation
r measures the direction and strength of the linear relationship between two quantitative variables ( you are trying to evaluate how closely points lie to a straight line)
how to assess correlation
1. positive r denotes positive correlation and negative r shows negative correlation. 2. Correlation is always between -1 and 1. Values of r near 0 indicate a weak linear relationship. The strength of the linear relationships increases as r moves toward 1 or -1. -1 or 1 only occur in the case of a perfect linear relationship. 3. Does not describe curved relationships 4. Not resistant
If X~N(6, 2)and a sample of n=4 is taken, then x bar is distributed
N(6,1) (standard deviation/root n)
If X~N (6,2) and a sample of n=4 is taken, (x-6)/2 is distributed
N(0,1) equal to Z