Use LEFT and RIGHT arrow keys to navigate between flashcards;
Use UP and DOWN arrow keys to flip the card;
H to show hint;
A reads text to speech;
50 Cards in this Set
- Front
- Back
Distribution
|
Distribution of a quantitative variable slices up all the possible values of the variable equal- width bins and gives the number of values (or counts) falling into each bin
|
|
Histogram
|
A histogram uses adjacent bars to show the distribution of a quantitative variable. Each represents the frequency (or relative frequency) of values falling in each bin
|
|
Gaps
|
a region where there is no value in histogram
|
|
Relative Frequency Histogram
|
replacing the count on the vertical axis with the percentage of the total number of cases falling in each bin
|
|
Stem-and-Leaf Display
|
show quantitative data values in a ways that sketches the distribution of the data
|
|
Dot Plot
|
a dot for each case against a single axis
|
|
Describe a Distribution
|
1. Shape
2. Center 3. Spread |
|
Shape
|
Look for:
1. Single v. Multiple Modes 2.Symmetry vs.Skewness 3. Outliers and gaps |
|
Center
|
The place in the distribution of a variable that yo's point to if you wanted to attempt the impossible by summarizing the entire distribution with a single number. Measures of center include the mean and median.
|
|
Spread
|
A numerical summary of how tightly the values are clustered around the center. Measures of spread include the IQR and standard deviation.
|
|
Mode
|
A hump or local high point in the shape of the distribution of a variable. The apparent location of modes can change as the scale of a histogram is changed.
|
|
(Uni, Bi,or Multi)modal
|
Uni- one mode
Bi- two moders Multi- more that three |
|
Uniform
|
a distribution that's roughly flat, no humps
|
|
Symmetric
|
A distribution is symmetric to the two halves on either sides of the center look approximately like mirror images of each other.
|
|
Tails
|
parts that typicallly trail off on either side
|
|
Skewed
|
if not symmetric and one tail stretches out farther than the other
|
|
Outliers
|
Extreme values that don't appear to belong to rest of the data
|
|
Midrange
|
Average; not for histogram too sensitive with outliers
|
|
Median
|
(n+1)/2, if number comes out odd average the two places
|
|
Range
|
Max.- Min.
|
|
Quartile
|
The mean and quartiles divide data into four parts with equal number of data value.
|
|
Interquartile Range
|
upper quartile- lower quartie
|
|
Percentiles
|
the ith percentile is the number that falls above i% data.
lower quartile- 25% Upper quartile- 75% |
|
5-Number Summary
|
summary of a distribution reports the, Min,Q1, Med, Q3, and Max.
|
|
Mean
|
found by adding all the data values and dividing by the count
(formula) usually paired by standard deviation |
|
Resistance
|
a calculated summary is said to be resistant if it is affected only by a limited amount of outliers.
|
|
Variance
|
the sum of squared deviation from the mean, divided by the count minus 1.
|
|
Standard Deviation
|
square roots of the variance
|
|
Quartile
|
The mean and quartiles divide data into four parts with equal number of data value.
|
|
Interquartile Range
|
upper quartile- lower quartie
|
|
Percentiles
|
the ith percentile is the number that falls above i% data.
lower quartile- 25% Upper quartile- 75% |
|
5-Number Summary
|
summary of a distribution reports the, Min,Q1, Med, Q3, and Max.
|
|
Mean
|
found by adding all the data values and dividing by the count
(formula) usually paired by standard deviation |
|
Resistance
|
a calculated summary is said to be resistant if it is affected only by a limited amount of outliers.
|
|
Variance
|
the sum of squared deviation from the mean, divided by the count minus 1.
|
|
Standard Deviation
|
square roots of the variance
|
|
Comparing Distribution
|
Consider their:
1. Shaper 2. Center 3. Spread |
|
Comparing Box Plots
|
Compare:
1. shapes 2. medians 3. IQRs 4. Outliers |
|
Timeplot
|
displays data that change over time
|
|
Standardizing
|
We standardize to eliminate units. Standardized values can be compared and combined even if the original variables had different unit and magnitude.
|
|
Standardized Value
|
(n-mean)/sd
|
|
Z-score
|
tells hoes many standard deviation a value is from the mean; z-scores have a mean of 0 and standard deviation of 1
(look at formulas) |
|
Shifts
|
adding or subtracting a constant to each value, all measures of position will increase by the same constant.
Leaves the spread unchanged |
|
Rescale
|
multiply or divide all the data values by a constant: the five points ares multiplied or divided by that constants
|
|
Standard deviation to:
Shapes Center Spread |
Standardizing into z-scores does:
1. not change the distribution of a variable 2. changes the center by making the mean 0 3, changes the spread by making the standard deviation 1 |
|
Standard Normal Mode
|
Mean=0
Deviation=1 Standard Normal Distribution |
|
Normal Models
|
"bell-shaped curves"
appropriate for distributions whose shapes are unimodal and roughly symmetric |
|
Parameters
|
not numerical summaries of data
part of the model Number we use to specify the model N(mean,sd) |
|
Nearly Normal Condition
|
A distribution is nealy Normal id it is unimodal and symmetric. WE can check by looking at a histogram or a normal probability plot.
|
|
68-95-99.7 Rule
|
In normal models:
68% fall within 1 standard deviation of the mean 95% fall within 2 standard deviation of the mean 99.7% fall within 3 standard deviation of the mean |