• Shuffle
    Toggle On
    Toggle Off
  • Alphabetize
    Toggle On
    Toggle Off
  • Front First
    Toggle On
    Toggle Off
  • Both Sides
    Toggle On
    Toggle Off
  • Read
    Toggle On
    Toggle Off
Reading...
Front

Card Range To Study

through

image

Play button

image

Play button

image

Progress

1/50

Click to flip

Use LEFT and RIGHT arrow keys to navigate between flashcards;

Use UP and DOWN arrow keys to flip the card;

H to show hint;

A reads text to speech;

50 Cards in this Set

  • Front
  • Back
Distribution
Distribution of a quantitative variable slices up all the possible values of the variable equal- width bins and gives the number of values (or counts) falling into each bin
Histogram
A histogram uses adjacent bars to show the distribution of a quantitative variable. Each represents the frequency (or relative frequency) of values falling in each bin
Gaps
a region where there is no value in histogram
Relative Frequency Histogram
replacing the count on the vertical axis with the percentage of the total number of cases falling in each bin
Stem-and-Leaf Display
show quantitative data values in a ways that sketches the distribution of the data
Dot Plot
a dot for each case against a single axis
Describe a Distribution
1. Shape
2. Center
3. Spread
Shape
Look for:
1. Single v. Multiple Modes
2.Symmetry vs.Skewness
3. Outliers and gaps
Center
The place in the distribution of a variable that yo's point to if you wanted to attempt the impossible by summarizing the entire distribution with a single number. Measures of center include the mean and median.
Spread
A numerical summary of how tightly the values are clustered around the center. Measures of spread include the IQR and standard deviation.
Mode
A hump or local high point in the shape of the distribution of a variable. The apparent location of modes can change as the scale of a histogram is changed.
(Uni, Bi,or Multi)modal
Uni- one mode
Bi- two moders
Multi- more that three
Uniform
a distribution that's roughly flat, no humps
Symmetric
A distribution is symmetric to the two halves on either sides of the center look approximately like mirror images of each other.
Tails
parts that typicallly trail off on either side
Skewed
if not symmetric and one tail stretches out farther than the other
Outliers
Extreme values that don't appear to belong to rest of the data
Midrange
Average; not for histogram too sensitive with outliers
Median
(n+1)/2, if number comes out odd average the two places
Range
Max.- Min.
Quartile
The mean and quartiles divide data into four parts with equal number of data value.
Interquartile Range
upper quartile- lower quartie
Percentiles
the ith percentile is the number that falls above i% data.

lower quartile- 25%
Upper quartile- 75%
5-Number Summary
summary of a distribution reports the, Min,Q1, Med, Q3, and Max.
Mean
found by adding all the data values and dividing by the count
(formula)
usually paired by standard deviation
Resistance
a calculated summary is said to be resistant if it is affected only by a limited amount of outliers.
Variance
the sum of squared deviation from the mean, divided by the count minus 1.
Standard Deviation
square roots of the variance
Quartile
The mean and quartiles divide data into four parts with equal number of data value.
Interquartile Range
upper quartile- lower quartie
Percentiles
the ith percentile is the number that falls above i% data.

lower quartile- 25%
Upper quartile- 75%
5-Number Summary
summary of a distribution reports the, Min,Q1, Med, Q3, and Max.
Mean
found by adding all the data values and dividing by the count
(formula)
usually paired by standard deviation
Resistance
a calculated summary is said to be resistant if it is affected only by a limited amount of outliers.
Variance
the sum of squared deviation from the mean, divided by the count minus 1.
Standard Deviation
square roots of the variance
Comparing Distribution
Consider their:
1. Shaper
2. Center
3. Spread
Comparing Box Plots
Compare:
1. shapes
2. medians
3. IQRs
4. Outliers
Timeplot
displays data that change over time
Standardizing
We standardize to eliminate units. Standardized values can be compared and combined even if the original variables had different unit and magnitude.
Standardized Value
(n-mean)/sd
Z-score
tells hoes many standard deviation a value is from the mean; z-scores have a mean of 0 and standard deviation of 1

(look at formulas)
Shifts
adding or subtracting a constant to each value, all measures of position will increase by the same constant.

Leaves the spread unchanged
Rescale
multiply or divide all the data values by a constant: the five points ares multiplied or divided by that constants
Standard deviation to:
Shapes
Center
Spread
Standardizing into z-scores does:

1. not change the distribution of a variable

2. changes the center by making the mean 0

3, changes the spread by making the standard deviation 1
Standard Normal Mode
Mean=0
Deviation=1
Standard Normal Distribution
Normal Models
"bell-shaped curves"
appropriate for distributions whose shapes are unimodal and roughly symmetric
Parameters
not numerical summaries of data
part of the model

Number we use to specify the model

N(mean,sd)
Nearly Normal Condition
A distribution is nealy Normal id it is unimodal and symmetric. WE can check by looking at a histogram or a normal probability plot.
68-95-99.7 Rule
In normal models:

68% fall within 1 standard deviation of the mean

95% fall within 2 standard deviation of the mean

99.7% fall within 3 standard deviation of the mean