Use LEFT and RIGHT arrow keys to navigate between flashcards;
Use UP and DOWN arrow keys to flip the card;
H to show hint;
A reads text to speech;
28 Cards in this Set
- Front
- Back
Bimodal
|
Distributions with two modes
|
|
Boxplot
|
A boxplot displays the 5-number summary as a central box with whiskers that extend to the non-outlying values. Boxplots are particularly effective for comparing groups
|
|
Center
|
The middle of the distribution, usually summarized numerically by the mean or the median
|
|
Distribution
|
The distribution of a variable gives:
--Possible values of the variable --Frequency or relative frequency of each value |
|
Five-number summary
|
A five-number summary for a variable consists of:
--The minimum and maximum --The quartiles Q1 and Q3 --The median |
|
Histogram (relative frequency histogram)
|
The histogram uses adjacent bars to show the distribution of values in a quantitative variable. Each bar represents the frequency (relative frequency) of values falling in an interval of values
|
|
Interquartile range (IQR)
|
The difference between the first and third quartiles. IQR = Q3-Q1
|
|
Mean
|
A measure of center found as the sum of the variables divide by the total number of the variables
|
|
Median
|
The middle value with half of the data above it and half below it
|
|
Mode
|
A peak or local high point in the shape of the distribution of a variable. The apparent location of modes can change as the scale of a histogram is changed
|
|
Multimodal
|
Distributions with more than two modes
|
|
Outliers
|
Extreme values that don't appear to belong with the rest of the data. They may be unusual values that deserve further investigation or just mistakes; there's no obvious way to tell
|
|
Quartile
|
The lower quartile (Q1) is the value with a quarter of the data below it. The upper quartile (Q3) has a quarter of the data above it. The median and quartiles divide the data into four equal parts.
|
|
Range
|
The difference between the lowest and the highest values in a data set. Range = maximum-minimum
|
|
Shape
|
The visual appearance of the distribution. To describe the shape, look for:
--single versus multiple modes --symmetry versus skewness |
|
Skewed
|
A distribution is skewed if one tail stretches out farther than the other
|
|
Spread
|
The description of how tightly clustered the distribution is around its center. Measures of spread include the IQR and the standard deviation
|
|
Standard deviation
|
A measure of spread.
The equation says that SD is found by subtracting the mean from each point on the graph, squaring those values and then adding them all up. Then divide by the amount of points on the graph minus one. |
|
Standardized value
|
Standardize a value by subtracting the mean and dividing by the standard deviation for the variable. These values, called z-scored, have no units
|
|
Stationary
|
A time series is said to be stationary if its statistical properties don't change over time
|
|
Stem-and-leaf display
|
A stem and leaf display shows quantitative data values in a way that sketches the distribution of the data. It's best described in detail by example
|
|
Symmetric
|
A distribution is symmetric if the two halves on either side of the center look approximately like mirror images of each other
|
|
Tail
|
The parts of a distribution that typically trail off on either side
|
|
Time series plot
|
Displays data that change over time. Often, successive values are connected with lines to show trends more clearly
|
|
Uniform
|
A distribution that's roughly flat
|
|
Unimodal
|
Having one mode. This is a useful term for describing the shape of a histogram when it's generally mound-shaped
|
|
Variance
|
The standard deviation squared
|
|
Z-score
|
A standardized value that tells how many standard deviations a value is from the mean; z-scores have a mean of 0 and a standard deviation of 1
|