Use LEFT and RIGHT arrow keys to navigate between flashcards;
Use UP and DOWN arrow keys to flip the card;
H to show hint;
A reads text to speech;
50 Cards in this Set
- Front
- Back
What is ordered array?
|
Numerical data sorted by order of magnitude.
Data collected from smallest to largest. |
|
What is ordered array usually display on?
|
stem-and-leaf diagram
|
|
What is a stem-and-leaf diagram?
|
A simple way to see distribution details in a data set.
Stem: leading digits (10, 100) Leaves: trailing digits (11, 24, 190) |
|
What is a frequency distribution?
|
A lsit or table containing class groupings (ranges within which the data fall) and the corresponding frequencies within which data fall in each grouping or category.
|
|
How do you calculate the width interval of class intervals?
|
width of interval =
range/no. of desried class groupings. |
|
What graph is used for numerical data?
|
Histogram.
|
|
What does a histogram display on its horizontal axis?
|
The class boundaries (midpoints).
|
|
What do the bars represent on a histogram?
|
The number or percentage of observations within each class.
|
|
What are the 4 summary measures that describe data numerically?
|
1. Central Tendency
2. Quartiles 3. Variation 4. Shape |
|
What are the measures of central tendency?
|
1. Arithmetic mean
2. Median 3. Mode |
|
Which is the most common measure of central tendency?
|
The mean. However as it is affected by outliers, the median is also often used as it is not sensitive to extreme outliers.
|
|
What is the mean?
|
The average value.
|
|
How do you calculate the mean?
|
mean=
sum of all values/no. of values |
|
What is the median?
|
The middle value in an array.
|
|
How do you find the median?
|
median position=
no. of values +1/2 |
|
What difference does the number of values (odd or even) make to the median?
|
Odd: the median is the middle value.
Even: the median is the average of the two middle numbers. |
|
What is the mode?
|
The most frequent value in an array.
|
|
How many mode's are there in an array?
|
There may be no modes or several modes, it depends on the array.
|
|
What are quartiles?
|
They split the ranked data into four parts, each with an equal number of values.
|
|
What are the five measures of variation?
|
1. Range
2. Interquartile range 3. Variance 4. Standard deviation 5. Coefficient of variation |
|
What do measures of variation give information on?
|
The spread or variablility of the data values.
|
|
How do you calculate the range?
|
Range=
Xlargest - X smallest |
|
What does the interquartile range show?
|
estimatses some high and low value observations and calculates the range from the remaining values.
It can eliminate some of the outlier problems. |
|
How do you calculate the interquartile range?
|
interquartile range=
3rd quartile - 1st quartile IR = Q3 - Q1 |
|
What is variance?
|
The approximate average of squared deviations of values from the mean. It is directly related to the standard deviation.
|
|
How do you calculate variance?
|
Variance =
(the sum of ((i'th value of the variable x) - mean) / (sample size - 1) |
|
What is standard deviation?
|
The most commonly used meausre of variation.
It is the square root of the variance and has the same units as the orginial data. It shows variation about the mean. |
|
How do you calculate standard deviation?
|
standard deviation =
the square root of variance. |
|
What is the coefficient of variation?
|
The relative measure of variation.
Always in percentage form. Can be used to compare two or more sets of data measured in different units. |
|
How do you calculate the coefficient of variation?
|
Coefficient of variation =
standard deviation/mean |
|
What is a Z score?
|
A measure of relative standing.
The number of standard deviations that given data values are from the mean. |
|
What would a Z score of 2.0 mean?
|
This means that a value is 2.0 standard deviations from the mean.
|
|
How do you calculate the Z score?
|
Z score =
value (x) - mean/standard deviation |
|
What does the shape of a distribution show?
|
It describes how data is distributed.
|
|
What are the measures of shape?
|
1. symmetric
2. skewed |
|
What does a left-skewed distribution show?
|
mean < medain
|
|
What does a symmetric distribution show?
|
mean = median
|
|
What does a right-skewed distribution show?
|
mean > median
|
|
What is the Empirical Rule?
|
It gives the distribution of data values in terms of standard deviations from the mean for bell-shaped distributions.
|
|
What does m(+ or -)10 mean according to the empriical rule?
|
That 68% of the values in the population are within a distance of plus or minus one standard deviation from the mean.
|
|
What does m(+ or -)20 mean according to the empirical rule?
|
That 95% of the values in the population are within a distance of plus or minus two standard deviation from the mean.
|
|
What does m(+ or -)30 mean according to the empirical rule?
|
That 99.7% of the values in the population are within a distance of plus or minus three standard deviation from the mean.
|
|
What is a box-and-whisker plot?
|
It is a graphical display of data using the 5 number summary.
|
|
What is the 5 number summary?
|
1. Minimum
2. Q1 3. Median 4. Q3 5. Maximum |
|
What is a scatter diagram used for?
|
To examine possible relationships between two numerical variables.
|
|
What is the coefficient of correlation?
|
It measures the relative strength of the linear relationship between two variables.
|
|
Which formula is used for the coefficient of correlation?
|
r = cov(X,Y)/SxSy
where: cov(X,Y) is the sum of variance x plus variance y Sx is the standard deviation of variable x Sy is the standard deviation of variable y |
|
What does p = -1 mean?
|
Perfect negative correlation.
It means that the larger values of X tend to paired with the smaller values of Y. |
|
What does p = 0 mean?
|
No correlation.
Larger values of X have no influence of the values of Y. |
|
What does p = +1 mean?
|
Perfect positive correlation.
It means that larger values of X tend to be paired with the larger values of Y. |