Use LEFT and RIGHT arrow keys to navigate between flashcards;
Use UP and DOWN arrow keys to flip the card;
H to show hint;
A reads text to speech;
17 Cards in this Set
- Front
- Back
Measure of location |
Summarising each distribution by finding a measure which describes the size of the values in the distribution - representative of the data set as a whole |
|
Measure of spread |
For values that are distributed differently, need to consider how spread out the data is |
|
Measures of location |
Mode - most occurring values in the data (m) Median - 50th percentile (M) Mean - sum of all values divided by number of values - average |
|
Bimodal |
Where multiple values are most common at once - mode ceases to be useful as a ‘typical’ value |
|
Advantages and disadvantages of the Mode |
Advantages: - most frequent value may be most useful measure eg. Majority opinion - very easy to evaluate - used with non numeric data - open ended distributions don’t affect finding it Disadvantages: - appropriate value might not exist eg all values different - if bimodal, no longer useful - could be extreme value in a skewed distribution and then is unsuitable - can not be used in further calculations - not involve all data points - small changes in values not reflected by small changes in mode - ^ descriptive way not analytical |
|
Advantages and disadvantages- median |
Advantages: - useful to know value below which 50% of distribution lies - value of median not affecting by outlying values or open ended distributions Disadvantages: - doesn’t involve all data points - small changes in values not reflected proportionally by small changes in median - medians from separate distributions can’t combine - can not be used in further calculations |
|
Advantages and disadvantages of the mean |
Advantages: - all values in data set used - possible to pool data from multiple sets to calculate overall mean - further used in statistical analysis - can be used for reflecting small changes in data set Disadvantages: - affected by outlying values - difficult to interpret for certain scenarios |
|
Excel functions |
=MODE() : calculates mode =MEDIAN() : calculates median =AVERAGE() : calculates mean =QUARTILE() : calculates quartile |
|
Weighted mean |
Back (Definition) |
|
Measure of dispersion - Range |
Range = largest value - smallest value Easy to calculate and easy to understand But uses only 2 values and can be subject to distortion |
|
Measure of dispersion |
Measures of the scatter, spread, variability or unpredictability of the data values relative to a measure of location |
|
Measure of dispersion - Quartiles |
Quartile deviation - (Q3 - Q1)/2 |
|
Standard deviation |
Find mean Find difference between every mean figure and actual figures (difference) Square these figures (d^2) Add them up Divide by number of values Square root this number for the standard deviation |
|
What does a standard deviation figure tell us |
If standard deviation smaller, it means figures closer to the mean - less outliers or varied figures |
|
Coefficient of variation |
Expresses standard deviation as a percentage of mean = (Standard deviation / mean ) x100% |
|
Positive skew |
Mean > median > mode |
|
Negatively skewed |
Mean < median < mode |