• Shuffle
    Toggle On
    Toggle Off
  • Alphabetize
    Toggle On
    Toggle Off
  • Front First
    Toggle On
    Toggle Off
  • Both Sides
    Toggle On
    Toggle Off
  • Read
    Toggle On
    Toggle Off
Reading...
Front

Card Range To Study

through

image

Play button

image

Play button

image

Progress

1/17

Click to flip

Use LEFT and RIGHT arrow keys to navigate between flashcards;

Use UP and DOWN arrow keys to flip the card;

H to show hint;

A reads text to speech;

17 Cards in this Set

  • Front
  • Back

Measure of location

Summarising each distribution by finding a measure which describes the size of the values in the distribution - representative of the data set as a whole

Measure of spread

For values that are distributed differently, need to consider how spread out the data is

Measures of location

Mode - most occurring values in the data (m)


Median - 50th percentile (M)


Mean - sum of all values divided by number of values - average

Bimodal

Where multiple values are most common at once - mode ceases to be useful as a ‘typical’ value

Advantages and disadvantages of the Mode

Advantages:


- most frequent value may be most useful measure eg. Majority opinion


- very easy to evaluate


- used with non numeric data


- open ended distributions don’t affect finding it


Disadvantages:


- appropriate value might not exist eg all values different


- if bimodal, no longer useful


- could be extreme value in a skewed distribution and then is unsuitable


- can not be used in further calculations


- not involve all data points


- small changes in values not reflected by small changes in mode


- ^ descriptive way not analytical

Advantages and disadvantages- median

Advantages:


- useful to know value below which 50% of distribution lies


- value of median not affecting by outlying values or open ended distributions


Disadvantages:


- doesn’t involve all data points


- small changes in values not reflected proportionally by small changes in median


- medians from separate distributions can’t combine


- can not be used in further calculations

Advantages and disadvantages of the mean

Advantages:


- all values in data set used


- possible to pool data from multiple sets to calculate overall mean


- further used in statistical analysis


- can be used for reflecting small changes in data set


Disadvantages:


- affected by outlying values


- difficult to interpret for certain scenarios

Excel functions

=MODE() : calculates mode


=MEDIAN() : calculates median


=AVERAGE() : calculates mean


=QUARTILE() : calculates quartile

Weighted mean

Back (Definition)

Measure of dispersion - Range

Range = largest value - smallest value


Easy to calculate and easy to understand


But uses only 2 values and can be subject to distortion

Measure of dispersion

Measures of the scatter, spread, variability or unpredictability of the data values relative to a measure of location

Measure of dispersion - Quartiles

Quartile deviation - (Q3 - Q1)/2

Standard deviation

Find mean


Find difference between every mean figure and actual figures (difference)


Square these figures (d^2)


Add them up


Divide by number of values


Square root this number for the standard deviation

What does a standard deviation figure tell us

If standard deviation smaller, it means figures closer to the mean - less outliers or varied figures

Coefficient of variation

Expresses standard deviation as a percentage of mean


= (Standard deviation / mean ) x100%

Positive skew

Mean > median > mode

Negatively skewed

Mean < median < mode