• Shuffle
    Toggle On
    Toggle Off
  • Alphabetize
    Toggle On
    Toggle Off
  • Front First
    Toggle On
    Toggle Off
  • Both Sides
    Toggle On
    Toggle Off
  • Read
    Toggle On
    Toggle Off
Reading...
Front

Card Range To Study

through

image

Play button

image

Play button

image

Progress

1/35

Click to flip

Use LEFT and RIGHT arrow keys to navigate between flashcards;

Use UP and DOWN arrow keys to flip the card;

H to show hint;

A reads text to speech;

35 Cards in this Set

  • Front
  • Back
*

Mean
*
"average", computed by adding all values of the variable in the data set and dividing by the number of observations. It is not resistant!
*

Population Mean
*
Is computed using all the individuals in a population. This is also a parameter; µ - pronounced "mew"
*

Sample Mean
*
pronounced "x-bar"; (bar over an x for equation) is computed using sample data. This is a statistic
*

Median
*
the value that lies in the middle of the data when arranged in ascending order. We use "M" to represent this in equation. It is resistant!
*

Steps of finding Median in Data Set:
*
1.) Arrange data in ascending order
2.) Determine the number of observations, n
3.) Determine the observation in the middle of the data set
a.) If # of observations is odd, the the median is the data value exactly in the middle of the data set. This lays in the n+1/2 position
b.) If the number of observations is even, the median is the mean of the two middle observations in the data set. That is, the median is the mean of the observation that lie in the n/2 position and the n/2 + 1 position
*

Resistant
*
A numerical survey of data is said to be _________ if extreme values (very large or small) relative to the data do not affect its value substantially.
*

Mode
*
the most frequent observation of the variable that occurs in the data set.
*

No Mode
*

If no observations occur more than once in a data set
Bimodal
If a data set has two data values that occur with the highest frequency, the data set is ____________.
Multimodal
If a data set has three or more data values that occur with the highest frequency, the data set is _______________.
*

Dispersion
*

The degree to which the data are spread out
*

Range (R)
*
The difference between the largest and smallest data value
Range (R) Equation
Range = R = largest data value - smallest data value
Deviation about the mean
( xi - µ )
The sum of all deviations about the mean must equal zero
**pg's 132, 133
*

Population Standard Deviation
*
Is the square root of the sum of squared deviations about the population mean divided b the number of observations in the population, N. (*PSD divided by N*)
*

Sample Standard Deviation (s)
*
s, of a variable is the square root of the sum of squared deviations about the sample mean divided by n-1, where n is the sample size
Degrees of Freedom
we call n-1 the _________ ___ ________ because the first n-1 observations have freedom to be whatever value they wish, but the nth value hs no freedom. I must be whatever value forces the sum of the deviations about the mean to equal zero.
*
Determining Standard Deviation Using Technology
Interpretations of Standard Deviation (to know)
(to know) If we are comparing two populations, then the larger the standard deviation, the more dispersion the distribution has, provided that the variable of interest from the two populations has the same unit of measure.
(IT SHOWS DISPERSION)
Variance
The _______ of a variable is the square of the standard deviation.
Population Variance
Sample Variance
*

Empirical Rule (when distribution is bell shaped)
*

Z-score
*
Represents the distance that a data value is from the mean in terms of standard deviations. It is unitless, it has a mean of 0 and a standard deviation of 1
*

Quartiles
*
divide the data sets into fourths, or four equal parts (ie. each 25%)
*

Interquartile Range, IQR
*
is the range of the middle 50% of the observations in a data set. That is, the _____ is the difference between the third and first quartiles. The more spread a set of data has, the higher the _____ will be.
*

Describe the Distribution (must know!)
*

Outliers
*

Extreme observations in a data set
*

Fences
*

Serve as cutoff points for determining outliers
Checking for Outliers by using Quartilels
1.) Determine the first and third quartiles of the data
2.) Compute the IQR
3.) Determine the fences.
Lower fence = Q1 - 1.5(IQR)
Upper fence = Q3 - 1.5(IQR)
4.) If a data value is less than the lower fence or greater than the upper fence, it is considered and outlier
*

Five-Number Summary
*

MINIMUM Q1 M Q3 MAXIMUM
*

Boxplot
*

Another graph that can be created using the five-number summary
*

Steps for Drawing a Box Plot
*
1.) Find the upper and lower fences
2.) Draw horizontal # line with a scale that will accommodate our graph. Draw vertical lines at Q1, M, and Q3. Enclose vertical lines in a box
3.) Label the lower & upper fences ([ and ])
4.) Draw whiskers = the smallest and larges data value inside the fence (horizontal lines to these points)
5.) Identify outliers (denote by asterisk *)
Determining the Mean and Median (Calculator!)
1.) Enter the raw data in L1 by pressing STAT and selecting 1:Edit.
2.) Press STAT, highlight CALC menu, and select 1:1-Var Stats
3.) with 1-Var Stats appearing on the HOME screen, press then 2nd then 1 to insert L1 on the HOME screen. Press ENTER
Determining the Mean and Median (Calculator!)
1.) Enter the class midpoint in L1 and the frequency or relative frequency in L2 by pressing STAT and selecting 1:Edit.
2.) Press STAT, highlight the CALC menu, and select 1:1-Var Stats
3.) With 1-Var Stats appearing on the HOME screen, press 2nd then 1 to insert L1 on the HOME screen. Then press the comma and press 2nd 2 to insert L2 on the HOME screen. So the HOME screen should have the following: 1-Var Stats L1, L2
Press ENTER to obtain the mean and standard deviation.