Use LEFT and RIGHT arrow keys to navigate between flashcards;
Use UP and DOWN arrow keys to flip the card;
H to show hint;
A reads text to speech;
60 Cards in this Set
- Front
- Back
Define descriptive statistics
|
summarize data w/ charts, graphs, values etc.
|
|
Define inferential statistics
|
make generalizations based on descriptive statistics.
|
|
group of elements for which understanding is desired...
|
population
|
|
set of conditions that repeatedly come together to transform inputs or outputs...
|
process
|
|
subset of a population or process...
|
sample
|
|
number summarizing some aspect of a population or process- descriptive measure of a population...
|
parameter
|
|
some number that summarizes some aspect of a sample-descriptive measure of a sample...
|
statistic
|
|
using sampling info to learn about a population/process...
|
statistical inference
|
|
single characteristic of an object or event...
|
statistical variable
|
|
models can be useful in several ways...
|
-making better decisions
-communication -understanding concepts -identifying relationships |
|
how do you know which concept or technique to apply to a given situation..?
|
1)identify the purpose/objective
2)identify the type of data |
|
the data values are measured by meaningful numbers that tell how much and how many...
|
quantitative data
|
|
data values are placed in classes and have no natural ordering...
|
qualitative data
|
|
the way observations are spread out across a range of values...
|
distribution
|
|
a table that tabulates the number of occurences or counts of a particular variable
|
frequency table
|
|
frequency table group that covers a particular range of values...
|
bin
|
|
chart in which each bar represents a particular bin...
|
histogram
|
|
A cumulative relative frequency must total...
|
1 (one)
|
|
pth percentile...
|
value in a distribution such that p% of the distribution is less than or equal to that value.
|
|
values located at the 25th, 50th, and 75th percentiles are...
|
quartiles
|
|
interquartile range (IQR)
|
central range. contains middle 50% of data
|
|
the sum of the numerical observations divided by the total number of observations...
|
arithmetic mean
|
|
the middle value...
|
median
|
|
most frequently occurring value...
|
mode
|
|
the distance between the largest and smallest values within a data set...
|
range
|
|
Difference between MAD, variance, std. deviation, coefficient of variation
|
?
|
|
What should population and sample be matched with?
|
Parameter and statistic
|
|
do we divide by n-1 for populations or samples?
|
samples
|
|
Do you need the mean to calculate variability? Do you need the variability to calculate the mean?
|
Yes, no
|
|
Explain the significance of the following percentages: 68, 95, 99.7
|
-68% of data values lie within one std. dev of the mean
-95% lie within 2 std devs -99.7% lie within 3 std devs |
|
standard deviation divided by the mean results in..
|
the coefficient of variation
|
|
measure of the lack of symmetry in the distribution of data values...
|
skewness, a skewness of 0 means its symmetric
|
|
measure of heaviness of tails in distribution...
|
kurtosis
|
|
more extreme values then expected...
|
positive kurtosis
|
|
less extreme values than expected...
|
negative kurtosis
|
|
name two values that a boxplot does not show...
|
mean and standard deviation
|
|
... tells us what can happen and with what likelihood
|
probability,
general --> specific |
|
... looks at what happened and tries to generalize about how the world works
|
statistical inference, specific --> general
|
|
is the process of obtaining information through observation or measurement of a phenomenon whose outcomes are not certain
|
random experiment
|
|
the most basic possible outcomes of an experiment, cannot be broken down into smaller occurrences...
|
simple events
|
|
collection of all simple events...
|
sample space, outcomes must be mutually exclusive and exhaustive
|
|
subset of a sample space
|
event
|
|
two events are mutually exclusive if...
|
the occurrence of one event excludes the occurrence of another. i.e., they can't both happen at the same time.
|
|
complement of A is...
|
Not A
|
|
theoretical approach
|
-#of ways of obtaining event/total # of equally liely possible outcomes
-no experiment |
|
Relative frequency
|
#of times an event occurs/# of replications
|
|
subjective assessment
|
-conditions cannot be replicated
-probability represents an individual's judgement |
|
discrete probability distribution
|
consists of whole numbers or counts
|
|
continuous probability distribution
|
theoretically an infinite # of outcomes within a given range
|
|
What are the two requirements for a discrete probability distribution?
|
-probs must be between 0 and 1.
-sum of probs must = 1 |
|
a variable whose values occur at random, following a probability distribution
|
random variable
|
|
what are the two requirements of a random variable
|
-numerical values
-probs associated w/ those values (probs need not be equal) |
|
how do you find the expected value?
|
sigma (x*P(x))
|
|
sample selected in such away that each member of the population has an equal and inpdepenedent chance of being selected
|
simple random sample
|
|
randomly select first, than all chances are equal
|
systematic random sample
|
|
splitting population into non-overlapping subpopulations and draw from each
|
stratified random sample
|
|
self-selected non-random sample
|
ex: calling in, internet
|
|
judgement non-random sample
|
ex: pick target cities
|
|
convenience non-random sample
|
ex: street corner
|
|
advantages to sample over census
|
cost/time savings, accuracy, destruction testing
|