Use LEFT and RIGHT arrow keys to navigate between flashcards;
Use UP and DOWN arrow keys to flip the card;
H to show hint;
A reads text to speech;
23 Cards in this Set
- Front
- Back
Statistics
|
the scientific discipline that provides methods to help us make sense of data.
|
|
steps in the Data analysis process
|
1. Understand the nature of the problem - goal, ?s trying to answer
2. Deciding what to measure and how to measure it/ what info is needed to answer the ?s 3. Data collection - existing or new; careful plan 4. Data summarization and preliminary analysis - numerically and graphically 5. Formal data analysis - apply statistical methods 6. Interpretation of results |
|
3 critical tasks in Statistics
|
1. collecting data
2. summarizing data 3. analyzing data |
|
Population of interest
|
the entire collection of individuals about which info is desired
|
|
Sample
|
a subset of the population selected for study.
|
|
Descriptive Statistics
|
methods for organizing and summarizing data: tables, graphs, numerical summaries
|
|
Inferential statistics
|
generalizing from a sample to the population from which it was selected and assessing the reliability of such generalizations
|
|
Variable
|
any characteristic whose value may change from one individual to another.
|
|
Data
|
the result of making observations on one or more variables
|
|
univariate data set
|
data set consisting of observations on a single characteristic
|
|
categorical/qualitative data
|
identifies basic differentiating characteristics of the population
|
|
bivariate data
|
2 variables considered at the same time.
|
|
2 types of data
|
1) categorical 2) numerical
|
|
2 types of numerical data
|
1. discrete - each possible value corresponds to isolated points on a number line. COUNTING
2. continuous - the plausible set of values is an interval on the number line. MEASURING |
|
Frequency distribution - use
|
Used for categorical data
a table that displays the possible categories along with the associated frequencies and/or relative frequencies |
|
Frequency
|
the number of times the category appears in the data set
|
|
relative frequency - how calculated
|
the proportion of the observations that belong to that category.
= Frequency/ # of observations in the data set |
|
bar chart
|
a graph of a frequency distribution of categorical data
|
|
How to construct a bar chart
|
- horizontal axis - category names
- vertical axis - frequency/relative frequency - rectangular bar above each category that is the same width |
|
What to look for in a bar chart
|
frequently and infrequently occurring categories
|
|
When to use a dotplot
|
with small numerical data sets
|
|
how to construct a dotplot
|
- horizontal line - mark with an appropriate measurement scale
- represent each value by a dot. Stack the dots vertically if a value has more than one observation. |
|
Dotplots convey info about:
|
1) a representative or typical value
2) the extent to which the data values spread out 3) the nature of the distribution of values along the number line 4) the presence of unusual values |