Use LEFT and RIGHT arrow keys to navigate between flashcards;
Use UP and DOWN arrow keys to flip the card;
H to show hint;
A reads text to speech;
33 Cards in this Set
- Front
- Back
univariate data sets
|
one variable
|
|
bivariate data sets (binary)
|
two variables
|
|
multivariate data sets
|
more than two
|
|
categorical data
|
qualitative data , described by words
|
|
numerical data
|
quantitative data, numbers
|
|
discrete
|
variable with countable number of distinct values . 37
|
|
continuous
|
any value within an interval. physical measurements (lenght, width, time, speed), and financial variables (sales) 423.67
|
|
observation
|
subject: 1 person had this
1 2 3 4 or male female |
|
variable
|
charactristic of a subject or individual, such as employee income
|
|
nominal data
|
identitfy a category, same as qualitative, categorical, or classification data.
|
|
ordical data
|
natural order of something. rankuing of data values. usually code nominal data numerically
1= strongly agree, 2= agree 3= disagree (likert scale) |
|
Interval data
|
Absensce of meaningful 0.distances. likert scale. Ex: temperature
|
|
ratio data
|
meaningful 0 that represents the absence of the quantity being measured.
|
|
Time series data
|
the periodicity is the time between observation, can be annual, quarterly, monthy, weekly, daily.
|
|
Cross sectional data
|
if each observation represents a different individual unit (a person, firm, area) at the same point in time.
Variation among observations in relationships with other observations. |
|
Central tendency
|
where are the data values concentrated? what seem to be typical or middle data values?
|
|
dispersion
|
how much variation is there in the data? How spread out are the data values? are there unusual values?
|
|
shape
|
are the data values distributed symmetrically? skewed? sharply peaked? flat? bimodal?
|
|
Frequency distribution
|
a table formed by classifying n data values into k classes called bins.
|
|
Sturges' Rule
|
An equation to choose the number of bins needed. guideline proposed that says that everytime we double the sample size, we should add one bin.
|
|
Sturges' Rule
|
k= 1 + 3.3log(n)
n=sample size |
|
Bin Limits
|
Bin width = xmax- xmin / k
K is the number of bins needed |
|
Histogram
|
graphical rep of a frequency distribution.
|
|
Modal class
|
a histogram with a bar that is higher than those on either side
|
|
Unimodal
|
histogram with a single modal class
|
|
Bimodal
|
histogram with two modal class.
|
|
Multimodal
|
histogram with two or more modal classes.
|
|
Symmertic
|
In shape. If neither tale is longer than its symmetric
|
|
Right-skewed
|
positively skwed. histogram has a longer right taile, with most data values clustered on the let side.
|
|
Left-skewed
|
negatively skewed. histogram has a longer left tail, with most data values clustered on the right side.
|
|
line chart
|
display a time series, to spot trends, or to compare time periods. Used to display several variables at once. times series. time on x, numerical on y.
|
|
scatter plot
|
starting point for bivariate data analysis. invesitigate relationship between tow variables.
|
|
law of large numbers
|
as the number of trials increases, the empirical probabilty approaches its theoretical limit
|