Use LEFT and RIGHT arrow keys to navigate between flashcards;
Use UP and DOWN arrow keys to flip the card;
H to show hint;
A reads text to speech;
45 Cards in this Set
- Front
- Back
study population
|
the population of individuals from which samples are obtained for inclusion in an investigation
|
|
sample
|
a subset of all possible measurements from a population
|
|
inferential statistics
|
-test hypotheses
-demonstrate an association-make population comparisons |
|
descriptive statistics
|
describe and summarize sample characteristic
|
|
gaussian distribution
|
-the most commonly assumed population distribution used in statistics
-emipirical rule: bell curve |
|
normal distribuition data
|
1SD: 68%
2SD: 95% 3SD: 99.7% |
|
standard error
|
-a measure of sampling error
-the spread in the mean that would be if ALL possible samples of the same size as the one actually obtained were drawn from the population -used to determine confidence intervals |
|
kurtosis
|
- without changing the mean
- vertical distortion |
|
leptokurtosis
|
- high middle
|
|
platykurtosis
|
- lower middle
- can be flat |
|
negative skew
|
higher data values predominate
|
|
postivie skew
|
lower data values predominate
|
|
statistics
|
- values calculated from sample data used to estimate a value or parameter in a larger population
|
|
point estimate
|
a single statistic calculated from sample observations used to estimate the numerical value of a particular population parameter
|
|
statistical significance testing
|
tells us how much statistical determinations are expected to vary due to chance variations in random sample
|
|
two-tailed test
|
- investigator can't be sure that the population's parameter is greater or less than the null hypothesis
- most common |
|
one-tailed test
|
used when the direction of the relationship being studed is known and analysis is only concerned with determining the strength of the relationship
- greater statistical power |
|
dependent variables
|
- the study outcome or endpoint
- usual approach: analyze one such variable at a time |
|
independent variable(s)
|
- variable(s) measured to estimate the corresponding measurement of the dependent variable
- defines the conditions under which the dependent variable is to be measured |
|
univariable analysis
|
- 1 dependent variable, no independent variable
- estimation of the probability of a disease without regard to any characteristic such as age, gender, diet |
|
bivariable analysis
|
- 1 dependent variable, 1 independent variable
- determining the if there is an assciation between birth control pill use and stroke |
|
multivariable analysis
|
- 1 dependent variable, >1 independent variable
- determining the likelihood of stroke for femails, of different ages and smoking status - also used to adjust for confoudning variables |
|
continuous data
|
- the range of measurement is any number of equally spaced numerical values between any two points
- measure on ratio or interval scales - a great number of possible values exist - ex. serum cholesterol, age, weight - wide range of values |
|
ordinal data
|
- discrete data
- the interval between consecutive measurements is not necessarily known or constant - ordered sequence - ex. number of pregancies, scoring systems, mammogram interpretation, cancer staging - limited number of possible variables |
|
nominal data
|
- data which is not ordered is measured on a nominal scale
- ex. eye color, gender - the number of nominal variables is the number of categories minus one |
|
rescaling of data
|
- age to age range to young, old description
- shifting down results in the loss of information - statistical power reduced, increasing the likelihood of type II error |
|
Uses of univariable analysis
|
1.) Descriptive studies - clinical and lab results on a group of diseased patients
2.) Descriptive measurements of the study sample 3.) Comparison, using 2 measurements, with the same or paired individuals: a special situation - a hypothesis may be tested in some cases, no independent variable *ex. a blood pressure determination made twice on the same individual, or on two paired similar individuals |
|
no independent variable ->
one continuous dependent variable -> |
mean is point estimate -> student's t-test is statistical method
|
|
no independent variable -> one ordinal dependent variable ->
|
median is point estimate -> Wilcoxon Signed rank test
|
|
no independent variable -> one nominal dependent variable ->
|
"regular" proportion -> binomial distribution
rare -> proportion -> Poisson distribution |
|
natural sampling
|
- a random, unbiased, sampling from the population
- The distribution of values for independent and dependent variable is representative of the population distribution |
|
purposive sampling
|
- The researcher determines and therefore defines the sample distribution
- The distribution of values for the independent variable is not representative of the population |
|
one continuous dependent variable -> one nominal independent variable
|
difference between means is estimation -> student's t-test is inference
|
|
one continuous dependent variable -> one continuous independent variable from naturalistic or purposive sample
|
slope and intercept is estimations -> regression analysis -> F test is inference
|
|
one continuous dependent variable -> one continuous independent variable from naturalistic sample
|
correlation coefficient -> correlation analysis -> student's t-test
|
|
least squares regression analysis
|
- Defines a line using a formula which minimizes the sum of the squared differences between the sample data and those estimated by the regression equation
- Y = a + bX - Slope indicates how a change in the independent variable affects a change in the dependent variable - Intercept is the mean of the dependent variable when the independent variable =0 - Slope and Intercept are point estimates - Provides estimate of dependent variable values from independent variable values - Regression analysis does NOT indicate the strength of a relationship in the population |
|
correlation analysis
|
- Requires naturalistic sample, continuous independent and dependent variables
- Determines how independent and dependent variables change together - Covariance |
|
Pearson's correlation coefficient
|
- equals r
- The point estimate of the strength of association between two continuous variables - Results range: -1 to +1 |
|
coefficient of determination
|
- r^2
- Represents the amount of variation in the dependent variable that can be explained by the change in the independent variable |
|
advantages of multivariable analysis
|
1.) Study the relationships between a dependent variable and an independent variable while adjusting for the effect of other independent variables
2.) Perform statistical significance testing on several variables with a chosen value for Type 1 error 3.) One procedure for estimation, inference, and adjustment |
|
analysis of variance
|
An analysis of the variation in the outcomes of an experiment to assess the contribution of each variable to the variation
|
|
covariance
|
Statistic which estimates how closely a dependent and independent variable change together
|
|
ANOVA
|
analysis of variance
|
|
Multiple regression and correlation analysis
|
Dependent variable predicted by two or more independent variables
|
|
ANCOVA
|
analysis of covariance
|