Use LEFT and RIGHT arrow keys to navigate between flashcards;
Use UP and DOWN arrow keys to flip the card;
H to show hint;
A reads text to speech;
57 Cards in this Set
- Front
- Back
Branch of applied mathematics that deals with collecting, organizing & interpreting data using well-defined procedures in order to make decisions....
|
Statistics
|
|
an estimate of the population, calculated from data in a sample drawn from the population...
|
Statistic
|
|
the term used to describe a characteristic of a population...
|
parameter
|
|
____ data types can only take on certain values.
|
Discrete data types.
ex: -Nominal (Categorical) -Ordinal |
|
____ data types can take on (almost) any value; often reported using specific values (ie. BP)
|
Continuous:
ex: -Interval -Ratio |
|
Nominal (categorical) Data =
|
non-orderable qualitative discrete categories
ex: race, ethnicity, gender |
|
Ordinal Data =
|
reflects a "greater or lesser" degree of something, or may reflect a "precedes" or "superior" concept.
-numbers or discrete categories can be placed in a meaningful order -interval between categories cannot be assumed to be equal. ex: SES 1= low, 2= medium 3= high |
|
2 types of Ordinal Data
|
Ordered-categories Data
-ex: cancer stages Rank-ordered Data -ex: committee proposals 1st, 2nd, 3rd, in order of proposal, not importance. |
|
2 types of Continuous Data
|
Interval Data
-data ordered in a logical sequence. -EQUAL intervals -isnt necessarily an absolute zero point. (e.g. IQ scores) Ratio Data - # continuous with equal intervals between them -has a meaningful zero point. e.g. BP, income. |
|
4 common types of Descriptive Stats
|
1) frequency distribution tables/graphs
2) central tendencies (mean, median, mode) 3) variability (range, SD) 4) Z-scores |
|
Advantages and disadvantages of Frequency Distributions
|
Advantages:
presents entire set of scores Disadvantages: can be complex with large sets of data |
|
3 Descriptors of Central Tendency
|
Mode
Median Mean |
|
Variability definition
|
The degree to which scores in a distribution are spread out or dispersed.
-Homogeneity= little variability -Heterogeneity= great variabilty |
|
Deviation Score =
|
How far any element in a distribution is from the mean
i.e. mean of 16, an element with a value of 23 has a deviation score of 7 |
|
Variance (mean square) =
|
mean of the squares of all the deviation scores (eliminates “-” sign)
|
|
Standard Deviation =
|
Square root of variance.
- % of elements in a normal distribution is constant for a given # of SDs above or below the mean |
|
Purpose of Z scores
|
to identify the precise location of a specific value within a distribution by using a single number
-Indicates how many standard deviations there are between the score and the mean; can use this to compare scores from different ways of measuring the same thing (i.e. SAT vs. ACT scores |
|
Z scores=
|
# of SDs away from the mean
|
|
Used to make inferences about the population based on sample data
|
Inferential Stats
*Done by "Stat-hypothesis-testing" |
|
Standard Error of the Mean (SEM) =
|
SD of the "Sampling Distribution of the Means".
ex: % women in SBU PA program compared to % women in ALL PA programs |
|
Confidence Interval
|
used to state how "confident" we are that the mean for the SAMPLE we have chosen is close to the mean for the WHOLE population.
- % of values under each part of the normal curve. (within 2 SD = 95% CI) |
|
Z scores vs. T scores
|
Z- uses data from a single sample to test a hypothesis about the population mean in situations where the population SD is KNOWN (not the usual)
T- uses data from sample(s) to test a hypothesis about a population in situations where the population SD is UNKNOWN (we have to use an estimated SEM) |
|
ANOVA
|
analysis of variance:
Used when more than one comparison is being made Ex - means of more than two groups are compared: placebo and two different drugs. Measured by F ratio= Varianc btwn groups/variance within groups. -type of parametric test |
|
3 Examples of Non-parametric tests
|
1) Chi square
2) mann-whitney U test 3) Wilcoxon T test |
|
Used when frequency data classifies individuals with a given outcome. Differences between expected and observed.
|
Chi Square
|
|
Formula for Chi Square=
|
= sum of all (F observed- F expected)^2 / F expected
|
|
In Chi Squares, the larger the value, the more likely the value is...
|
NOT due to chance.
|
|
Mann Whiteny U test
|
uses ordinal data (rank orders) from two separate samples to test a hypothesis about the difference between 2 populations or 2 treatment conditions
|
|
Wilcoxon T test
|
uses the data from a repeated measures or matched-samples design to evaluate the difference between 2 tx conditions (used as an alternative to the related-sample t test in situations where the data can be rank ordered but do not satisfy the more stringent t test requirements)
|
|
Correlation
|
used to establish and quantify the STRENGTH and DIRECTION of relationship between two variables
|
|
Regression
|
used to express the FUNCTIONAL relationship between two variables
|
|
Pearson Correlation
|
measures the degree of linear relation between 2 variables. (+ or -) indicates the direction. The magnitude (0 to 1) indicates the degree to which the data points fit on a straight line (interval or ratio data)
|
|
2 applications of th Pearson Correlation
|
Point-Biserial Correlation: used when one variable is dichotomous (i.e., has only 2 values; e.g., yes/no) and the second variable is measured on an interval or ratio scale
Phi-Coefficient: both variables are dichotomous. |
|
Spearman Correlation
|
measures the degree to which the relationship between two variables is one-directional. Used when both variables (X and Y) are ranks measured on an ordinal scale
|
|
Null Hypothesis definition
|
(HO), usually states that there is NO difference between two (or more) populations or that two (or more) variables will NOT be correlated
|
|
Type 1 error
|
rejection of a null hypothesis when it should not be rejected—a false positive study result
risk is controlled by the level of significance (alpha), usually set at .05 or .01 (likelihood a result has occurred by chance) *IMPORTANT |
|
P value
|
probability of making a Type I error if Ho is rejected.
|
|
Type II error
|
failure to reject a null hypothesis when it should be rejected—a false negative study result (results were, in fact, NOT due to chance, but were related to the study variable)
Defined as β, and is usually set at 0.20 |
|
Power
|
ability of a study to detect an outcome of interest; the probability that a false null hypothesis will actually be rejected
-Power is defined as (1-β), and is usually set at .80 (80%) -should ALWAYS be done before a study -Tells you how large a sample you need to detect a specific amount of change (difference) |
|
T or F: The Null hypothesis is either rejected or accepted.
|
FALSE. The null hypothesis is either rejected or NOT rejected.
It is NOT accepted. |
|
To get more Power you must...
|
increase the sample size, collect higher-level data and/or do a paired design.
|
|
______ statistics involves variables measured on a nominal or ordinal scale.
|
Non Parametric Stats
|
|
proportion of patients that HAVE disease (same as study pre-test probability)
|
Prevalence
|
|
proportion of patients WITH the condition who test positive for the condition
|
Sensitivity
|
|
Specificity
|
proportion of patients WITHOUT the condition whose test results are NEGATIVE for the condition.
|
|
Positive Predictive Value (PPV)
|
proportion of patients with a POSITIVE test who HAVE the condition.
|
|
Negative Predictive Value (NPV)
|
proportion of patients with a NEGATIVE test who DO NOT have the condition
|
|
Likelihood ratio:
|
proportion of patients WITH disease who have a particular finding divided by the proportion of patients WITHOUT disease who have the same finding
|
|
Shows how much the odds of disease are increased if the test result is positive
|
Positive likelihood ratio
LR+ |
|
Negative likelihood ratio LR-
|
shows how much the odds of disease are decreased if the test result is negative.
|
|
Prevalence=
|
all patients with disease/
total population |
|
Sensitivity=
|
true +/ all diseased
|
|
Specificity=
|
true - / all without disease
|
|
PPV=
|
true + / all +
|
|
NPV=
|
true - / all -
|
|
LR+ =
|
Sensitivity/ (1-specificity)
|
|
LR- =
|
(1-sensitivity)/ specificity
|