• Shuffle
    Toggle On
    Toggle Off
  • Alphabetize
    Toggle On
    Toggle Off
  • Front First
    Toggle On
    Toggle Off
  • Both Sides
    Toggle On
    Toggle Off
  • Read
    Toggle On
    Toggle Off
Reading...
Front

Card Range To Study

through

image

Play button

image

Play button

image

Progress

1/57

Click to flip

Use LEFT and RIGHT arrow keys to navigate between flashcards;

Use UP and DOWN arrow keys to flip the card;

H to show hint;

A reads text to speech;

57 Cards in this Set

  • Front
  • Back
Branch of applied mathematics that deals with collecting, organizing & interpreting data using well-defined procedures in order to make decisions....
Statistics
an estimate of the population, calculated from data in a sample drawn from the population...
Statistic
the term used to describe a characteristic of a population...
parameter
____ data types can only take on certain values.
Discrete data types.
ex:
-Nominal (Categorical)
-Ordinal
____ data types can take on (almost) any value; often reported using specific values (ie. BP)
Continuous:
ex:
-Interval
-Ratio
Nominal (categorical) Data =
non-orderable qualitative discrete categories

ex: race, ethnicity, gender
Ordinal Data =
reflects a "greater or lesser" degree of something, or may reflect a "precedes" or "superior" concept.

-numbers or discrete categories can be placed in a meaningful order
-interval between categories cannot be assumed to be equal.

ex: SES 1= low, 2= medium 3= high
2 types of Ordinal Data
Ordered-categories Data
-ex: cancer stages
Rank-ordered Data
-ex: committee proposals 1st, 2nd, 3rd, in order of proposal, not importance.
2 types of Continuous Data
Interval Data
-data ordered in a logical sequence.
-EQUAL intervals
-isnt necessarily an absolute zero point. (e.g. IQ scores)

Ratio Data
- # continuous with equal intervals between them
-has a meaningful zero point. e.g. BP, income.
4 common types of Descriptive Stats
1) frequency distribution tables/graphs
2) central tendencies (mean, median, mode)
3) variability (range, SD)
4) Z-scores
Advantages and disadvantages of Frequency Distributions
Advantages:
presents entire set of scores

Disadvantages:
can be complex with large sets of data
3 Descriptors of Central Tendency
Mode
Median
Mean
Variability definition
The degree to which scores in a distribution are spread out or dispersed.
-Homogeneity= little variability
-Heterogeneity= great variabilty
Deviation Score =
How far any element in a distribution is from the mean

i.e. mean of 16, an element with a value of 23 has a deviation score of 7
Variance (mean square) =
mean of the squares of all the deviation scores (eliminates “-” sign)
Standard Deviation =
Square root of variance.
- % of elements in a normal distribution is constant for a given # of SDs above or below the mean
Purpose of Z scores
to identify the precise location of a specific value within a distribution by using a single number
-Indicates how many standard deviations there are between the score and the mean; can use this to compare scores from different ways of measuring the same thing (i.e. SAT vs. ACT scores
Z scores=
# of SDs away from the mean
Used to make inferences about the population based on sample data
Inferential Stats

*Done by "Stat-hypothesis-testing"
Standard Error of the Mean (SEM) =
SD of the "Sampling Distribution of the Means".

ex: % women in SBU PA program compared to % women in ALL PA programs
Confidence Interval
used to state how "confident" we are that the mean for the SAMPLE we have chosen is close to the mean for the WHOLE population.
- % of values under each part of the normal curve. (within 2 SD = 95% CI)
Z scores vs. T scores
Z- uses data from a single sample to test a hypothesis about the population mean in situations where the population SD is KNOWN (not the usual)

T- uses data from sample(s) to test a hypothesis about a population in situations where the population SD is UNKNOWN (we have to use an estimated SEM)
ANOVA
analysis of variance:
Used when more than one comparison is being made
Ex - means of more than two groups are compared: placebo and two different drugs.

Measured by F ratio=
Varianc btwn groups/variance within groups.
-type of parametric test
3 Examples of Non-parametric tests
1) Chi square
2) mann-whitney U test
3) Wilcoxon T test
Used when frequency data classifies individuals with a given outcome. Differences between expected and observed.
Chi Square
Formula for Chi Square=
= sum of all (F observed- F expected)^2 / F expected
In Chi Squares, the larger the value, the more likely the value is...
NOT due to chance.
Mann Whiteny U test
uses ordinal data (rank orders) from two separate samples to test a hypothesis about the difference between 2 populations or 2 treatment conditions
Wilcoxon T test
uses the data from a repeated measures or matched-samples design to evaluate the difference between 2 tx conditions (used as an alternative to the related-sample t test in situations where the data can be rank ordered but do not satisfy the more stringent t test requirements)
Correlation
used to establish and quantify the STRENGTH and DIRECTION of relationship between two variables
Regression
used to express the FUNCTIONAL relationship between two variables
Pearson Correlation
measures the degree of linear relation between 2 variables. (+ or -) indicates the direction. The magnitude (0 to 1) indicates the degree to which the data points fit on a straight line (interval or ratio data)
2 applications of th Pearson Correlation
Point-Biserial Correlation: used when one variable is dichotomous (i.e., has only 2 values; e.g., yes/no) and the second variable is measured on an interval or ratio scale

Phi-Coefficient: both variables are dichotomous.
Spearman Correlation
measures the degree to which the relationship between two variables is one-directional. Used when both variables (X and Y) are ranks measured on an ordinal scale
Null Hypothesis definition
(HO), usually states that there is NO difference between two (or more) populations or that two (or more) variables will NOT be correlated
Type 1 error
rejection of a null hypothesis when it should not be rejected—a false positive study result
risk is controlled by the level of significance (alpha), usually set at .05 or .01 (likelihood a result has occurred by chance)


*IMPORTANT
P value
probability of making a Type I error if Ho is rejected.
Type II error
failure to reject a null hypothesis when it should be rejected—a false negative study result (results were, in fact, NOT due to chance, but were related to the study variable)
Defined as β, and is usually set at 0.20
Power
ability of a study to detect an outcome of interest; the probability that a false null hypothesis will actually be rejected
-Power is defined as (1-β), and is usually set at .80 (80%)
-should ALWAYS be done before a study
-Tells you how large a sample you need to detect a specific amount of change (difference)
T or F: The Null hypothesis is either rejected or accepted.
FALSE. The null hypothesis is either rejected or NOT rejected.
It is NOT accepted.
To get more Power you must...
increase the sample size, collect higher-level data and/or do a paired design.
______ statistics involves variables measured on a nominal or ordinal scale.
Non Parametric Stats
proportion of patients that HAVE disease (same as study pre-test probability)
Prevalence
proportion of patients WITH the condition who test positive for the condition
Sensitivity
Specificity
proportion of patients WITHOUT the condition whose test results are NEGATIVE for the condition.
Positive Predictive Value (PPV)
proportion of patients with a POSITIVE test who HAVE the condition.
Negative Predictive Value (NPV)
proportion of patients with a NEGATIVE test who DO NOT have the condition
Likelihood ratio:
proportion of patients WITH disease who have a particular finding divided by the proportion of patients WITHOUT disease who have the same finding
Shows how much the odds of disease are increased if the test result is positive
Positive likelihood ratio
LR+
Negative likelihood ratio LR-
shows how much the odds of disease are decreased if the test result is negative.
Prevalence=
all patients with disease/
total population
Sensitivity=
true +/ all diseased
Specificity=
true - / all without disease
PPV=
true + / all +
NPV=
true - / all -
LR+ =
Sensitivity/ (1-specificity)
LR- =
(1-sensitivity)/ specificity