Set the Language

We weren't able to detect the audio language on your flashcards. Please select the correct language below.

Front

Back

Flashcards
»
Statistics

Statistics

by rlc122009, Dec. 2009

Subjects: statistics

Favorite

Add to folder

Flag

Shuffle
Toggle On

Toggle Off
Alphabetize
Toggle On

Toggle Off
Front First
Toggle On

Toggle Off
Both Sides
Toggle On

Toggle Off
Read
Toggle On

Toggle Off

Reading...

Front

Card Range To Study

through

Play button

Progress

1/57

Click to flip

Use LEFT and RIGHT arrow keys to navigate between flashcards;

Use UP and DOWN arrow keys to flip the card;

H to show hint;

A reads text to speech;

57 Cards in this Set

Front
Back

	Branch of applied mathematics that deals with collecting, organizing & interpreting data using well-defined procedures in order to make decisions....	Statistics
	an estimate of the population, calculated from data in a sample drawn from the population...	Statistic
	the term used to describe a characteristic of a population...	parameter
	____ data types can only take on certain values.	Discrete data types. ex: -Nominal (Categorical) -Ordinal
	____ data types can take on (almost) any value; often reported using specific values (ie. BP)	Continuous: ex: -Interval -Ratio
	Nominal (categorical) Data =	non-orderable qualitative discrete categories ex: race, ethnicity, gender
	Ordinal Data =	reflects a "greater or lesser" degree of something, or may reflect a "precedes" or "superior" concept. -numbers or discrete categories can be placed in a meaningful order -interval between categories cannot be assumed to be equal. ex: SES 1= low, 2= medium 3= high
	2 types of Ordinal Data	Ordered-categories Data -ex: cancer stages Rank-ordered Data -ex: committee proposals 1st, 2nd, 3rd, in order of proposal, not importance.
	2 types of Continuous Data	Interval Data -data ordered in a logical sequence. -EQUAL intervals -isnt necessarily an absolute zero point. (e.g. IQ scores) Ratio Data - # continuous with equal intervals between them -has a meaningful zero point. e.g. BP, income.
	4 common types of Descriptive Stats	1) frequency distribution tables/graphs 2) central tendencies (mean, median, mode) 3) variability (range, SD) 4) Z-scores
	Advantages and disadvantages of Frequency Distributions	Advantages: presents entire set of scores Disadvantages: can be complex with large sets of data
	3 Descriptors of Central Tendency	Mode Median Mean
	Variability definition	The degree to which scores in a distribution are spread out or dispersed. -Homogeneity= little variability -Heterogeneity= great variabilty
	Deviation Score =	How far any element in a distribution is from the mean i.e. mean of 16, an element with a value of 23 has a deviation score of 7
	Variance (mean square) =	mean of the squares of all the deviation scores (eliminates “-” sign)
	Standard Deviation =	Square root of variance. - % of elements in a normal distribution is constant for a given # of SDs above or below the mean
	Purpose of Z scores	to identify the precise location of a specific value within a distribution by using a single number -Indicates how many standard deviations there are between the score and the mean; can use this to compare scores from different ways of measuring the same thing (i.e. SAT vs. ACT scores
	Z scores=	# of SDs away from the mean
	Used to make inferences about the population based on sample data	Inferential Stats *Done by "Stat-hypothesis-testing"
	Standard Error of the Mean (SEM) =	SD of the "Sampling Distribution of the Means". ex: % women in SBU PA program compared to % women in ALL PA programs
	Confidence Interval	used to state how "confident" we are that the mean for the SAMPLE we have chosen is close to the mean for the WHOLE population. - % of values under each part of the normal curve. (within 2 SD = 95% CI)
	Z scores vs. T scores	Z- uses data from a single sample to test a hypothesis about the population mean in situations where the population SD is KNOWN (not the usual) T- uses data from sample(s) to test a hypothesis about a population in situations where the population SD is UNKNOWN (we have to use an estimated SEM)
	ANOVA	analysis of variance: Used when more than one comparison is being made Ex - means of more than two groups are compared: placebo and two different drugs. Measured by F ratio= Varianc btwn groups/variance within groups. -type of parametric test
	3 Examples of Non-parametric tests	1) Chi square 2) mann-whitney U test 3) Wilcoxon T test
	Used when frequency data classifies individuals with a given outcome. Differences between expected and observed.	Chi Square
	Formula for Chi Square=	= sum of all (F observed- F expected)^2 / F expected
	In Chi Squares, the larger the value, the more likely the value is...	NOT due to chance.
	Mann Whiteny U test	uses ordinal data (rank orders) from two separate samples to test a hypothesis about the difference between 2 populations or 2 treatment conditions
	Wilcoxon T test	uses the data from a repeated measures or matched-samples design to evaluate the difference between 2 tx conditions (used as an alternative to the related-sample t test in situations where the data can be rank ordered but do not satisfy the more stringent t test requirements)
	Correlation	used to establish and quantify the STRENGTH and DIRECTION of relationship between two variables
	Regression	used to express the FUNCTIONAL relationship between two variables
	Pearson Correlation	measures the degree of linear relation between 2 variables. (+ or -) indicates the direction. The magnitude (0 to 1) indicates the degree to which the data points fit on a straight line (interval or ratio data)
	2 applications of th Pearson Correlation	Point-Biserial Correlation: used when one variable is dichotomous (i.e., has only 2 values; e.g., yes/no) and the second variable is measured on an interval or ratio scale Phi-Coefficient: both variables are dichotomous.
	Spearman Correlation	measures the degree to which the relationship between two variables is one-directional. Used when both variables (X and Y) are ranks measured on an ordinal scale
	Null Hypothesis definition	(HO), usually states that there is NO difference between two (or more) populations or that two (or more) variables will NOT be correlated
	Type 1 error	rejection of a null hypothesis when it should not be rejected—a false positive study result risk is controlled by the level of significance (alpha), usually set at .05 or .01 (likelihood a result has occurred by chance) *IMPORTANT
	P value	probability of making a Type I error if Ho is rejected.
	Type II error	failure to reject a null hypothesis when it should be rejected—a false negative study result (results were, in fact, NOT due to chance, but were related to the study variable) Defined as β, and is usually set at 0.20
	Power	ability of a study to detect an outcome of interest; the probability that a false null hypothesis will actually be rejected -Power is defined as (1-β), and is usually set at .80 (80%) -should ALWAYS be done before a study -Tells you how large a sample you need to detect a specific amount of change (difference)
	T or F: The Null hypothesis is either rejected or accepted.	FALSE. The null hypothesis is either rejected or NOT rejected. It is NOT accepted.
	To get more Power you must...	increase the sample size, collect higher-level data and/or do a paired design.
	______ statistics involves variables measured on a nominal or ordinal scale.	Non Parametric Stats
	proportion of patients that HAVE disease (same as study pre-test probability)	Prevalence
	proportion of patients WITH the condition who test positive for the condition	Sensitivity
	Specificity	proportion of patients WITHOUT the condition whose test results are NEGATIVE for the condition.
	Positive Predictive Value (PPV)	proportion of patients with a POSITIVE test who HAVE the condition.
	Negative Predictive Value (NPV)	proportion of patients with a NEGATIVE test who DO NOT have the condition
	Likelihood ratio:	proportion of patients WITH disease who have a particular finding divided by the proportion of patients WITHOUT disease who have the same finding
	Shows how much the odds of disease are increased if the test result is positive	Positive likelihood ratio LR+
	Negative likelihood ratio LR-	shows how much the odds of disease are decreased if the test result is negative.
	Prevalence=	all patients with disease/ total population
	Sensitivity=	true +/ all diseased
	Specificity=	true - / all without disease
	PPV=	true + / all +
	NPV=	true - / all -
	LR+ =	Sensitivity/ (1-specificity)
	LR- =	(1-sensitivity)/ specificity

Share This Flashcard Set

Set the Language

Statistics

Add to Folders

Upgrade to Cram Premium

Card Range To Study

57 Cards in this Set