Set the Language

We weren't able to detect the audio language on your flashcards. Please select the correct language below.

Front

Back

Flashcards
»
ST 351 Midterm 2 and Final

St 351 Midterm 2 And Final

by shopper313, Apr. 2010

Subjects: statistics

Favorite

Add to folder

Flag

Shuffle
Toggle On

Toggle Off
Alphabetize
Toggle On

Toggle Off
Front First
Toggle On

Toggle Off
Both Sides
Toggle On

Toggle Off
Read
Toggle On

Toggle Off

Reading...

Front

Card Range To Study

through

Play button

Progress

1/170

Click to flip

Use LEFT and RIGHT arrow keys to navigate between flashcards;

Use UP and DOWN arrow keys to flip the card;

H to show hint;

A reads text to speech;

170 Cards in this Set

Front
Back

	trial	one repetition of a experimental situation
	outcome	what happens in the trial
	relative frequency	# times an outcome of interest occurs/total # of trials
	regularity	idea that in the long run that there will be a set proportion to outcomes
	law of large numbers	long run relative frequency of repeated independent trials will get closer to the true relative frequency as the number of trials increases
	sample space	list of all possible outcomes in a probability experiment
	event	outcome of collection of outcomes in a probability experiment ex: A= HH (getting heads then heads again for flipping a coin)
	equally likely outcomes	have the same probability of occuring
	probability	# outcomes of an event/# of total outcomes
	rules of probability	P is always between 0 and 1 probabilities of all possible outcomes sum to 1
	complement rule	P(B) = 1 - P(A)
	Addition rule	If two events are mutually exclusive then P(A or B) = P(A) + P (B)
	mutually exclusive	no outcomes in common
	mutliplication rule	if two events are independent of each other then P (A and B) = p(A) x P(B)
	indepedent	the outcome of A does not depend on the outcome of B
	disjoint	when two events have nothing in common
	what is one way to tell that events are independent	see if selection of one event affects the probability of the other OR The probability of (A or B) = P(A) * P(B)
	General addition rule	P(A or B) = P (A) + P (B) - P (A and B)
	what does the word and imply	that they events occur at the same time
	what does the word or imply	that event A occurs, event B occurs, or event A and B occurs at the same time
	conditional probability	P(B\|A) = P (A and B)/P(A)
	How is P(B\|A) read	the probability of event B given that event A has already occured
	General multiplication rule	P(A and B) = P(A) x P(B\|A)
	what does the box in a venn diagram represent	the sample space
	what does a circle in a venn diagram represent	an individual event
	If events in a venn diagram do not overlap what does that imply	that the events are disjoint
	premise of conditional probability	finding the probability of one event given that a second event has already occured
	What does an event to the power C represent	the compliment ex: if there are two events blue marbles and red marbles. If A is choosing a red marble, then Ac is chosing a blue marble
	what does truly random mean	that every outcome is equally likely to happen
	random variable	a variable whose value is a numerical outcome of some random event for example the number of heads that occur during 10 coin flips is a random variable because it is subject to a random event
	what is a typical notation of random vairable	X or Y
	discrete random variable	finite number of possible outcomes for an event
	continous random variable	can contain any value within a range of values
	what does a probability distribution of X (a random variable) mean?	it is the range of values that the random variable probability lies within
	how is the probability of a continuous random variable described	by a density curve
	How is probability determined by a density curve?	It is the area underneath the curve between the values in which the continuous variable falls
	how does one distinguish between a variable of interest and a random variable	variable of interest represents the piece of information one wishes to receive from just one observation. (ex: what happens when we flip a coin? we get a heads or tail) random variable: combines all the observations together. Keeping track of the number of times an outcome occurs. (ex: the number of times a coin flip comes up heads)
	What is a nuance if the variable of interest is quantitative	the random variable will be the same as well
	If the variable of interest is categorical, the random variable will be	discrete
	If the random variable is quantitative then the random variable is	continuous
	when using density curves to calculate the probability under a curve, what is essential to remember	that the area under the curve is 1 therefore l x w = 1
	on a continuous random variable density curve, what is the probability of a single value	zero because it does not have a width on the graph
	what is he mean and standard deviation of a standard normal curve	mean is 0 and standard deviation is 1
	z score	is the number of standard deviation an observation is from the mean
	what does it mean if a z statistic is positive? what does it mean if it is negative?	positive- greater than the mean negative- less than the mean
	what does a z score of zero mean	it is equal to the mean
	how does one calculate a z score	observed value - mean / standard deviation X -u / o
	if the population data are normally distrubutedd, then the sample means are	normally distributed
	why use sample means	because it is often difficult to get data for an entire population
	when are sample means normally distributed	most of the time so long as the sample size for each sample is large enough
	what is always true about the population mean and the sample mean	they are always equal
	what does ux mean? uxbar?	ux = population mean uxbar =the mean of the sample sample means
	what is always true about the standard deviation sof the population and sample mean	the smaple mean will always be smaller
	equation for the standard deviation of the sample means	o xbar = ox / srt(n) n = sample size
	what is a condition of the standard deviation of sample means equation	it is best when sampling is done with replacement
	sample mean	is a mean of a collection of samples often this is done many times in order to create a distribution of sample means
	alternative hypothesis notation	Ha
	what is required about the null and alternative hypothesis	they have the same hypothesized value
	what is often true about an alternative hypothesis	it makes a statement where the population mean is greater or less than the hypothesized value or that it does not equal it
	What are conditions that lead to a one sided (one tailed) test	when the alt hypothesis states that the population mean is greater or less than the hypothesized mean
	what are conditions that lead to a two sided (two tailed test)	when the alt hypothesis states that the population mean does not equal the hypothesized value
	if there is a quantitative variable of interest, what information is needed to determine what statistical test should be done?	n (sample size) xbar (sample mean) sx (sample standard deviation) ox (population standard deviation)
	if the population standard deviation is known what stat test should be done?	one sample t test
	if the sample standard deviationis known what stat test should be done	one sample t test
	what are conditions of the one sample z test	1) the sample is representative of the population 2) the distribution of sample means is normal 3) obervations are independent of each other
	what is the only way to guarantee that you have a representative sample from a population	take a random sample
	what are assumptions of a one way z test	that the sample is representative of the population that the distrubution of sample means is approximately normal observations are independent of each other
	what assumption is made for all hypothesis tests	that observations are independent of each other
	p value	the probability of getting a sample mean that is as or more extreme than the one that was observed during the experiment if the null hypothesis is true
	is it possible that a sample mean could have turned out to be a different value than the one hypothesized	yes
	what action can be taken about the null hypothesis	it can be rejected or fail to be rejected
	z statistic	the number of standard deviations from what is observed from the mean oberv-hypo/stand dev
	when do you reject the null hypothesis	when it is small! less than 0.1 is strong evidence 0.1 to 0.5 show suggestive evidence 0.05 to .1 weak evidence .1 or larger is not sufficient
	significance level	a point where if the p value is below it, then the null hypothesis is rejected, or if it is above, then it can not be rejected
	what is the notation for significance	alpha
	what are the steps to hypothesis testing	1. define the variable of interest and the population of interest. determine if the VI is quantitative or categorical. 2. determine the null hypothesis and alternative hypothesis 3. choose an appropriate hypothesis test for the null hypothesis and list the assumptions made by the hypothesis test. 4. find the p value 5. Conclusion write a sentnece in the context of the problem that answers the question of interest.
	null hypothesis	a statement of no effect or no difference
	what is the notation for the null hypothesis	H0
	what are some general rules for a null hypothesis	always involves an equal sign the population parameter, ux, will be the same as the hypothesized value, u0
	alternative hypothesis	often associated with the researcheres claim or question. It makes a statement hypothesizing some change that will be expected to happen from a given treatment.
	what does a normal probability plot show	the normality of a data set regardless of size
	what does not equal to impluy	less than or greater than a value
	one tail test vs two tail test	one tail test either looks @ one end or the other of a normal distribution two tail test looks at both sides of a normal distribution, usually a given distance away from the mean.
	How do you find the p value for a two tailed test	add together the probability of both tails
	what is an easy way to find a probability for a two tailed test	find the probability of one tail and multiply by 2
	what is the type 1 hypothesis error	occurs when there is evidence to reject the null hypothesis when the null hypothesis is actually true
	what is the type 2 hypothesis error	when the null hypothesis is not rejected when it is actually false
	what is the probability of making a type 1 error if the significance level is given	P(making error) = alpha
	what is the probability of making a type 1 error if the significane level is not given	P(making error) = p value
	how do you determine the probability of making a type II error	P(type II error) = beta and beta depends on the effect size
	effect size	choosing a specific value for u that would make the null hypothesis and be large enough to make a difference
	power	probability of correctly rejecting the null hypothesis
	how does one obtain a higher power value	more sample size
	power is complements with...	beta
	beta depends on	a sample mean u that would cause one to accept the null hypothesis when it is not true
	equation for power	1 - B
	point estimate	using the sample mean as the best estimate of the population mean
	margin of error	range of values above and below the sample mean that give potential values for the population parameter
	purpose of confidence interval	it is used when trying to find out characteristics of a given population. In using confidence intervals, an assumption is made that the sample mean is close to the population mean. The confidence interval is a measure of the reliability of the estimate (usually the sample mean) and provides a range for which possible true values for the population parameter lie
	what is a requirement of using confidence intervals	that the data be distributed normally
	what is the confidence interval surround	the sample mean
	what is a 68% confidence interval	the range is determined to be 1 SD above and below the sample mean. This is the range where 68% of the data is believed to fall
	what is the general formula for a confidence interval	lower bound = sample mean - (# of sd away from the mean)(value of SD) upper bound same as above but with a plus sign
	what does a 95% confidence interval imply	that 95% of the population data is within 2 (*actually it is 1.96) sds above or below the sample mean
	how do you write a confidence interval	(lower limit # & units, upper limit # & units)
	what is the critical value for a confidence interval and what does it depend on?	critical value is z and it depends on the level of confidence
	what conditions must be met to use a confidence interval	sample must be representative of the population distribution of the sample means must be normal observations must be independent
	what does the confidence interval states about many samples and their confidence intervals	that from all the samples, 95% (or whatever conf value) will have a confidence interval that contains the true population mean
	when should a confidence interval be included	whenever a hypothesis test is performed
	what is the relation between a two sided hypothesis test and the confidence interval at a given significance level	If the hypothesized null hypothesis value is in the confidence interval, then there is no evidence to reject the null hypothesis at that significance level. If the hypothesized value for the null hypothesis is not within the confidence interval, then there is evidence to reject the null hypothesis
	standard error of the distribution of sample means	sample standard deviation / number of individuals in the sample it is used when you do not know the standard deviation of the population
	t value equation	mean - value / SEx
	t statistic represents	the number of standard errosr from an observed value from the mean of the distribution
	as the sample size increases what happens to sx in terms of ox	sx becomes a better estimate of ox
	what the point of degrees of freedom	to account for the different sample means that will occur from different sample sizes
	degrees of freedmo	n-1
	what is needed to calculate a p value in a t stat	t stat and deg of freed
	a higher confidence means less	precision
	what does the sahpe of a t distribution depend on	the degrees of freedom
	can t stats be negative	yes, the represent the # of standard errors below the mean
	where is the t value in a t value chart	it makes up the bulk of it.
	If one were to look at a t stat of 2.492 what does the probability for that value mean?	It is the probability of getting a t stat greater than the one you currently have P(x>2.492)
	what are properties of a t distribution	symmetric and unimodal
	how should one find the probaibliyt of a negative t stat	find the positive one, they are they same
	what should one do when the t statistic can't be found on the table	create an interval between the two closest values that surround your desired t stat
	when is it best to use a t test	when the population standard deviation is NOT KNOWN
	what is the equation for the confidence interval on a t test	mean +/- t value @ n-1 degrees of freedom *SEx
	what is a synonym for variable of interest	response variable
	in a two sample t test how many populations are thewre	2
	what is the purpose of a two sample t test	it compares the means between two independent populations that have a response variable in common
	how should one write a null hypothesis for a 2 sample t test	either than u1 = u2 or u1-u2 = 0
	how should one write the alternative hypothesis for a 2 sample t test	using an inequality sign ex: u1 > u2 or u1 - u2 > 0
	t are the conditions of a two sample t test	the samples are representative of the population the distribution of the differences betweetn the sample means is approx normal the observations are independent of each other
	u vs x bar	population parameter vs sample mean statistic
	what distribution does a two sample t test look at	the distribution of the difference between the sample means of the two populations
	what does the standard error of the difference in the sample means represent	the variance of the two populations added together, sqrted
	variance	standard dev squared
	t stat for a 2 sample t tests	observed - hypothethical / standard error
	how do you find the degrees of freedom for a two sample t test	the smaller of the two sample sizes minus 1 n(smallest #) - 1
	how do you find the confidence interval for a two sample t test	estimate between diff of population means +/- (t stat)(standard error)
	frequency table	shows the frequency of individuals in a category
	contingency table	a table that compares the frequency of the categorical variable of interests between 2 (or more) groups
	marginal distribution	number of individuals in each category divided by the total
	conditional distribution	same as a marginal distribution, only that it shows that of the group, how the distribution is broken up into the categories ex: 38% of women felt overweight 34% of women felt just right etc
	when the variable of interest is categorical, what is the random variable	discrete
	expected value is synonymous with	mean
	how to calculate the expected mean	it is the categorical value times the probability of that value all summed together
	how to calculate the standard deviation	the categorical value minus the mean squared times the probability summed together and square rooted
	what are the four conditions that must be met for a random varaible to have binomial distribution?	1. each observation has 2 possible outcomes of which only 1 occurs 2. there is a fixed number of observations (n the sample size 3. the outcomes of the observations are independent 4. the probability of of an outcome occuring is the same in a single trial
	0! =?	1
	what is a requirement of probability distributions	that they will sum up to equal 1
	how to calculate the mean of a binomial random variable X	ux = n*p
	what is the standard deviation of a binomial random variable	ox = sqrt(np(p-1)
	what is a geometric distribution show	it is for a discrete random variable it is the probability distribution of how many trials must happen before the first "success" occurs, or before the desired outcome occurs
	what does x represent in a geometric distribution	the number of trials to the first "success"
	what is an equation to represent the probability of x in a geometric distribution	P(x) = (1-p)^x-1 * p
	how is the expected value calculated in the geometric distribution	ux = 1/p
	how is the standard deviation calculated for the geometric distribution	ox = sqrt(I1-p)/p2)
	bernoulli trial	is an experiment in which only two outcomes are possible
	binomial random variable	the number of successes in a series of independent bernoulli trials
	when the sample size increases what happens to a bionomial distribution	it begins to look more like a normal distribution
	what is a problem about computing the probability with a large sample size	it is computationally intense
	what does the distribution of the sample proportions represent	the distribution of lots of sample proportions of sample size n taken from a population with a true proportion p
	as the sampling size increases how does the sampling proportion and the population proportion relate to each other	p hat = p
	what happens to the distribution of sampling proportions as the sample size increases	it becomes smaller
	what is the sampling proportion distribution centered around	p the true population proportion
	If we do not know p, the population proportion, what should one do	use p hat in its place
	when using a sampling proportion distribution what should you check	if np >10 and if n(1-p) >10
	how do you calculate the z statistic for a sampling proportion	p hat - p all over standard deviation
	bernoulli trial	situations where there are two outcomes: success or fail

Share This Flashcard Set

Set the Language

St 351 Midterm 2 And Final

Add to Folders

Upgrade to Cram Premium

Card Range To Study

170 Cards in this Set