Use LEFT and RIGHT arrow keys to navigate between flashcards;
Use UP and DOWN arrow keys to flip the card;
H to show hint;
A reads text to speech;
93 Cards in this Set
- Front
- Back
Alternative Hypothesis |
The Hypothesis which states the Null Hypothesis is incorrect in a significance test. |
|
Block design |
The random assignment of subjects to treatments is carried out separately within each block. |
|
Boxplot |
Displays the 5-number summary as a central box with whiskers that extend to the non-outly ingredients data values. |
|
Causation |
A cause and effect relationship in which one variable controls the changes in another variable. |
|
Center |
A descriptive feature which describes the placement and relationships of the median to the other parts of the graphic representation. |
|
Central Limits Theorem |
The sampling distribution of the mean will approach the normal distribution as n increases (n>30). |
|
Chi-Squared Goodness of Fit |
Uses sample data to test hypothesis about the shape or proportions of a population distribution. The test determines how well the obtained sample proportions specified by the null hypothesis. |
|
Cluster Sample |
A sampling design in which entire groups are chosen at random. |
|
Coefficient of Determination |
Measures the percentage of variation in a dependent variable explained by one or more independent variables (r^2). |
|
Conditional Probability |
The probability that a particular event will occur, given that another event has already occurred. |
|
Confidence Variable |
The range of values within which a population parameter is estimated to lie. |
|
Confounded Variable |
An intended difference between the conditions of an experiment that could have affected the dependent variable. |
|
Convenience Sample |
A sample that includes members of the population that are easily accessed. |
|
Correlation |
The measure of a relationship between two variables or sets of data. |
|
Data |
Information gathered from observations. |
|
Degrees of Freedom |
A parameter of the t distribution. When the t distribution is used in the computation of an interval estimate of a population mean, the appropriate t distribution has n-1 degrees of freedom. Where n is the size of the sample random sample. |
|
Descriptive Statistics |
Statistical procedures used to describe characteristics and responses of group of subjects. |
|
Discrete Random Variable |
Variable where the number of outcomes can be counted and each outcome has a measurable and positive probability. |
|
Dotplot |
Graphs and dot for each case against a single axis. |
|
Empirical Rule |
States that, in a normal distribution, about 68% of the terms are within one standard deviation of the mean, about 95% are within two standard deviations, and about 99.7% are within three standard deviations (normal curve). |
|
Experiment |
The act of conducting a controlled test or investigation. |
|
Experimental Probability |
Probability based on what happens when an experiment is actually done. |
|
Extrapolation |
Estimating a value outside the range of measured data. |
|
Histogram |
A bar graph that shows frequency of data within equal intervals. |
|
Independent |
A relationship between two sets of data or datum which states the outcome of one has no effect on the outcome of the other. |
|
Inferential Statistics |
Statistics that are used to interpret data and draw conclusions. |
|
Interpolation |
Using the Least Squares Regression Line to predict a y-value for an x-value within the x-dada set. |
|
IQR |
Range of the middle 50% of the values; Q3-Q1=75th percentile -25th percentile. |
|
Joint frequency |
The number of responses for a given characteristic. |
|
Law of Large Numbers |
Law stating that a large number of items taken at random from a population will (on the average ) have the population statistics. |
|
Least Squares Regression Line |
The line that minimizes the sum of squared residuals. |
|
Lurking Variable |
A variable other than x and y that simultaneously affect both variables, accounting to the correlation between the two. |
|
Margin of Error |
The range of percentage points in which the sample accurately reflects the population, the range surrounding a sample's response within which researchers are confident the larger population's true response would fall. |
|
Marginal Frequency |
Row and column totals in a contingency table (cross-tabulation) that represents the unitarian frequency distributions of responses the row and column variables. |
|
Matched Pairs |
Either two measurements are taken on each individual such as pre and post OR two individuals are matched by a third variable (different from explanatory variable and the response variable) such as identical twins. |
|
Mean |
The arithmetic average of a distribution, obtained by adding the scores and then dividing the number of scores. |
|
Median |
The middle score in the distribution; half the scores are above it and half are below it. |
|
Mode |
That datum which occurs the most in a set of data. |
|
Mutually Exclusive |
Each event or variable is independent from one another. No event or variable will have an effect on the probability of outcome for any other event or variable. |
|
Non-Response Bias |
A bias caused by a number of people who did not respond to the survey. |
|
Normal |
A sample which follows the Empirical Rule for distribution. |
|
Null Hypothesis |
The Hypothesis that states there is no difference between two or more sets of data in a significance test. |
|
Observational Study |
An experiment which observes individuals and measures variables of interest but does not attempt to influence the responses. |
|
Outlier |
An extreme deviation from the mean. |
|
P-Value |
The probability of getting a result at least as extreme as the result given from the test. The lower the value the stronger the evidence. |
|
Parameter |
A numerical measurement describing some characteristic of a population. |
|
Placebo Effect |
Experimental results caused by expectations alone; any effect on behavior caused by the administration of an inert substance or condition, which is assumed to be an active agent. |
|
Population |
The entire aggregation of items from which samples can be drawn. |
|
Probability |
The likelihood that a particular event will occur. |
|
Qualitative |
Data identified by something other than numbers. |
|
Quantitative |
Dad or datum being numerically defined. |
|
Quota Sample |
A sample deliberately constructed to reflect several of the major characteristics of a given population. |
|
Random Sample |
A sample in which every element in the sample has an equal chance of being selected. |
|
Residual |
The difference between an observed value of the response variable and the value predicted by the Regression Line. |
|
Response Bias |
Anything in the survey design that influences the responses from the sample. |
|
Sample |
Items selected at random from a population and used to test hypotheses about the population. |
|
Sample Space |
All possible outcomes of an experiment. |
|
Sampling distribution |
The distribution of values taken by the statistic in all possible samples of the same size from the same population. |
|
Scatterplot |
A graphed cluster of dots, each of which represents the values of two variables. |
|
Simple Random Sample |
Ever member of the population has a known and equal chance of selection. |
|
Simulation |
The act of repeating an experiment to get more accurate statistical evidence |
|
Spread |
A descriptive feature in which describes the range of data graphically. |
|
Standard Deviation |
A measure of variability that describes an average distance of every score from the mean (r). |
|
Standard Error |
5he standard deviation of a sampling distribution. |
|
Standardized Value (z-score) |
A value found by subtracting the mean and dividing by the standard deviation. |
|
Statistic |
A numerical value measurement describing some characteristic of a sample. |
|
Statistical Significance |
When your discovered p- value is less than your alpha (.05 if not given). States that chance alone would rarely produce an equally extreme results. |
|
Stemplot |
A graphical representation of a quantitative data set. Leading values of each data point are presented as stems and second digits are give as leaves. |
|
Stratified Sample |
A sample drawn by selecting individuals systematically from a sampling frame. |
|
t-test |
A parametric inferential statistical test of the null hypothesis for a single sample where the population standard deviation is unknown. |
|
Theoretical Probability |
The ratio of the number of favorable outcomes to the number of possible outcomes if all outcomes have ever the same chance of happening. |
|
Two-way Table |
A table counts for two categorical variables. It has really rows and c columns. |
|
Type I Error |
An error that occurs when a researcher concludes that the independent variable had an effect on the dependent variable, when no such relation exists; a false positive. |
|
Type II Error |
An error that occurs when a researcher concludes that the independent variable had no effect on the dependent variable, when in truth it did; a false positive. |
|
Undercoverage |
Occurs when some groups in the population are left out of the process of choosing the sample. |
|
Voluntary Response Bias |
Bias introduced to a sample when individuals can choose on their own whether to participate in the sample. |
|
Wording Bias |
A type of response Bias where the question is posed to achieve a desired result. |
|
z-test |
A parametric inferential statistical test of the null hypothesis for a single sample where the population standard deviation is know. |
|
Cumulative Frequency Plot |
A line graph that depicts cumulative Frequencies. |
|
Census |
A sample that consists of the entire population. |
|
Experimental Control |
Aspects of the experiment that we know. May have an effect on the response, but that are not the factors being stated. |
|
Grammatical Factor |
Variable whose levels are controlled by the experimenter. |
|
Double Blind |
Neither the subjects nor the people who have contact with then know which treatment a subject has received. |
|
Randomization |
Process by which each individual is given a fair chance of selection. |
|
Small Variance |
Cluster; the data is around the mean. |
|
Large Variance |
Data is spread out. |
|
Residual (error) |
The sum of the residuals is always 0. |
|
Methods for Gathering Data |
Surveys Interviews Experiments |
|
Multistage Sample |
Unbiased and easy to conduct. |
|
Stratified Sample Advantages |
Less variability Unbiases |
|
Categorical Variable Examples |
Eye color Zip code Race Type of pet |
|
Number of successes observed |
k (from the binomial formula) |
|
The 25 percentile |
Q1 (the first quartile) |