• Shuffle
    Toggle On
    Toggle Off
  • Alphabetize
    Toggle On
    Toggle Off
  • Front First
    Toggle On
    Toggle Off
  • Both Sides
    Toggle On
    Toggle Off
  • Read
    Toggle On
    Toggle Off
Reading...
Front

Card Range To Study

through

image

Play button

image

Play button

image

Progress

1/64

Click to flip

Use LEFT and RIGHT arrow keys to navigate between flashcards;

Use UP and DOWN arrow keys to flip the card;

H to show hint;

A reads text to speech;

64 Cards in this Set

  • Front
  • Back
When is an event random?
If we know what outcomes could happen, but not which particular values will happen.
Random numbers are easy/hard to generate?
Hard.

However, random number tables and/or generators easily allow for them to be used to create a random sample, survey, etc.
What is a simulation?
It models random events by using random numbers to specify event outcomes with relative frequencies that correspond to the true real-world relative frequencies we're trying to model.
What is a simulation component?
The most basic situation in a simulation in which something happens at random.
Outcome
An individual result of a component of a simulation is its outcome.
Trial
The sequence of several components representing events that we are pretending will take place.
Response variable =
Y; the dependent variable.

Or, ultimately, it's the results of each trial with respect to what we're interested in.
Population:
The entire group of individuals or instances about whom we hope to learn.
Sample:
A (representative) subset of a population, examined in hope of learning about the population.
Sample survey:
A study that asks questions of a sample drawn from some population in the hope of learning something about the entire population.

Polls taken to assess voter preferences are common sample surveys.
Bias:
Any systematic failure of a sampling method to represent its population.

It is almost impossible to recover from bias, so efforts to avoid it are well spent.
What are the main types of bias?
(1) Relying on voluntary response

(2) Undercoverage of the population.

(3) Nonresponse bias

(4) Response bias
Randomization:
The best defense against bias in which each individual is given a fair, random chance of being selected.
Matching:
Any attempt to force a sample to resemble specified attributes of the population.

Matching may help make better samples, but is no substitute for randomization.
Sample size:
The number of individuals in a sample.

The sample size determines how well the sample represents the population, NOT the fraction of the population sampled.
Census:
A sample that consists of the entire population.
Population parameter:
A numerically valued attribute of a model for a population.

We rarely expect to know the true value of a population parameter. but we do hope to estimate it from sampled data.

Example: the mean income of all employed people in the country.
Statistic:
Values calculated for sampled data.
Representative:
A sample is said to be representative if the statistics computed from it accurately reflect the corresponding population parameters.
Simple Random Survey (SRS):
A simple random sample of sample size (n) is one in which each set of (n) elements in the population has an equal chance of selection.
Sampling frame:
A list of individuals from whom the sample is drawn.

Individuals who may be in the population of interest but are not in the sampling frame cannot be included in any sample.
Sampling variability:
The natural tendency of randomly drawn samples to differ, one from another.

Sometimes, unfortunately, called sampling error, sampling variability is no error at all - but just the natural result of random sampling.
Stratified Random Sample:
A sampling design in which the population is divided into several subpopulations, or strata, and random samples are then drawn from each stratum.

If the strata are homogenous but are different from each other, a stratified sample may yield more consistent results.
Cluster Sample:
A sampling design in which entire groups, or clusters, are chosen at random.

Cluster sampling is usually selected as a matter of convenience, practicality, or cost.

Each cluster should be heterogenous (and representative of the population), so all the clusters should be similar to each other.
Multistage Sample:
Sampling schemes that combine several sampling methods.

Example: a national polling service may stratify the country by geographical regions, select a random sample of cities from each region, and then interview a cluster of residents in each city.
Systematic Sample:
A sample drawn by selecting individuals systematically from a sampling frame.

When there is no relationship between the order of the sampling frame and the variables of interest, a systematic sample can be representative.
Voluntary response bias:
Bias introduced to a sample when individuals can choose on their own whether to participate in the sample.

Samples based on voluntary response are always invalid and cannot be recovered.
Convenience sample:
Consists of the individuals who are "conveniently" available.

These samples often fail to be representative seeing as every individual in the population is not equally convenient to the sample.
Undercoverage:
A sampling scheme that biases the sample in a way that gives a part of the population less representation than it has in the population.
Nonresponse bias:
Bias introduced to a sample when a large fraction of those sampled fail to respond. Those who actually do respond aren't likely to represent the whole.

Voluntary response bias is a form of nonresponse bias, but nonresponse may occur for other reasons as well.
Response bias:
Anything in a survey that influences responses including the wording of questions, how you approach them, etc.
Basics of Stratified Sampling:
(1) Homogeneous; population broken into different groups.

(2) "Some of all"
Basics of Cluster Sampling:
(1) Heterogeneous; taking all student responses from some of the classrooms.

(2) "All of some"
Basics of Systematic Sampling:
(1) Often the least expensive method of sampling.

(2) Randomly pick the first person but after them choose every 10th person.

ONLY works if the first person is chosen randomly.
Basics of Multistage Sampling:
Incorporates various methods of sampling.
Basics of Nonresponse Bias:
Can arise when sampled individuals will not or cannot respond.
Basics of Response Bias:
When the answers of those responding may be influenced by external factors such as the wording of the questions, how they're approached, or interviewer behavior.
What samples are most likely to be biased?
Voluntary response samples.
Basics of Undercoverage Bias:
Not including people in the population such as those who do not have cell phones, internet connection, are homeless, are unlisted, etc.
Basics of Overcoverage Bias:
Voluntary responses; those who use InstaPolls or online polls/samples/surveys.

Only those who are interested in responding will respond and that doesn't represent a larger realm of people that is needed to be sampled.
How is the population of interest usually determined?
By "why"
What are the subjects called?

What are humans called?
Experimental units; if humans use the term participants or subjects.
Observational Study:
A study based on data in which no manipulation of factors has been employed.
Confounding:
Essentially something that you didn't control that affected your experiment and its results (i.e. lurking variables).

Something else that could explain your results.
What are the two types of observational studies; briefly describe.
Retrospective: looking back and studying previous behaviors or conditions.

Prospective: subjects are followed to observe future outcomes.
Retrospective study:
An observational study in which subjects are selected and then their previous behaviors or conditions are determined.

Because retrospective studies are not based on random samples, they usually focus on estimating differences between groups or associations between variables.
Prospective study:
An observational study in which subjects are followed to observe future outcomes.

Because no treatments are deliberately applied, a prospective study is not an experiment.

They're typically focused on estimating differences among groups that might appear as the groups are followed during the course of the study.
Experiment:
An experiment MANIPULATES factor levels to creat treatments, RANDOMLY assigns subjects to these treatment levels, and then compares the responses of the subject groups across treatment levels.
Random assignment:
To be valid, an experiment must assign experimental units to treatment groups at random.
Factor:
A variable whose levels are controlled by the experimenter.

Experiments attempt to discover the effects that differences in factor levels may have on the responses of the experimental units.
Response:
A variable whose values are compared across different treatments.

In a randomized experiment, large response differences can be attributed to the effect of differences in treatment level.
Experimental units:
Individuals on whom an experiment is performed.

Usually called subjects and/or participants when they're human.
Level:
The specific values that the experimenter chooses for a factor are called the levels.
Treatment:
The process, intervention, or other controlled circumstance applied to randomly assigned experimental units/

Treatments are the different levels of a single factor or are made up of combinations of levels of two or more factors.
Principles of Experimental Design:
(1) Control: aspects of the experiment that we know may have an effect on the response.

(2) Randomize: subjects to treatments to even out effects that we cannot control.

(3) Replicate: as many subjects as possible. If the subjects are not representative of the population of interest, replicate the entire study with a different group of subjects - preferably from a different part of the population

(4) Block: to reduce efects of identifiable attributes of the subjects that cannot be controlled.
Statistically significant:
When an observed difference is too large for us to believe that it is likely to have occurred naturally, we consider the difference to be statistically significant.
Control group:
The experimental units assigned to a baseline treatment level, typically either the default treatment, which is well understood, or a null, placebo treatment.

Their responses provide a basis for comparison.
Blinding:
Any individual associated with an experiment who is not aware of how subjects have been allocated to treatment groups is said to be "blind."
Single-blind/Double-blind:
There are two main classes of individuals who can affect the outcome of an experiment:

(1) Those who could influence the results (the subjects, treatment administrators, or technicians).

(2) Those who evaluate the results (judges, treating physicians, etc.)

When every individual in either of these cases is blinded = single-blind.

When everyone in both cases is blinded = double-blind case.
Placebo:
A treatment known to have no effect.

Only by comparing with a placebo can we be sure that the observed effect of a treatment is not due simply to the placebo effect.
Placebo Effect:
The tendency of many human subjects (often 20% or more of experiment subjects) to show a response even when administered a placebo.
Block:
When groups of experimental units are similar, it is often a good idea to gather them together into blocks.

By blocking, we isolate the variability attributable to the differences between the blocks so that we can see the differences caused by the treatments more clearly.

Examples: blocking full-term babies from premature, ethnicity, gender, etc.
Matching:
In a retrospective or prospective study, subjects who are similar in ways not under study may be matched and then compared with each other on the variables of interest.

Matching, like blocking, reduces unwanted variation.
Matched Pairs:
Putting the same type of person (very similar statistics) in different testing environments to make more accurate statements of your data.