Use LEFT and RIGHT arrow keys to navigate between flashcards;
Use UP and DOWN arrow keys to flip the card;
H to show hint;
A reads text to speech;
51 Cards in this Set
- Front
- Back
- 3rd side (hint)
Using a vertical scale that doesn't start at zero |
Exaggerate the difference between the two numbers of responses |
|
|
Voluntary response (self-selected) sample |
Respondents decide themselves whether to participate |
Those with the strong interest in the topic or more likely to participate so results are very questionable |
|
It is important to obtain sample data |
That are representative of the population from which the data are shown |
|
|
Small samples |
Conclusion should not be based on samples that are far too small |
|
|
Sample data must be collected |
In an appropriate way such as through a process of random selection |
|
|
If sample data are not collected in an appropriate way |
The data may be so completely useless that no amount of statistical torturing can salvage them |
|
|
Data |
Collections of observations such as measurements genders or survey responses |
|
|
A single data value |
Datum |
|
|
Statistics |
The science of planning studies and experiments obtaining data and then organizing summarizing presenting analyzing and interpreting those data and then drawing conclusions based on them |
|
|
Population |
The complete collection of all measurements or data that are being considered |
|
|
Census |
The collection of data from every member of the population |
|
|
Sample |
A subcollection of members selected from a population |
|
|
Statistical study |
Prepare analyze and conclude |
|
|
Prepare |
Context source of the data sampling method |
|
|
Analyze |
Graph the data Explore the data Apply statistical methods |
|
|
Conclude |
Statistical significance |
|
|
Statistical significance |
Results not likely to happen by chance |
|
|
Practical significance |
Looks at whether the difference is large enough to be a value in a practical sense |
For example if participants in the Atkins diet lost two pounds a year there is statistical significance but not practical |
|
Value of a statistical life (VSL) |
The amount of money a person or Society is willing to spend to save a life |
Since there is no formal market for lives the only way to measure the vsl is through indirect methods, for example surveys or observed human behavior in risky environments |
|
Common examples of voluntary response samples |
Internet polls in which people online can decide whether to respond Mail-in polls in which subject can decide whether to reply Telephone call-in polls in which newspaper radio or television announcements can ask that you voluntarily call a special number to register your opinion |
|
|
What is incorrectly use to assert or imply conclusions about a larger population? |
Voluntary response samples |
Such a sample is fundamentally flawed and should not be used for making a general statement about a larger population |
|
Publication bias |
The tendency to publish positive results (such as showing that some treatment is effective) much more often the negative results |
|
|
Graph such as bar graphs and pie charts can |
Be used to exaggerate or understate the true nature of data |
|
|
Pictographs |
Drawing of objects |
Can also be misleading |
|
Misleading or unclear percentages |
Are sometimes used If you are taking 100% of some quantity you are taking it all |
|
|
It is not necessarily the size of the sample that makes it effective |
But the sampling method does |
|
|
Benford's law |
An observation about the frequency distribution of leading digits in many real life sets of numerical data |
The law states that in many naturally occurring collections of numbers deleting significant digit is likely to be small |
|
Correlation |
Association between two variables Correlation does not imply causation |
For example smoking and pulse rate |
|
Loaded questions |
If survey questions are not worded carefully the results of a study can be misleading |
|
|
Order of questions |
Sometimes survey questions are unintentionally loaded by such factors as the order of the items being considered |
|
|
Non-response |
Occurs when someone either refuses to respond to a survey question or is unavailable |
|
|
Sugging |
Mini telemarketers trying to sell goods or services by beginning with a sales pitch that intentionally sounds like it is part of an opinion |
|
|
Results can sometimes be dramatically affected by |
Missing data |
|
|
Precise numbers |
Because that figure is precise many people incorrectly assume that it is also accurate |
|
|
To find a percentage of an amount |
Drop the percent symbol and then divide the % value by 100 then multiply |
|
|
Parameter |
A numerical measurement describing some characteristic of a population |
|
|
Statistic |
A numerical measurement describing some characteristic of a sample |
|
|
Quantitative or numerical data |
Consists of numbers representing counts or measurements |
|
|
Categorical or qualitative or attribute data |
Consist of names or labels that are not numbers representing counts or measurements |
|
|
In a survey of 521 subjects each was asked how often he or she played Sports. The survey subjects were internet users who responded to a question that was posted on a News website. |
It is flawed because it is a voluntary response sample. |
|
|
In a survey of 614 human resource professionals, each was asked about the importance of the appearance of a job applicant. The survey subjects are randomly selected by posters from a reputable market research firm. |
It appears to be sad because the data are not biased in any way. |
|
|
Quantitative or numerical data example |
The ages in years of survey respondents |
|
|
Categorical data as labels |
Categorical data as labels: the political party affiliations of survey respondents for example Democrat Republican independent |
|
|
Categorical data as numbers |
The numbers 12, 47, 75, 88, and 25 were soon on the jerseys of the starting offense for the New Orleans Saints when they won a recent Super Bowl. Those numbers are substitutes for names. They don't measure or count anything, so they are categorical data. |
|
|
Quantitative data can be further described by |
Distinguishing between discrete and continuous types. |
|
|
Discrete data |
Finite values, or buckets Number of children in household. Number of languages a person speaks. Number of people sleeping in stats class. |
If there are infinitely many values, the collection of values is countable if it is possible to count them individually, such as the number of tosses of a coin before getting tails. |
|
Continuous numerical data |
Result from infinitely many possible quantitative values, where the collection of values is not countable. |
Height of children Weight of cars Time to wake up in the morning Speed of the train |
|
Discrete data of the finite type |
The number of eggs that hens lay in one week or discrete data because they are finite numbers, such as five and seven that result from a counting process. |
|
|
Discrete data of the infinite type |
Consider the number of rolls of a die required to get an outcome of 2. It is possible that you could roll a die forever without getting a 2, but you can still count the number of rolls as you proceed. The collection of rolls is countable, because you can count them, even though you might go on Counting forever. |
|
|
Continuous data |
During the year, account might yield amount of milk that can be of any value between 0 liters and 7000 liters. There are infinitely many values between 0 liters and 7000 liters, but it is impossible to count the number of different possible values on such a continuous scale. |
|
|
The numbers of cans of cola are discrete data |
Correct grammar dictates that we use fewer for discrete amounts. |
|