• Shuffle
    Toggle On
    Toggle Off
  • Alphabetize
    Toggle On
    Toggle Off
  • Front First
    Toggle On
    Toggle Off
  • Both Sides
    Toggle On
    Toggle Off
  • Read
    Toggle On
    Toggle Off
Reading...
Front

Card Range To Study

through

image

Play button

image

Play button

image

Progress

1/19

Click to flip

Use LEFT and RIGHT arrow keys to navigate between flashcards;

Use UP and DOWN arrow keys to flip the card;

H to show hint;

A reads text to speech;

19 Cards in this Set

  • Front
  • Back

Observational study

I.e. Correlations

Sample study

Randomly sample source. More sample the better. To estimate. To get a sense of something.

Experimental study

Experimental group, control group. Randomly put in either group. Find causality.

Define correlations and causality

Todo

What is the mean sign


what is the standard deviation sign



µ (mu mean sign)


σ (sigma standard deviation)

Set: X \ Y means?

X not in Y.




X = {1,2}


Y = {1,2,3}


X \ Y = {}

P(A or B) = P(A) + P(B) - ?

P(A or B) = P(A) + P(B) - P(A and B)

Empirical Rule?

68% - 95% - 99.7% for normal distribution problems

how to calculate z-score? define what is a z-score.

z = (X - population mean) / standard deviation




z-score or standard score indicates how many standard deviation an element is from the mean



When two events, A and B, are dependent, the probability of both occurring is:




P(A and B) = ?

P(A and B) = P(A) * P(B|A)




P(A and B) = P(B) * P(A|B)




P(B|A) = P(A and B) / P(A)

Data Types

Discrete Continuous


Ordered Ordinal (S,M,L) Fields (temp)


(values are Quantitative


comparable) (1,2,3)




Unordered Nominal (shape) Cyclic values


Categories (directions, hues)


(nationality)

Give some examples of construct and operational definition

construct: intelligence, hunger, maturity


operational definition: grams of food consume, number of questions answered correctly

Define the core activities for each of the following analysis:


Descriptive Analysis, Inference, Prediction, Design

Descriptive analysis: EDA, quantification, summarization, clustering


Inference: estimation, sampling, variability, defining populations


Prediction: machine learning, supervised learning


Design: design experiments

What is A/B (split) testing?

Compare two variants A and B in a controlled experiment. For example, showing two variant of a web page to similar visitors at the same time to see which gives a better conversion rate.

In data analysis, a descriptive question is...

one that seeks to summarize a characteristic of a set of data. e.g. proportion of males, mean number of servings of fresh fruits and vegetables per day. There is no interpretation of the result itself as the result is a fact.

In data analysis, an exploratory question is...

one in which you analyze the data to see if there are patterns, trends or relationships between variables. Also called "hypothesis-generating" analyses because rather than testing a hypothesis, you are looking for patterns that would support proposing a hypothesis. e.g. propose the hypothesis that "among adults, eating at least 5 servings a day of fresh fruit and vegetables is associated with fewer viral illnesses per year"

In data analysis, an inferential question is...

a restatement of a proposed hypothesis as a question and would be answered by analysing a different set of data. By analysing this different set of data you are both determining if the association you observed in your exploratory analysis holds in a different sample.

In data analysis, a predictive question is...

one you are less interested in a cause but just to predict a result. For example higher income may be one of the final set of predictors, and you may not know (or even care) why people with higher incomes are more likely to eat diet high in fresh fruits and vegetables, but what is most important is that income is a factor that predicts this behavior.

In data analysis, a mechanistic question is...

a question that asks how a diet high in fresh fruits and vegetables leads to a reduction in the number of viral illnesses would be a mechanistic question