• Shuffle
    Toggle On
    Toggle Off
  • Alphabetize
    Toggle On
    Toggle Off
  • Front First
    Toggle On
    Toggle Off
  • Both Sides
    Toggle On
    Toggle Off
  • Read
    Toggle On
    Toggle Off
Reading...
Front

Card Range To Study

through

image

Play button

image

Play button

image

Progress

1/28

Click to flip

Use LEFT and RIGHT arrow keys to navigate between flashcards;

Use UP and DOWN arrow keys to flip the card;

H to show hint;

A reads text to speech;

28 Cards in this Set

  • Front
  • Back

what is a single independent variable model?

an equation with only one independent variable




Y = β0 + β1X

what is a multivariate regression model?

an equation with two or more independent variables




Y = β0 + β1X1 + β2X2

how do you evaluate the quality of a regression equation? (8 questions)

is the equation supported by sound theory?




how well does the estimated regression fit the data?




is the dataset reasonable large and accurate?




is OLS the best estimator for the equation?




how well do the estimated coefficients correspond to expectations developed by the researcher before the data were collected?




are all the obviously important variables included in the equation?




has the most theoretically logical functional form been used?




does the regression appear to be free of major econometric problems?

what is r-squared?

r-squared is the ratio of explained sum of squares to total sum of squares




or the amount of change in the dependent variable explained by the changes in the independent variable/s




adding another independent variable will at least keep R2 constant, but generally increases it

what is r?

r is the simple correlation coefficient, a measure of the strength and direction of the linear relationship between two variables




in single independent variable models, r*r == R2




meaning


r = 1 - perfectly positively correlated


r = -1 - perfectly negatively correlated


r = 0 - totally uncorrelated

what is adjusted r-squared?

adjusted r-squared is r-squared taking into account degrees of freedom




it fixes the problem of new independent variables increasing r-squared even if they don't add anything to the equation

what are the steps in regression analysis? (6 steps)

1. review the literature and develop the theoretical model




2. specify the model: select the independent variables and the functional form




2.1 choosing independent variables and how they should be measured


2.2 choosing functional form of variables


2.3 choosing properties of the stochastic error term




3. hypothesize the expected signs of the coefficients




4. collect, inspect, and clean the data




5. estimate and the evaluate the equation




6. document results

what are the classical assumptions? (6 required, 1 optional)

1. the regression model is linear, is correctly specified, and has an additive error term




2. the error term has a zero population mean




3. all explanatory variables are uncorrelated with the error term




4. observations of the error term are uncorrelated with the each other (no serial correlation)




5. the error term has a constant variance (no heteroskedasticity)




6. no explanatory variable is a perfect linear function of any other explanatory variables (no perfect multicollinearity)




7. the error term is normally distributed (optional, but usually invoked)

what are the properties of the sampling distribution of OLS estimators? (4 properties)

mean and unbiasedness - an estimator is unbiased if its sampling distribution has as its expected value, the true value of β; E(β) = β




properties of the variance - desirable for the sample distribution to be as narrow as possible; as sample size increases, variance decreases




standard error - since this is square root of variance, a larger sample size will cause it to fall




standard deviation - same as standard error, but used for population measurements, not sample measurements

what is the Gauss-Markov Theorem?

given classical assumptions 1-6 are true, then the OLS estimator of a coefficient is the minimum variance estimator from among the set of all linear unbiased estimators




means that estimators are BLUE (best [min. variance], linear, unbiased, estimators)

what are the properties of OLS estimators?

unbiased: E(β) = β, OLS estimates are centered around the true value




minimum variance: distribution of coefficient estimates around the true parameter values is as tightly distributed as possible




consistent: as sample size approaches infinity, estimates converge to true population parameters




normally distributed: due to distribution, various statistical tests based on the normal distribution may be applied to the estimates

what are the steps of a hypothesis (t) test?

1. set up null and alternative hypotheses


null: opposite of what you believe


alternative: what you believe to be true




2. choose level of significance, which gives you a critical t-value




3. run the regression and obtain an estimated t-value




4. apply the decision rule by comparing calculated t to the critical t in order to reject the null hypothesis or not

what are the different types of errors in a hypothesis test?

Type 1: you reject a null hypothesis that is true




Type 2: you do not reject a null hypothesis that is false

what is a t-test?

used to test significance of a hypothesis about a single coefficient




appropriate when stochastic error term is normally distributed and when variance of distribution must be estimated

what is a confidence interval?

a range that contains the true value of an item a specified percentage of the time (percentage is level of confidence)

what is an F-test?

an f-test is a way to test multiple coefficients simultaneously; two main types:




1. Joint Hypothesis Test - a null hypothesis that contains multiple hypotheses or a single hypothesis about a group of coefficients; appropriate when underlying economic theory specifies values for multiple coefficients simultaneously




2. Test of Overall Significance - null hypothesis in an f-test of overall significance is that all the slope coefficients in the equation equal zero simultaneously; used to show that overall fit of an equation is statistically significant

what is a p-value?

the probability of observing a t-score that size or larger if the null hypothesis were true




it is two times area under the curve of the t-distribution between the absolute value of the t-score and infinity




need a small p-value to reject null hypothesis




decision rule: reject H0 if p-value(k) < the level of significance and βk has the sign implied by HA

what is omitted variable bias? what are its effects?

omitted variable bias takes place when a relevant independent variable is left out of a regression equation; it causes inaccurate estimates of all included coefficients




classical assumption III (all explanatory variables are independent of the error term) is violated with an omitted variable because all variables are generally correlated in some way, making included variables and the error term take on changes that would have been attributed to the omitted variable

what are irrelevant variables?

an irrelevant variable has no place in an equation; it increases the variances of the estimated coeffcients

what are the 4 specification criterion for independent variables?

theory: is the variable's place in the equation unambiguous and theoretically sound?




t-test: is the variable’s estimated coefficient significant in the expected direction?




Adj. R2: does the overall fit of the equation improve when the variable is added to the equation?




bias: do other variable’s coefficients change significantly when the variable is added to the equation?

what is a constant term? what are its components?

constant term: expected value of Y when all explanatory variables and the error term are zero




3 components:


1. the true value of the constant


2. the constant impact of any specification errors (like omitted variables)


3. the mean of the error term for the correctly specified equation (if not equal to zero)

what is the linear functional form?

Y=β0+β1X1

what is double log functional form?

lnY= β0+ β1(ln X1)

what is semi-log functional form?

lnY= β0+ β1X1




AND




Y= β0+ β1(ln X1)+β2X2



what is polynomial form?

Y= β0+ β1X1+β2(X1)^2

what is inverse form?

Y= β0+ β1(1/X1)+β2X2

what is a lagged independent variable?

in time series models, effects are not always instant, you want to sometimes lag independent variables, represented by subscript (t-1)

what is a dummy variable? what are the two types?

dummy variable: a binary condition that is represented by a 1 if true, or 0 otherwise




slope dummies:


- Y= β0+ β1X1+β2D1+β3D1X1


- slope dummies allow the slope of of the relationship between the dependent variable and an independent variable to be different depending on if the dummy condition is met




intercept dummies:


- Y= β0+ β1X1+β2D1


- slope stays constant if dummy condition is met, but changes intercept