• Shuffle
    Toggle On
    Toggle Off
  • Alphabetize
    Toggle On
    Toggle Off
  • Front First
    Toggle On
    Toggle Off
  • Both Sides
    Toggle On
    Toggle Off
  • Read
    Toggle On
    Toggle Off
Reading...
Front

Card Range To Study

through

image

Play button

image

Play button

image

Progress

1/44

Click to flip

Use LEFT and RIGHT arrow keys to navigate between flashcards;

Use UP and DOWN arrow keys to flip the card;

H to show hint;

A reads text to speech;

44 Cards in this Set

  • Front
  • Back
  • 3rd side (hint)
Define a Regression Model
A regression model provides the user with a functional relationship between the response variable and explanatory variables that allows the user to determine which of the ex- planatory variables have an effect on the response. The regression model allows the user to explore what happens to the response variable for specified changes in the explanatory variables.
reference to future values
prediction
reference to current or past values
explanation
In order for prediction to work there must be a _______ between x and y.
For prediction (or explanation) to make much sense, there must be some connection between the variable we’re predicting (the dependent variable) and the variable we’re using to make the prediction (the independent variable).
In order for prediction to work there must be a _____.
i.e there should be an entity that relates the two variables.
Unit of Association
With time-series data, the unit of association may simply be ______.
time
UOA
For _______, an economic or physical entity should connect the variables.
cross-sectional data
Type of Data
The variables may be measured at the same time period or, for genuine prediction, the independent variable may be measured at a time period _____ the dependent variable.
before
simple linear regression analysis, in which there is a __________ and the equation for predicting a dependent variable y is a linear function of a given independent variable x.
single independent variable
yˆ=2.0+3.0x
The constant term, such as the 2.0, is the _____ and is interpreted as the predicted value of y when x =0.
intercept term
yˆ=2.0+3.0x
The coefficient of x, such as the 3.0, is the _____ of the line, the predicted change in y when there is a one-unit change in x.
slope
what is the prediction equation ?
yˆ= b0ˆ+b1ˆx
where b0ˆ is the intercept and b1ˆ is the slope
The basic idea of simple linear regression is to use data to fit a prediction line
that relates a dependent variable __ and a single independent variable ___.
y , x
According to the assumption of linearity, the _____ of the equation does not change as x changes.
slope
corresponding to each y we introduce a random error term ei and assume the model
y=b0 +b1x+e. Why?
Assuming linearity, we would like to write y as a linear function of x: y=b0+ b1x.
However, according to such an equation, y is an exact linear function of x; no room is left for the inevitable errors (deviation of actual y values from their predicted values). So we introduce εi
We assume the random variable y to be made up of a predictable part (a linear function of x) and an unpredictable part, the ________.
random error term. εi
y=b0 +b1x+ε
The coefficients __ and ___ are interpreted as the true, underlying intercept and slope.
b0 and b1
The error term ε includes the effects of all other factors, ___ or ____.
known or unknown.
factors that might have been used in prediction but were not
The combined effects of unpredictable and ignored factors yield the random error terms ε.
In regression studies, the values of the independent variable (the xi values) are usually taken as predetermined _______, so the only source of randomness is the ei terms
constants
If the data are based on a random sample of applicants, xi (as well as yi) is a ________ variable.
random
If the xis are random, we can simply regard all probability statements as conditional on the _______ xis.
observed
Formal assumptions of regression analysis are? there are 4.
1. The relation is, in fact, linear, so that the errors all have expected value zero: E(εi) = 0 for all i.

2. The errors all have the same variance: Var(εi) = σ^2 for all i.

3. The errors are independent of each other.

4. The errors are all normally distributed; εi is normally distributed for all i.
The actual values of the dependent variable are distributed ______, with _______ values falling on the regression line and the same ______ at all values of the independent variable.
normally, mean, standard deviation
This is simply a plot of each (x, y) point, with the independent variable value on the horizontal axis, and the dependent variable value measured on the vertical axis.
scatterplot of the data
scatterplot: Look to see whether the points basically fall around a straight ______ or whether there is a definite ______ in the pattern. Also look to see whether there are any evident _______ falling far from the general pattern of the data.
line, curve, outliers
_______ have been developed to sketch a curve through data without necessarily assuming any particular model.
smoothers
If such a smoother yields something close to a straight line, then ___________ is reasonable
linear regression
Roughly, a smoother takes a relatively narrow “slice” of data along the x axis, calculates a line that fits the data in that slice, moves the slice slightly along the x axis, recalculates the line, and so on. Then all the little lines are connected in a smooth curve.
LOWESS (locally weighted scatterplot smoother).
LOWESS: The width of the slice is called the _______.
bandwidth
It can be understood as taking a narrow slice of data, fitting a curve (often a cubic equation) to the slice, moving to the next slice, fitting another curve, and so on. The curves are calculated in such a way as to form a connected, continuous curve.
spline fit. Another type of scatterplot smoother
Many economic relations are not linear. For example, any diminishing returns pattern will tend to yield a relation that ________, but at a ________ rate.
increases, decreasing
If the scatterplot does not appear linear, by itself or when fitted with a LOWESS curve, it can often be “straightened out” by a ________ of either the inde- pendent variable or the dependent variable.
transformation
We can try several transformations of the independent variable to find a scatterplot in which the points more nearly fall along a straight line. Three common transformations are ?
square root, natural logarithm, and inverse(one divided by the variable).
Finding a good transformation often requires______ and ______.
trial and error.
There are two key features to look for in a scatterplot. What are they?
First, is the relation nonlinear? Second, is there a pattern of increasing variability along the y (vertical) axis? If there is, the assumption of constant variance is questionable.
choosing a transformation:
If the plot indicates a relation that is increasing but at a decreasing rate, and if variability around the curve is roughly constant then..
transform x using square root, logarithm, or inverse transformations.
Symmetric where the relation is decreasing at a decreasing rate
choosing a transformation:
If the plot indicates a relation that is increasing at an increasing rate, and if variability is roughly constant then..
try using both x and x2 as predictors. This method uses two variables, the multiple regression methods are needed.
Symmetric where the relation is
decreasing at an increasing rate
choosing a transformation:
If the plot indicates a relation that increases to a maximum and then decreases, and if variability around the curve is roughly constant,
try using both x and x2 as predictors.
choosing a transformation:
If the plot indicates a relation that is increasing at a decreasing rate, and if variability around the curve increases as the predicted y value increases,
try using y2 as the dependent variable.
Symmetric where the relation is decreasing at a decreasing rate
choosing a transformation:
If the plot indicates a relation that is increasing at an increasing rate, and if variability around the curve increases as the predicted y value increases,
try using ln(y) as the dependent variable. It sometimes may also be helpful to use ln(x) as the independent variable. Note that a change in a natural logarithm corresponds quite closely to a percentage change in the original variable. Thus, the slope of a transformed variable can be interpreted quite well as a percentage change.
Symmetric where the relation is
decreasing at an increasing rate
The assumptions we made in this section allow us to make _______ about the true parameter values from the sample data.
inferences
The population intercept, slope, and error variance all have to be _______ from limited sample data
estimated
For a simple regression with a single predictor, careful checking of a _____, ideally with a smooth curve fit through it, will help avoid serious blunders.
scatterplot
Once we have decided on any mathematical transformations, we must estimate the actual equation of ________.
the regression line