Consequences Of Regression Regression

Improved Essays
Regression Method
Based on the data we collected, our expected regression equation is:
Walmart revenue = β0 – β1CPI + β2CSI + β3ECOM – β4GAS + β5POP –β6INT – β7UNEMP + β8EARN
Then we ran the regression on Minitab and got an actual regression equation. We used the method (i.e. five questions and nine problems) to evaluate the equation Minitab produced. For each of the nine problems, we ask five questions: What is the problem? How can I detect it? What are the consequences of the problem? What are the possible fixes to the problem? What are the consequences of the fixes?
Problem 1: The residuals should be normally distributed. This can be detected visually using the standardized normal probability plot and histogram. If all the residuals are not within two standard deviations in the normal probability plot or form a bell curve in the histogram, then they are not normally distributed. When the residuals are not normally distributed, it means the
…show more content…
Then run a regression model using the residuals square as the dependent variable and fits square as independent variable. The regression will produce a p-value for the fits square, which will be compared to the selected alpha level (it could be 1%, 5%, or 10%). If the p-value is less than the alpha level, we reject the null hypothesis in favor of the alternate. Therefore, the residuals are heteroscedastic. However, if the p-value is greater than the alpha level, we accept the null hypothesis that the residuals are homoscedastic. Heteroscedasticity increases the standard error coefficient just like serial correlation. To resolve the problem of heteroscedasticity, we collect more data or develop a better model to explain the changes in the dependent variable. It is important to note that the KB test is not the most powerful test for heteroscedasticity. The White General test is more powerful in testing for heteroscedasticity. Unfortunately, it is not available on

Related Documents

  • Improved Essays

    Conversely, if 0 was in place of 1, it would be the outcome of the untreated group. Pre_it is a dummy variable identifying observations during a pretest period prior where the treatment has yet to be implemented. If Pre_it=1, this would reflect observations for the treatment group during the pretest period. The θ_P parameter is a fixed difference of conditional mean outcomes across pretest and posttest periods. A unknown smoothing function is represented by the g(A_i ), and it is assumed to be constant across the pre- and posttest time periods (for further discussion of a smoothing parameter see Peng, 1999).…

    • 1016 Words
    • 4 Pages
    Improved Essays
  • Improved Essays

    According to this equation, if the test is perfectly reliable, the true score variance would equal to the observed score variance and the reliability would equal to 1. Reliability can be also expressed as r_(XX^ ' )=〖1-S〗_E^2/S_X^2. Reliability coefficient could be estimated by several ways and through different ways the value of reliability coefficients may vary. However, the reliability coefficient cannot estimate the individual’s test score, we use standard error of measurement to estimate it. Second, it provided the definition of standard error of measurement.…

    • 729 Words
    • 3 Pages
    Improved Essays
  • Great Essays

    Formative indicators are used to form a superordinate construct where the individual indicators are weighted according to their relative importance in forming the construct (Chin, 1998). Moreover, the normality of data distribution not assumed, thus data with non-normal distributions can be conducted in structural equation modeling since its application is performed in a non parametric way. PLS is also recommended when either cross-sectional, survey, or quasi-experimental research designs are used; when a large number of manifest and latent variables are modeled or when too many or too few cases are available (Falk, 1992).These conditions apply to this study because it will adopt a survey design; the sample size in this study is relatively small (126) and the Likert- scale used in this study normally do not…

    • 1267 Words
    • 6 Pages
    Great Essays
  • Improved Essays

    Slide 13: If we looked at the row titled “Level of Significance of 1-Tailed Test’, a significance level of 0.0005 is found. Again, I want to emphasize that SPSS will calculate whether or not a significant change has occurred. We can say that our obtained t statistic of 4.34 is greater than 3.373 or the probability of obtaining a t statistic of 4.34 is less than .001 (p <.001). This p value is below our .05 alpha level. The probability of obtaining the difference of between the variables, if the null hypothesis were true, is extremely low.…

    • 1875 Words
    • 8 Pages
    Improved Essays
  • Improved Essays

    Generally, the goal is to reduce the disparities among the observed dataset and the linear approximation of the data. The linear fit that matches up with the patterns of the set of the data pairs. Two shortcomings that come with OLS is that outliners can be perceptively bad and skews the results. This can play a big impact because the square of the number grows. This makes the data set…

    • 834 Words
    • 4 Pages
    Improved Essays
  • Improved Essays

    The reason for using time sampling is to either systemically or randomly sample behavior. In naturalistic observation, situation sampling is the observing of behavior in different locations and under a specific circumstance or condition. The reason for using situation sampling is to improve to likelihood of probable findings. 6. One factor that decreases interobserver reliability by not clearly defined and observers left to assume the conclusion for themselves.…

    • 760 Words
    • 4 Pages
    Improved Essays
  • Great Essays

    The probability of getting a critical ratio as large as 2.961 in absolute value is less than 0.003. In other words, the regression weight for Perceived Quality towards Purchasing Decision is significantly different from zero at the 0.01 level (two-tailed). Thus the hypothesis of Perceived Quality towards Purchase Decision is significant and supported by the data. The next data as can be seen from the table 4.7 above is the P value of Brand Awareness is 0.926. This means the probability of getting a critical ratio as large as 0.093 in absolute value is 0.926.…

    • 1896 Words
    • 8 Pages
    Great Essays
  • Improved Essays

    Chi Square Test Lab Report

    • 2147 Words
    • 9 Pages

    The expected value is what it should be in order for us to accept our hypothesis. The p-value in a Chi Square test tells you the likely hood of your hypothesis being within a 95% confidence interval. If the p-value is smaller or equal to the variable, then reject your null hypothesis in favor of your alternative hypothesis. If the p-value is greater than the variable, then don’t reject null hypothesis. A chi square (X2) value is a tool to help determine whether distributions of categorical variables differ from each other.…

    • 2147 Words
    • 9 Pages
    Improved Essays
  • Improved Essays

    TOPSIS Model

    • 936 Words
    • 4 Pages

    Similarly, alternative A- indicates the least preferable alternative or the negative ideal solution. [8-9] The relative importance or weight of a criterion indicates the priority assigned to the criterion by the decision-maker while ranking the alternatives in a Multi criteria Decision-Making (MCDM) environment. The Entropy Method estimates the weights of the various criteria from the given payoff matrix and is independent of the views of the decision-maker. This method is particularly useful to explore contrasts between sets of data. These sets of data can be mapped as a set of alternative solutions in the payoff matrix where each alternative solution is evaluated in terms of its outcome.…

    • 936 Words
    • 4 Pages
    Improved Essays
  • Improved Essays

    However, the JND decreased between lines size two and size three although the size of the stimulus between those two lines increased (figure one). When Weber’s law was applied to the original data set, the relationship between line length and Weber’s fraction was almost completely linear and had a slope of zero. At line size two, however, there is a discrepancy (figure two). The result from line size two contradicts Weber’s law since the product of creating that fraction should produce a constant, k, which if plotted should create almost a horizontal…

    • 767 Words
    • 4 Pages
    Improved Essays