The Definitions Of Linear Regression

Improved Essays
Linear Regression
By p Nitin
Feb 16, 2013
Linear Regression Definition states that it can be measured by using lines of regression. Regression measures the amount of average relationship or mathematical relationship between two variables in terms of original units of data. Whereas, correlation measures the nature of relationship between two variables. i.e.., positive or negative or uncorrelated.

Regression is used for estimating the value of one variable if you know the value of other variable. i.e.., One of the variable is independent variable and other variable is dependent variable.

Let ( Xi , Yi ) ; i = 1, 2, 3, ...................n the n pairs of observations are given now plot all these points in XY-plane which reserves a scatter diagram.
…show more content…
Let ( Xi , Yi ) ; i = 1,2,3,................ n be n-pairs of observations are given and there are representing a linear regression.

We know that, coefficient of correlation

r = cov ( X , Y ) / (sigmaX sigmaY ) where cov ( X , Y ) = ( 1/n) sum XiYi - barX barY

and X2 = (1/n) sum ( Xi - barX
…show more content…
( 5 )

cov ( X, Y ) = ( 1/n)sum Xi Yi - barX barY

sigma X2 = ( 1/n )sum X2i - barX 2

Multiplying equation ( 4 ) with X and sub from ( 5 )

cov ( X , Y ) = b sigma X2

b = cov ( X , Y ) / sigma X2

Substitute the value of b in ( 4 )

rArr barY = a + cov ( X , Y ) barX / sigma X2

a = barY - cov ( X , Y ) barX / sigma X2

Substitute the value of a in ( 1 )

Y = barY - cov ( X , Y ) barX / sigma X2 + cov ( X , Y ) X / sigma X2

Y - barY = cov ( X , Y ) ( X - barX ) / sigma X2

Y - barY = rsigma X sigma Y ( X -barX ) / sigma X2

Y - barY = r sigma Y ( X - barX ) / sigma X

Y - barY = bYX ( X - barX )

Similarly, we can prove that regression equation of X-on-Y is

X - barX = bXY ( Y - barY )

Types of Linear Regression

There are two different types of linear regression. They are,

Simple Linear Regression
Multiple Linear Regression

Examples

Given below are some examples on how to calculate linear regression.

Example:

10 observations on price X and supply "Y" the following data was obtained sum X = 130, sum Y = 220, sum X2 = 2288, sum2 = 5506, sum XY = 3467 Find the line of regression of Y on X. Y

Solution:

The line of regression of Y on

Related Documents

  • Improved Essays

    Nike, Inc.: Case Study

    • 1474 Words
    • 6 Pages

    Individual Assignment Part I: According to the data of Nike, Inc. and Timberland Bancorp, Inc. from December 2008 to December 2013 showed by the attached excel, the annualized expected return, standard deviation and correlation of the two stocks are calculated and revealed. Using the result of the closing price minus the opening price to divide by the opening price can figure out the monthly returns during the 5 years in order to obtain the average monthly return. Then, the average monthly return time 12 months that can lead to the annualized return.…

    • 1474 Words
    • 6 Pages
    Improved Essays
  • Great Essays

    There is little doubt public education in the United States is under intense scrutiny from the American public in recent years. In Mississippi, years of negative student outcomes have created a great deal of negative publicity for the state’s public schools. It is important for educators, whether at the district level, research level, or policymaking level, to continue to study available data to identify areas of weakness with the realm of education and work collaboratively to remedy those issues. This report is a summary of several specific data sets collected by the Mississippi Department of Education and statistically analyzed for potential implications. This report will briefly highlight the necessity of such an analysis, and provide the…

    • 2057 Words
    • 9 Pages
    Great Essays
  • Great Essays

    1. State the title of your research simulation High-quality child care important is important for a child’s Development. 2. Describe the major topic and the overall goal…

    • 1251 Words
    • 6 Pages
    Great Essays
  • Improved Essays

    Reaction Lab

    • 1809 Words
    • 8 Pages

    The experiment began with a reaction of a 1cm3 cube of lamb liver and 5mL of hydrogen peroxide solution being put in test tubes in a beaker filled with 10℃ water until the temperatures of the liver and hydrogen peroxide were 10℃ also. The liver was put into a 100mL measuring cylinder and the hydrogen peroxide was also poured in as the timer was started. After twenty seconds, several millilitres of froth had been produced from the chemical concoction. This froth was recorded into a results table, and the previous steps were repeated but with increasing temperature increments of 10℃ until 50℃. After carefully completing the experiment, the final results obtained by measuring the volume of the froth produced after 20 seconds, were recorded in a table - see Table 1, and able to be observed in a graph, see Figure 7.…

    • 1809 Words
    • 8 Pages
    Improved Essays
  • Improved Essays

    Shot Vs Nasal Flu

    • 1452 Words
    • 6 Pages

    The first research question is which flu vaccine is more effective for preventing the flu? I think most of us has had the flu or know someone who has had the flu so therefore, we know how important it is in preventing the flu. Two different types of vaccines have been tested: the shot and the nasal spray. A lot of people are aware of the shot but not the nasal spray. The nasal spray is better for children ages two and up and for those who are scared of needles.…

    • 1452 Words
    • 6 Pages
    Improved Essays
  • Improved Essays

    The key distinction between correlation and casual connection is the effect that one has on the other regarding the outcomes in scientific test results. In a study, the investigator is looking for the possible effect on the dependent variable that might be induced by changing the independent variable. Casual connection is an occurrence or action that can cause another and the results are predictable and certain. With that being said, casual connection can also determine how the two events are related to each other.…

    • 587 Words
    • 3 Pages
    Improved Essays
  • Great Essays

    Problem: how does the angle of trajectory effect the horizontal distance of a bottle rocket Background research: When an object is in the air and has no forces acting on it other than gravity it becomes a projectile. (Henderson, 1996) Gravity causes the projectile to accelerate toward the ground in a uniform arc like a rainbow does. this shape is known by scientists and mathematicians as a parabola, and objects that fly in the shape of a parabola are said to be in parabolic motion.…

    • 1986 Words
    • 8 Pages
    Great Essays
  • Improved Essays

    In conducting a national analysis of the hypothesis on the effectiveness of the strictness of drunk driving laws, a one-tailed correlation test will be conducted to determine if a negative correlation exists between the two variables. Before conducting the actual statistical analysis, a scatter chart was made in order to help visualize the relationship between the variable. As can be seen in the scatter plot, there is indeed a negative relationship between the variables. However, the relationship is difficult to visualize without the use of a line of best fit which shows the downward trend of the relationship. It works in such a way that as x value or the strictness factor increases, the y value or drunk driving death rate decreases.…

    • 1018 Words
    • 5 Pages
    Improved Essays
  • Superior Essays

    Bounce Tables Lab Report

    • 954 Words
    • 4 Pages

    Introduction In this lab, we are measuring the ranges R of the ball as the function of the heights, H, that the ball is dropped from. I will use these data to determined the relationship between the height and the range. Procedure…

    • 954 Words
    • 4 Pages
    Superior Essays
  • Improved Essays

    Vanessa Kehdi November 8, 2017 Statistics 352 Lab Assignment 5 Problem 1: The null hypothesis is that there is no association between the political party affiliation and the opinion held of President Obama’s performance. The alternative hypothesis is that there is an association between the political party affiliation and the opinion held of President Obama’s performance. Based on the bar chart, there is an association between the political party and President Obama’s opinion because there was a higher percentage of the democrats that approved of his opinions while republicans did not.…

    • 668 Words
    • 3 Pages
    Improved Essays
  • Decent Essays

    Unit 3 Maths Case Study

    • 183 Words
    • 1 Pages

    Problem I wonder if there is a relationship between the height of saplings and the diameter at the base of the sapling, based on the data sampled from the alluvial and marine terraces at Waitutu Forest, New Zealand from 2001-2008. My explanatory variable will be the base diameter, and the response variable will be the height of the sapling. Plan I will use NZGrapher to input the data and then create a scatter graph to display the data.…

    • 183 Words
    • 1 Pages
    Decent Essays
  • Decent Essays

    1. Z- test is used when the sample size is large enough n>30, the population standard deviation is known, and population is normally distributed. Example: Is the mean of systolic blood pressure level for women use of oral contraceptives is identical with women does not use of oral contraceptives, the sample mean > 30, we known the population standard deviation and it is normally distributed.…

    • 402 Words
    • 2 Pages
    Decent Essays
  • Superior Essays

    By conduction a two-was mixed group ANOVA 2 X (3), on the two factors time of day which can be broken down into a further 3 levels (morning, afternoon and evening) and gender which can also be broken down into 2 levels (male and female) on short term memory recall. This was conducted to test for differences between the two independent groups whilst exposing participants to repeated measures. When analysing variance there are a number of assumptions which need to be considered and applied. Firstly, it is required that participants are selected at random to adequately represent the target population. Secondly, the data must be measured at an interval or ratio level (Pallant, 2010).…

    • 806 Words
    • 4 Pages
    Superior Essays
  • Improved Essays

    1990 V. 2000 Case Study

    • 642 Words
    • 3 Pages

    in 2011, it will be found in the correlation coefficient, which is represented by “r”. The correlation coefficient for the data is r=0.815. The correlation is telling us the direction, strength, and linearity of the regression line to the data point. Based on the 0.815 is a positive number, the direction is positive and it can be seen in the picture of the graph (Figure 5). The strength of the regression line is really strong, meaning there is a high correlation.…

    • 642 Words
    • 3 Pages
    Improved Essays
  • Improved Essays

    Unit Of Analysis Essay

    • 398 Words
    • 2 Pages

    Cross-tabs Analysis The first variable I would use for cross-tab Analysis would be the variable of literacy. The second variable I would use is the variable of what country the individual is from. Correlation Coefficients i. For this scatterplot, there is a strong negative correlation between people who read and Fertility.…

    • 398 Words
    • 2 Pages
    Improved Essays