Descriptive Analysis of RR% and GR%

1. Descriptive analysis of RR% and GR%
The mean is a measure of central tendency obtained by dividing the sum of the observed values by the number of observations, n. Individual data points can fall above, below, or exactly on the mean, and it is widely considered a good estimate for predicting subsequent data points. For the retention rate (RR%), the mean is 57.41, while for the graduation rate (GR%), the dependent variable, it is 41.76.
The minimum and maximum are the lowest and highest observed values. For the retention rate (RR%) they are 4 and 100 respectively, while for the dependent variable (GR%) they are 25 and 61. Both can be used to identify possible outliers or data entry errors. Comparing the minimum and maximum also gives a sense of the spread of the data: the range is 96 percentage points for RR% and 36 for GR%.
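The summary statistics described above can be computed in a few lines. The following is a minimal sketch in Python; the list `rr_sample` is a hypothetical set of retention rates made up for illustration and is not the data set analysed in this essay.

```python
from statistics import mean


def describe(values):
    """Return the mean, minimum, maximum and range of a list of numbers."""
    lowest = min(values)
    highest = max(values)
    return {
        "mean": mean(values),          # sum of the values divided by n
        "min": lowest,
        "max": highest,
        "range": highest - lowest,     # a simple measure of spread
    }


# Hypothetical retention rates (%), for illustration only.
rr_sample = [4, 51, 63, 78, 100]

summary = describe(rr_sample)
print(summary)
```

An unusually small minimum or large maximum in such a summary is a prompt to check the corresponding record for a data entry error before treating it as a genuine outlier.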
Scatter Diagram
In Figure 1, a straight line fitted through the plotted data shows that a positive linear relationship exists. The small scatter around the line indicates a strong linear correlation, and the line has a positive slope.
Figure 1: Scatter diagram showing strong positive linear relationship
3. The estimated linear equation is Yi = b0 + b1Xi + ei, where Yi is the value of the dependent variable for observation i, Xi is the value of the independent variable for observation i, ei is the random error, b0 is the estimate of the regression intercept and b1 is the estimate of the regression slope coefficient. Therefore: GRi = b0 + b1RRi + ei
4. The regression equation is therefore derived as: Graduation rate (%) = 25.423 + 0.2845 (Retention rate %), which is also the regression line fitted to the given data. The equation shows that the coefficient for the retention rate is 0.2845. This indicates that for every additional percentage point in the retention rate (the independent variable X), the model predicts an increase of 0.2845 percentage points in the graduation rate. In other words, for every 10 percentage point increase in RR, GR increases by an average of
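To make the fitted equation concrete, the sketch below shows how a least-squares intercept and slope can be computed and how the reported equation turns a retention rate into a predicted graduation rate. The function names `fit_simple_ols` and `predict_gr` and the small demonstration data set are assumptions introduced for illustration; only the coefficients 25.423 and 0.2845 come from the text.

```python
def fit_simple_ols(x, y):
    """Least-squares estimates (b0, b1) for the line y = b0 + b1 * x."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    # Slope: co-variation of x and y divided by the variation of x.
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    b1 = sxy / sxx
    # Intercept: forces the line through the point of means.
    b0 = y_bar - b1 * x_bar
    return b0, b1


# Coefficients reported in the text.
B0 = 25.423
B1 = 0.2845


def predict_gr(rr):
    """Predicted graduation rate (%) for a retention rate rr (%)."""
    return B0 + B1 * rr


# Hypothetical data lying exactly on y = 2 + 0.5x, used only to
# check that the fitting routine recovers a known line.
demo_b0, demo_b1 = fit_simple_ols([10, 20, 30, 40], [7, 12, 17, 22])
print(demo_b0, demo_b1)

# Prediction at RR = 50% and the effect of a 10-point increase in RR.
print(predict_gr(50))
print(predict_gr(60) - predict_gr(50))
```

Because the model is linear, the predicted change in GR for any change in RR is simply the slope multiplied by that change, regardless of the starting retention rate.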
R² is a measure of goodness of fit. It shows how much of the behaviour of Y (GR%) is explained by the behaviour of X (RR%). A value of 0.449 may be acceptable depending on the data being analysed, but it is quite low for these observations. From the summary output, R-squared is 0.449, or 44.9%, so it can be said that 44.9% of the variance in GR% is explained by the model. An R² of 0.449 means that the best-fit equation explains less than 50% of the variation in the data. It therefore indicates a somewhat "weak" relationship and hence not a good fit.
Figure 5: Good fit: Residual plots showing a random pattern
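The R² figure can be read as one minus the ratio of unexplained to total variation. The sketch below illustrates that calculation on a small hypothetical set of observed and predicted values (not the essay's data), and also shows the correlation coefficient implied by the reported R² of 0.449, assuming a simple linear regression with an intercept, where R² equals the square of the correlation.

```python
import math


def r_squared(observed, predicted):
    """Share of the variation in `observed` explained by `predicted`."""
    y_bar = sum(observed) / len(observed)
    # Unexplained variation: squared gaps between observed and predicted.
    ss_res = sum((y - p) ** 2 for y, p in zip(observed, predicted))
    # Total variation: squared gaps between observed values and their mean.
    ss_tot = sum((y - y_bar) ** 2 for y in observed)
    return 1 - ss_res / ss_tot


# Hypothetical values, for illustration only.
observed = [30, 40, 50, 60]
predicted = [32, 38, 52, 58]
demo_r2 = r_squared(observed, predicted)
print(demo_r2)

# Correlation implied by the R-squared reported in the text.
# The positive root is taken because the fitted slope is positive.
reported_r2 = 0.449
implied_r = math.sqrt(reported_r2)
print(round(implied_r, 3))
```

Note that an R² below 0.5 still corresponds to a correlation coefficient above 0.5, which is why R² and the correlation should not be read on the same scale.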

A key part of statistical modelling is examining the residuals. Residuals are the differences between the observed and predicted values. Looking carefully at them shows whether the model's assumptions are reasonable, which in turn determines whether the model itself is appropriate. The variation left unexplained by the fitted model can also help in decision making. In Table 2 below, no outliers are observed, as none of the standardized residuals exceeds 3 in absolute value.
Observation        Predicted GR (%)   Residuals       Standardized Residuals   Observed GR (%)
WIU                27.41458565        -2.414585651    -0.329782615             25
South University   39.93372978        -14.93372978    -2.039639575
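The outlier check described above can be sketched as follows, using only the two rows of the table that are visible here. The helper names `residual` and `flag_outliers` are assumptions introduced for illustration, and the threshold of 3 is the one stated in the text.

```python
# Rows copied from the visible part of the table.
table = [
    {"name": "WIU", "predicted": 27.41458565,
     "residual": -2.414585651, "std_residual": -0.329782615},
    {"name": "South University", "predicted": 39.93372978,
     "residual": -14.93372978, "std_residual": -2.039639575},
]


def residual(observed, predicted):
    """Residual = observed value minus predicted value."""
    return observed - predicted


def flag_outliers(rows, threshold=3.0):
    """Names of observations whose standardized residual exceeds
    the threshold in absolute value."""
    return [r["name"] for r in rows if abs(r["std_residual"]) > threshold]


# WIU has an observed GR of 25% in the table.
wiu_residual = residual(25, 27.41458565)
print(wiu_residual)

outliers = flag_outliers(table)
print(outliers)
```

A negative residual means the model over-predicts the graduation rate for that institution; both rows shown are over-predicted, but neither crosses the threshold of 3.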
