The predicted value is the value on the regression line. You can check it by hand, using y-hat = -8.09 + 11.3*2 = -8.09 + 22.6 = 14.51. (Minitab's answer differs slightly because Minitab used many more decimal places in the...) We define a residual to be the difference between the actual value and the predicted value (e = Y - Y', where Y' is the predicted value, written y-hat above). It seems reasonable to make the residuals as small as possible, and earlier in our example you saw that the mean of the residuals was zero. The criterion of least squares defines 'best' to mean that the sum of the squared residuals (the sum of e^2) is as small as possible.
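The hand check above can be reproduced in a few lines. This is only a sketch in Python, using the rounded coefficients quoted in the text (intercept -8.09, slope 11.3). The function names and the observed value of 16 are hypothetical, chosen purely for illustration.

```python
# Rounded coefficients quoted in the text; Minitab itself carries more decimals.
INTERCEPT = -8.09
SLOPE = 11.3


def predict(x):
    """Return the predicted value (y-hat) on the regression line at x."""
    return INTERCEPT + SLOPE * x


def residual(y_actual, x):
    """Return e = Y - Y': the actual value minus the predicted value."""
    return y_actual - predict(x)


y_hat = predict(2)
print(round(y_hat, 2))  # 14.51

# Hypothetical observation (not from the text): an actual Y of 16 at x = 2.
print(round(residual(16, 2), 2))  # 1.49
```

Because the coefficients are rounded, the result matches the hand calculation and not Minitab's slightly different figure.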

Generate the predicted y values (yhat) and the residuals in Stata. Graph the regression line, and plot the predicted values against the residuals. Also, correlate the independent variable with the residuals. Which assumptions are you testing (albeit in a very informal manner)? What conclusions do you draw about your model?
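The exercise asks for Stata, but the same quantities can be sketched in plain Python to show what is being computed. The data below are hypothetical, and the helper names (least_squares_fit, pearson_r) are assumptions made for this illustration, not part of any package.

```python
from statistics import mean


def least_squares_fit(xs, ys):
    """Return (intercept, slope) of the least-squares line through the data."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return intercept, slope


def pearson_r(us, vs):
    """Return the Pearson correlation between two equal-length lists."""
    u_bar, v_bar = mean(us), mean(vs)
    suv = sum((u - u_bar) * (v - v_bar) for u, v in zip(us, vs))
    suu = sum((u - u_bar) ** 2 for u in us)
    svv = sum((v - v_bar) ** 2 for v in vs)
    return suv / (suu * svv) ** 0.5


# Hypothetical data, invented for this sketch.
xs = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
ys = [4.1, 13.8, 26.5, 36.2, 49.9, 58.0]

intercept, slope = least_squares_fit(xs, ys)
y_hat = [intercept + slope * x for x in xs]        # predicted values
residuals = [y - yh for y, yh in zip(ys, y_hat)]   # e = Y - Y'

print(mean(residuals))           # zero up to rounding error
print(pearson_r(xs, residuals))  # zero up to rounding error
```

For a least-squares line with an intercept, the mean of the residuals and their correlation with the independent variable are zero by construction, so these two numbers mainly confirm the arithmetic. A plot of the residuals against the predicted values is where curvature or unequal spread would show up.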

PREDICTED Y is the height of the regression line directly above or below a data point (denoted by *'s in the scatterplot below). The residuals, which are computed by subtraction (RESIDUAL = Y - PREDICTED Y), tell us how far each point is above or below the line. Points above the regression line have a positive residual; points below the line have a negative residual.
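The sign rule can be illustrated with a short Python sketch. The line y = 2 + 3x and the three points are hypothetical, and residual_position is a name invented for this example.

```python
def residual_position(y_actual, y_predicted):
    """Say where a point lies relative to the line, from the residual's sign."""
    residual = y_actual - y_predicted  # RESIDUAL = Y - PREDICTED Y
    if residual > 0:
        return "above"
    if residual < 0:
        return "below"
    return "on"


def predicted_y(x):
    """Hypothetical regression line y = 2 + 3x, chosen only for illustration."""
    return 2 + 3 * x


# Hypothetical (x, y) points.
points = [(1, 6.0), (2, 7.5), (3, 11.0)]
for x, y in points:
    print(x, y, y - predicted_y(x), residual_position(y, predicted_y(x)))
```

The first point has a residual of 1.0 and lies above the line, the second has a residual of -0.5 and lies below it, and the third falls exactly on the line.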

