Interpreting Residual Plots

advertisement
Name________________________________________ Per # _____
NOTES/Practice
Math 8 ACC
Date _____________
Interpreting Residual Plots
The sum of the residuals provides an idea of the
degree of accuracy when using the least-squares line to make predictions.
The following guidelines apply only if you have established this is a “causal” correlation.
 The closer the sum is to zero the more reliable the prediction MAY be.
 The greater the sum of the residuals the less reliable the predictions MAY be.
Graph I: Original Scatter plot
 The x-values are the inputs from

the original scatter plot data.
The y-values are the outputs from
the original scatter plot data.
Graph II: Residual Plot (same data)
 The x-values are the inputs from the

original scatter plot data.
The y-values are the RESIDUALS
found by calculating the differences
between the original outputs and
predicted outputs.
Characteristics of a residual plot that implies there is a linear relationship are:
There are no odd patterns or curved trends in the plot. It actually looks a bit scattered.
The irony is that even though a scattered appearance implied no correlation in a scatter
plot, a scattered appearance in a residual plot means “there is” a linear relationship.
 The sum of the residuals is zero
 The points are equally represented above and below the x-axis.

Be Careful!!!!
Sometimes, a scatter plot can look linear, but it really is not. Notice that the output (y-values)
scales are in increments of 25. Because, of the large scale value, it is hard to see the deviations
in the line.
When the residual differences are plotted, the appearance that would imply a linear connection is
not present. We will learn in upcoming lessons, what type of relationship this really is.
Constructing Residual Plots
EXAMPLE: Recall , from the previous residual notes, that we made linear regression of
y = x + 9 for the following data.
x
y
Actual
y
Predicted
Residual
10
20
19
+1
15
19
24
-5
18
30
27
+3
20
29
29
0
25
31
34
-3
25
35
34
1
Sum of the Residuals→
-3
In the previous example we questioned if the -3 implied a good linear fit or not (What is
close enough to zero and what is not?)The residual plots would imply that there, in fact, is
a linear relationship, since the residual show no obvious pattern.
Even if the linear function we created is adequate, one should investigate to see if there is
one that is more accurate.
Practice: Construct a residual plot based on the date below.
Line of Best Fit: y = 0.5x
y
y
original
predicted
5
3
2.5
0.5
10
4
5
-1
15
9
7.5
1.5
20
7
10
-3
25
13
12.5
0.5
x
RESIDUALS
Does it imply that the data is linear or nonlinear? _________________________
Explain your answer.
Download