Warm-up Turn in HW – Ch 8 Worksheet

advertisement
Warm-up
O Turn in HW – Ch 8 Worksheet
O Complete the warm-up that you picked up by the
door. (you have 10 minutes)
Objective
O Define and create Residual Plots. (By hand
and in the calculator.
O Use Residual Plots to determine if using a
linear model is appropriate.
O Define and calculate R2(Coefficient of
Determination).
O Use R2 to explain how much of the variation
is accounted for by the model.
Residuals
O The difference between an observed value
of response variable and value predicted by
the regression line..
residual  y  yˆ
O
e represents residual
O 𝑦 represents the predicted response value
O y
represents the actual
response value
ŷ
Residuals
o Negative residual
means the model
OVER PREDICTS the
y value.
o Positive residual
means the model
UNDER PREDICTS
the y value.
Residual Plots
O A scatterplot of the residuals against the
explanatory variable.
O Help us assess how well a regression line fits
the data.
O Should show no obvious pattern.
O Should be relatively small in size
Residual Plot Practice
O Do the first page of the Worksheet
Residual Plot (calculator)
O Enter x values in L1 and y values in L2.
O Scroll to put cursor on L3 . Press 2nd ,STAT,
Enter, 1. (RESID) This calculates the
residuals and puts them in L3 .
O Go to STAT PLOT. Turn on Scatterplot. Pick L1
for X list, and L3 (RESID) for Y list. ZOOM 9
Residual Plot Practice
O Go back to your worksheet. Do # 4 using
your calculator to create the scatterplot.
Standard Deviation of
Residuals
O Give the approximate size of a “typical” or
“average” prediction error.
O𝑠 =
𝑟𝑒𝑠𝑖𝑑𝑢𝑎𝑙𝑠 2
𝑛−2
O This can be found on the calculator by using
STAT-CALC-1-Var Stats for the Residuals and
looking up standard deviation.
Coefficient of Determination
O R2 is the fraction of the variation in the
values of y that is accounted for by the LSRL
of y on x.
Interpreting R2
O “ __(R2)_% of the variation in _(response
variable)_ is accounted for by the linear
model relating _(response variable) to
_(Explanatory variable) .
O Example: From the Roller Coaster warm-up,
where 𝑑𝑢𝑟𝑎𝑡𝑖𝑜𝑛 = 91.033 + 0.242(𝑑𝑟𝑜𝑝)
O If we calculate that R2 is .82 we would
interpret that by saying “82% of the variation
in duration of the ride is accounted for by
the linear model relating duration to initial
drop.”
Is the linear model
appropriate?
O Scatter plot must meet the “Straight enough
condition”
O Correlation Coefficient- 𝑟 𝑐𝑙𝑜𝑠𝑒 𝑡𝑜 1.
O Residual Plot – random & not too far from
the line.
Download