Stat 301 Lab 5: Due October 13 Fall 2014

advertisement
Stat 301
Lab 5: Due October 13
Fall 2014
1. Data on new vehicles for the 2014 model year are available from the Environmental
Protection Agency. A random sample of 200 vehicles was selected. Using these data
we wish to predict the CO2 emissions of the vehicles in city driving. The explanatory
variables are listed below. The JMP data set is posted on the course web site.
•
•
•
•
•
•
Engine – displacement of the engine in liters (Min = 1, Max = 6.8)
Cylinder – number of cylinders (Min = 3, Max = 12)
City MPG – Fuel economy in city driving (MPG) (Min = 11, Max = 40)
Gears – number of gears (Min = 1, Max = 9)
Intake – Number of intake valves per cylinder (Min = 1, Max = 2)
Exhaust – Number of exhaust valves per cylinder (Min = 1, Max = 2)
The data are on the class web page
http://www.public.iastate.edu/~wrstephe/stat301.html
a) Use JMP’s Fit Model to fit a multiple regression model with City CO2 as the
response variable and Engine, Cylinder, City MPG, Gears, Intake and Exhaust as
six explanatory variables. Use the JMP output to answer the following questions.
Attach the JMP output at the end of your lab assignment.
b) Give the multiple regression equation.
c) Give the value of the estimate of the error standard deviation, σ.
d) Give the value and an interpretation of R2.
e) What is the value of the adjusted R2?
f) Test the hypothesis H 0 : β1 = β 2 = β 3 = β 4 = β 5 = β 6 = 0 versus the alternative that
at least one slope parameter is not zero. What does this test tell you about the
relationship between the City CO2 and the six explanatory variables?
g) Look at the plot of residuals versus City MPG. What does this plot tell you about
the adequacy of the linear model?
h) Describe what you see in the analysis of residuals; histogram, box plot and normal
quantile plot. What does this indicate about the conditions necessary for multiple
regression analysis?
2. Use JMP’s Fit Model to fit the models necessary to answer the following questions.
Be sure to support your answers with the appropriate test statistic and P-value from
the appropriate JMP output. For all tests use α = 0.05 as your level of significance.
Attach the JMP output at the end of your lab assignment.
a) Is there a statistically significant relationship between City CO2 and the
displacement of the engine?
b) Does fuel economy in city driving add significantly to the model that already
contains the displacement of the engine? How much does adding fuel economy in
city driving to the model with engine displacement add to explaining variation in
City CO2? Give either the additional sum of squares or the change in R2.
1
c) Does the number of gears add significantly to the model that already contains fuel
economy in city driving? How much does adding the number of gears to the
model with fuel economy in city driving add to explaining variation in City CO2?
Give either the additional sum of squares or the change in R2.
d) Does the displacement of the engine add significantly to the model that already
contains the two variables number of cylinders and city fuel economy? How
much does adding displacement of the engine to the model with number of
cylinders and city fuel economy add to explaining variation in City CO2? Give
either the additional sum of squares or the change in R2.
e) Does displacement of the engine add significantly to the model that already
contains the three variables fuel economy in city driving, number of cylinders and
number of gears? How much does adding displacement of the engine to the
model that already contains the three variables add to explaining variation in City
CO2? Give either the additional sum of squares or the change in R2.
2
Download