AP Statistics Linear Regression Name_____________________

advertisement
AP Statistics Linear Regression
Name_____________________ Hr____
1. The golf scores of 11 members of a women's golf team in two rounds of tournament play are
listed in the table below.
Round 1
89
90
87
95
86
81
105
83
88
91
79
Round 2
94
85
89
89
81
76
89
87
91
88
80
a) Plot the data with Round 1 as your independent (explanatory) variable. Do you think there
might be an influential observation? If so, list the point.
b) Find the regression equation first with the observation included, then without it. Is the point
influential?
Regression equation with point included:
Regression equation with point removed:
c) Make a residual plot. Is the data linearly related? Explain.
2. A study of nutrition in developing countries collected data from the Egyptian village of
Nahra. Listed below are the mean weights of 170 infants in Nahya who were weighed each
month during their first year of life.
Age (months)
1
2
3
4
5
6
7
8
9
10
11
12
Mean wt. (kg)
4.3
5.1
5.7
6.3
6.8
7.1
7.2
7.2
7.2
7.2
7.5
7.8
a) Plot the data. Find the regression equation and correlation coefficient (r).
b) Graph a residual plot. What does this plot tell you about the data?
3. Is the magnitude of an earthquake related to the depth below the surface of at which the quake
occurs?
Magnitude
Depth (km)
2.9
5.0
4.2
10.0
3.3
11.2
4.5
10.0
1.6
7.9
3.2
3.9
3.4
5.5
3.9
8.1
a) Sketch a scatterplot of the data. Find the correlation and the LSRL.
b) Does the data appear to have an influential point? Justify your answer.
c) Is the magnitude of an earthquake linearly related to the depth below the surface at which the
quake occurs? EXPLAIN.
4. A study compared the body weight (kg) of a child to his/her metabolic rate. Descriptive
statistics gives us the following results.
s x  6.568
x  22.5
y  5.888
s y  2.987
r  .984
a) Find the equation of the regression line.
b) Predict the metabolic rate of a child who weighs 17.5 kg.
5. Use the following data set to answer the questions.
x1
y1
1
2
3
1
4
6
7
5
x2
y2
2
1
1
3
a) Find the correlation and LSRL for each data set. What do you notice?
b) Add 5 to each number in x1 . What does this do to the correlation of the data?
c) Multiply each number in x1 by 5. What happens to the correlation?
6
4
5
7
6. After teaching AP Statistics for several years, Mrs. Sapp has found that the correlation, 𝑟,
between the students’ total number of points before the final exam and the number of points
scored on their final exam is 0.8. The pre-final-exam point totals for all students in this year’s
course have mean 280 and standard deviation 30. The points on the final exam have mean 75
and standard deviation 8. Mrs. Sapp’s dog ate Julie’s final exam, but Mrs. Sapp knows that her
total number of points before the exam was 300. She decides to predict Julie’s final exam score
from her pre-final-exam total. What score is Julie predicted to have?
7. The following are data gathered by anthropologists digging in Mayan ruins. X represents the
depth of the soil in centimeters and Y is the percent of the mineral montmorillonite in the soil.
Depth
a)
Make a scatterplot on your calculator and determine the
association (pos or neg) and describe it in a sentence.
40
50
60
70
80
90
100
b)
Find the equation of the least squares linear regression line.
c)
Describe the meaning of the slope of the line in a sentence.
d)
Find the correlation coefficient. What does it tell you about the data?
e)
What is 𝑟 2 and what does it tell you about the data?
f)
Make a residual plot on your calculator and interpret the plot.
%
montmorillonite
58
34
32
30
28
27
22
g)
Are there any outliers/influential points? If so, remove them and recalculate the
regression equation. What effect was there on the correlation and regression equation?
8. The following are data on various fast food hamburgers for various states.
X represents the grams of fat in the burger and Y is the number of calories.
a) Make a scatterplot on your calculator and determine the association
(pos or neg) and describe it in a sentence.
b) Find the equation of the least squares linear regression line.
Fat
19
31
34
35
39
39
43
Calories
410
580
590
570
640
680
660
c) Describe the meaning of the slope of the line in a sentence.
d) Find the correlation coefficient. What does it tell you about the data?
e) What is 𝑟 2 and what does it tell you about the data?
f) Make a residual plot on your calculator and interpret the plot.
g) Are there any outliers/influential points? If so, remove them and recalculate the
regression equation. What effect was there on the correlation and regression equation?
9. Which of the following is true of the least-squares regression line?
A)
The slope is the change in the response variable that would be predicted by a unit
change in the explanatory variable.
B)
It always passes through the point ( x , y ), where x and y are the means of the
explanatory and response variables, respectively.
C)
It will only pass through all the data points if r = ± 1.
D)
All of the above.
10. The following is a scatter plot of the calories and sodium
content of several brands of meat hot dogs. The least-squares
regression line has been drawn in on the plot.
Estimate the value of the residual for the point labeled 𝑥.
Are there any influential points? If so, what makes it
influential?
Download