AP Stats – Minitab Name_________________________________ 1. After analyzing the salaries and batting averages of many baseball players in both leagues, a sports commentator concluded that higher-paid players will have higher batting averages. Let n = 10. a. Determine the least-squares regression line. b. What is the correlation coefficient? c. What percent of the variation in batting average can be explained by the least squares line? d. What is the estimated batting average for any player who is earning $875,000? e. Determine an estimate of the slope using a 90% confidence level. Interpret it. f. Use a 5% level of significance to test the sports commentator’s claim. Predictor Constant Annual Salary (x) S = 0.00845737 Coef 0.285630 0.00000004 R-Sq = 72.5% SE Coef 0.004830 0.00000001 T 59.14 4.59 P 0.000 0.002 R-Sq(adj) = 69.1% 2. Galton believed that a very close relationship exists between the heights of fathers and the heights of their sons. To test this claim, a scientist selected ten men at random and recorded their heights and the heights of their sons. a. Determine the least-squares regression line. b. What is the correlation coefficient? c. What percent of the variation in son’s height can be explained by the father’s height? d. What is the estimated height of a son whose father is 71 inches tall? e. Determine an estimate of the slope using a 90% confidence level. Interpret it. f. Use a 5% level of significance to test the sports commentator’s claim. Predictor Constant Dad Ht (in inches) S = 0.501934 Coef 12.023 0.84238 SE Coef 3.232 0.04658 R-Sq = 97.6% T 3.72 18.08 P 0.006 0.000 R-Sq(adj) = 97.3% 3. City officials believe that the number of complaints to the city’s heat complaint control board (for lack of heat) is negatively related to the outdoor temperature. Let n = 7. a. b. c. d. e. f. Determine the least-squares regression line. What is the correlation coefficient? What is the coefficient of determination? Interpret it What is the estimated number of complaints for 22˚F? Determine an estimate of the slope using a 98% confidence level. Interpret it. Use a 4% level of significance to test the sports commentator’s claim. Predictor Constant Temp (in F) S = 2.22807 Coef 125.536 -2.26429 SE Coef 1.518 0.08421 R-Sq = 99.3% T 82.69 -26.89 P 0.000 0.000 R-Sq(adj) = 99.2% 4. Is there a linear relationship between the calorie contents and the fat contents (in grams) of 6- to 8-ounce servigs for 10 hot chocolate products. Use a 5% significance level. Calories Fat 262 6.3 140 2 150 3.5 159 3.5 120 2.5 140 3.5 185 6.8 150 3 80 3 80 2.5 5. Use a 90% confidence interval to estimate the slope for x = the number of wins and y = the earned run averages for eight professional baseball pitchers in the 2009 regular season. Wins Earned Run Avg. 19 2.63 17 2.79 16 3.75 15 3.23 15 3.47 14 3.96 12 4.05 9 4.12 6. Six students were asked to indicate how many hours they studied before taking their statistics examination. Their responses were then matched with their grades on the exam which had a maximum score of 100. y y 2 438.86 y y 2 45.11 x 6.75 x 2 8.6875 a. What proportion of observed variation in scores can be attributed to the linear relationship between number of study hours and test scores? b. Calculate the typical deviation from the least squares line and the standard error of the slope.