Test 14B

advertisement
Test 14B
A P Statistics
The following information is used in questions 1–4.
A random sample of 80 companies from the Forbes 500 list was selected and the relationship between sales
(in hundreds of thousands of dollars) and profits (in hundreds of thousands of dollars) was investigated by
regression. A least-squares regression line was fit to the data using statistical software, with sales as the
explanatory variable and profits as the response variable. Here is the output from the software:
Dependent variable is Profits
R squares = 66.2%
s = 466.2 with 80 – 2 = 78 degrees of freedom
Variable
Constant
Sales
Coefficient s.e. of Coefficient
–176.644
61.16
0.092498
0.0075
P-value
0.0050
≤0.0001
1.
Using the above data, approximately what is the intercept of the least-squares regression line?
(a) 0.0925
(b)
0.0075
(c)
–176.64
(d) 61.16
(e)
None of the above. The answer is ___________________
2.
Using the above data, approximately what is a 90% confidence interval for the slope of the leastsquares regression line?
(a) 0.0925 ± 0.0075
(b)
0.0925 ± 0.012
(c)
–0.0925 ± 0.0075
(d) –0.0925 ± 0.012
(e)
None of the above. The answer is __________________
3.
Using the above data, what is the value of the t statistic for testing whether the slope of the leastsquares regression line is 0?
(a) 0.0075
(b)
0.082
(c)
0.092
(d) 12.73
(e)
None of the above. The answer is __________________
4.
Using the above data, is there strong evidence (and if so, why) of a straight line relationship between
sales and profits?
(a) Yes, because the slope of the least-squares line is positive.
(b) Yes, because the P-value for testing if the slope is 0 is quite small.
(c)
No, because the value of the square of the correlation is relatively small.
(d) It is impossible to say because we are not given the actual value of the correlation.
(e)
None of the above. The answer is
.
Questions 5–7 refer to the following situation:
Growth hormones are often used to increase the weight gain of chickens. In an experiment using 15
chickens, five different doses of growth hormone (0, .2, .4, .8, and 1.0 mg/kg) were injected into chickens
(three for each dose) and the subsequent weight gain was recorded. An experimenter plots the data and
finds that a linear relationship appears to hold. The output from SAS follows:
PARAMETER
INTERCEPT
DOSE
5.
6.
ESTIMATE
3.7816
4.0416
T FOR H0:
PARAMETER=0
3.23
2.85
PR > |T|
0.0066
0.0137
STD ERROR OF
ESTIMATE
1.1705
1.4195
The fitted regression line is:
(a) WtGain = 4.04 + 3.78 Hormone
(c) WtGain = 2.85 + 3.23 Hormone
(e) WtGain = 1.17 + 1.42 Hormone
(b)
(d)
WtGain = 3.23 + 2.85 Hormone
WtGain = 3.78 + 4.04 Hormone
A 95% confidence interval for the slope is:
(a) 4.04 ± 1.96 (1.42)
(c) 4.04 ± 2.16(1.42)
(e) 3.78 ± 2.16 (1.17)
(b)
(d)
4.04 ± 1.77 (1.42)
3.78 ± 1.77 (1.17)
7.
It is suspected that weight gain should increase with dose. An appropriate null and alternate hypothesis
to test the slope, the test statistic, and the p-value are:
(a) H0:  = 0, Ha:  ≠ 0, t = 2.85, p-value = .0069
(b) H0:  = 0, Ha:  ≠ 0, t = 3.23, p-value = .0066
(c) H0:  = 0, Ha:  > 0, t = 2.85, p-value = .0137
(d) H0:  = 0, Ha:  > 0, t = 3.23, p-value = .0033
(e) H0:  = 0, Ha:  > 0, t = 2.85, p-value = .0069
Free Response
Consider the following data on height (in inches) and number of siblings for a simple random sample of 10
female students at a large university.
Height
# of Siblings
64.2
2
65.4
0
64.6
2
66.1
6
65.1
3
65.5
1
1.
Make a scatterplot of the data. Does there appear to be
a linear relationship between height and # of siblings?
2.
a) Use your calculator to obtain the equation of
the least-squares regression line and the correlation.
67.2
2
66.4
2
63.3
0
61.7
1
b) Is the value of your correlation coefficient
consistent with your answer to #1.
3.
Explain in words what the slope  of the true
regression line says about height and # of siblings.
4.
What is the estimate of  from the data? What is your
estimate of the intercept  of the true regression line?
5.
Use your calculator to calculate the residuals. Report the sum of the residuals and the sum of the
squares of the residuals. Then use these results to estimate the standard deviation  in the regression
model and interpret it in the context of this problem.
6.
Calculate SEb and interpret it in the context of the problem.
7.
Carry out a two sided significance test to decide whether height is a good predictor for # of siblings.
a) Write the hypotheses.
b) Determine the test statistic, the degrees of freedom, and the P-value of t against the alternative.
c) Write your conclusions in plain language. Would you use height to predict the number of siblings a
woman has?
8.
Compute a 95% confidence interval for the slope ß of the true regression line.
Answers
Multiple choice:
1) c
2) b
3) d
4) b
5) d
6) c
7) e
Free Response:
1) Not very linear.
2) a) y = -25.934+.4285x, r = .396 b) Yes, since r is not close to 1.
3) On average, all women have .4285 more siblings for each inch of height.
4) the estimate for ß is b = .4285. the estimate for α is a = -25.934
5) sum of residuals = 0, sum of residuals squared = 22.6789, s = 1.6837 which tells us that, on average, the
actual number of siblings will differ from the predicted number of siblings by 1.6837 in.
6) SEb = 1.6837/√22.985 = .3512 which tells us that in repeated samples, the slope will differ from the
slope of the true (population) regression line by, on average, .3512 siblings per inch.
7) a) H0: ß = 0 (no association between height and # of siblings)
H0: ß ≠ 0 (there is an association between height and # of siblings)
b) df = 8, t = 1.22, P = .257
c) No, I would not use height to predict number of siblings. We have
evidence (P=.257) that there is no association between height and number of siblings.
8) .4285±2.306(.3512) = (-.381, 1.238) This means that the mean # of siblings increases by between -.381
and 1.238 for every inch of height for all women. Because this interval contains 0, this is consistent with
our answer to #7 that there is no linear relationship between these two variables.
Download