hwk_6 - Homework Market

advertisement
HWK 6
1) Analyze the residual plot below and identify which, if any of the conditions for an adequate
linear model is not met.
2
0
-2 5
15
25
(I cannot make the plots but they are below and above the middle line. There is one dot in conjunction
of -2 and 5;one dot on the line of 15 and one on the above 25 located high of both previous. All other
plots were everywhere that is how I can explain it)
Which of the conditions for an adequate linear model is not met?
A-Patterned residuals
B-None
C-Constant error variance
D-Outlier
2) Consider the data set given in the accompanying table. Complete parts (a through g)
X
-2
-1
0
1
2
Y
-2
0
2
3
5
a) Draw a scatter diagram treating x as the explanatory variable and y as the response variable.
Choose the correct graph below.(please just make the correct draw and I will figure it out)
b) Fine the equation of the line containing the points (-2,-2) and (2,5). The equation of the line
is y = (…….)x + (……).
c) Graph the line found in part (b) on the scatter diagram. Choose the correct graph below
(Please make the right graph then I will figure it out).
d) Determine the least-squares regression line. Choose the correct answer.
A-the least-squares regression line is y = 1.5x + 1.75.
B-the least-squares regression line is y =1.7x + 1.75.
C-the least-squares regression line is y =1.7x + 1.6.
D-the least-squares regression line is y =1.6x + 1.7.
e) Graph the least-squares regression line on the scatter diagram. Choose the correct graph
below.(Please graph the least squares line and I will figure it out).
f) Compute the sum of the squared residuals for the line found in part (b). The sum of the
squared residuals for the line founds in the part (b) is (…..)
g) Compute the sum of the squared residuals for the least-squares regression line found in part
(d). The sum of the squared residuals for least squares regression line is (……).
3) For the data set shown below, do the following. (a) Compute the standard error, the point
estimate for σ. (b) Assuming the residuals are normally distributed, determine Sb1. (c) Assuming
the residuals are normally distributed, test H0: β1=0 verses H1: β1≠0 at he a=0.05 level of
significance.
X
20
30
40
50
60
Y
100
93
91
85
72
a) Determine the point estimate for σ. Se = (…….) (do not round until the final answer then
round to four decimal places as needed).
b) Find the sample standard error of b1. Sb1 = (……)(Use the answer from part (a) to find this
answer. Round to four decimal places as needed.)
c) Which of the following conclusions is correct?
A-Do not reject H0 and conclude that a linear relation exists between x and y.
B-Do not reject H0 and conclude that a linear relation does not exist between x and y.
C-Reject H0 and conclude that a linear relation does not exist between x and y.
D-Reject H0 and conclude that a linear relation exists between x and y.
4) Match the linear correlation coefficient to the scatter diagram. The scales on the x- and y- axis
are the same for each scatter diagram.
(a) r= -0.969, (b) r= -1, (c) r= -0.049.
R
R
Reponse
I
Exp
few dots between the 2 lines;
II
Exp
many dots over the place in 2 lines;
III Explanatory
dots on the straight line.
A-Scatter diagram ( Ii/ I/ III)
B-Scatter diagram (II/ I/ III)
C-Scatter diagram (I/III/II)
5) A scatter diagram is shown to the right with one of the points drawn in blue. In addition, two
least squares regression lines are drawn. The line drawn in red is the least squares regression
line with the point in blue excluded. The line drawn in blue is the least squares regression line
with the point in blue included. On the basis of these graphs, do you think the point in the blue
is influential?
20
red line
response
15
10
blue line
5
10
15
Explanatory
(There are 5 dots above the blue and red line and 5 below red and blue line and three on the red and
blue)
Does the point in the blue seem to be influential?
A-No
B-Yes
6) Because colas tend to replace healthier beverages and colas contain caffeine and phosphoric
acid, researcher wanted to know weather consumption of colas is associated with lower bone
mineral density in women. The data shown in the accompanying table represent the typical
number of cans of soda consumed in a week and the bone mineral density of the femoral neck
for a sample of 15 women. The data were collected through a prospective cohort study.
Complete parts (a) through (f).
Number of
Colas per week
0
0
1
1
2
2
3
3
4
5
5
6
7
7
8
Bone mineral
density (g/cm2)
0.897
0.884
0.891
0.877
0.888
0.871
0.868
0.876
0.873
0.875
0.871
0.867
0.862
0.872
0.865
a) Find the leas squares regression line treating cola consumption per week as the explanatory
variable. Choose the correct answer below.
A-The least squares regression line is ŷ= -0.0030x + 0.8866.
B- The least squares regression line is ŷ= 0.8866x – 0.0030.
C- The least squares regression line is ŷ= 0.0030x - 0.8866.
D- The least squares regression line is ŷ= -0.8866x + 0.0030.
b) Interpret the slope.
For each additional cola consumed per week, bone mineral density will (increase/decrease)
by (0.0030; 1.0053; 0.5423; 0.8866) g/cm2, on average.
c) Interpret the intercept. Choose the correct answer below.
A-For each additional cola consumed per week, bone mineral density will decrease by
0.0030g/cm2, on average.
B-For a woman who does not drink cola, bone mineral density will be 0.030 g/cm2.
C-it is not appropriate to interpret the y-intercept. It is outside the scope of the model.
D-For a woman who does not drink cola, bone mineral density will be 0.8866 g/cm2.
d) Predict the bone mineral density of the femoral neck of a woman who consumes four cola
per week.
The Predict value of the bone mineral density of the femoral neck of this woman is (……)
g/cm2. (Round to four decimal places as needed)
e) The researcher found a woman who consumes of four colas per week to have a bone
mineral density of 0.873 g/cm2. Is this woman’s bone mineral density above or below
average among all women who consume four cola per week?
A-Above average
B-Below average
f) Would you recommend using the model found in part (a) to predict the bone mineral
density of a woman who consumes two cans of cola per day?
A- No
B- Yes
7) The time it takes for a planet to complete its orbit around a particular star is called a
planet’s sidereal year. The sidereal year of the planet is related to the distance the planet is from
the star. The companying data show the distances of the planets from a particular star and there
sidereal years. Complete part (a) through (e).
Planet
Distance from the star
Sidereal year y,
X, millions of miles
Planet1
36
0.24
Planet2
67
0.62
Planet3
93
1.00
Planet4
142
1.86
Planet5
483
11.8
Planet6
887
29.3
Planet7
1785
82.0
Planet8
2797
163.0
Planet9
3675
248.0
a) Draw a scatter data of the diagram treating distance from the star as the explanatory
variable.
b) Determine the correlation between distance and sidereal year. The correlation between
distance and sidereal year is (…..). (Round to three decimal places as needed).
-Does this imply a linear relation between distance and sidereal year?
A-No
B-Yes
c) Compute the least squares regression line. Choose the correct answer below.
A- y= -0.0654x + 12.6256
B- y= 0.0654x + 12.6256
C- y= 0.0654x – 12.6256
D- y= -0.0654x + 132.1412
d) Plot the residuals against the distance from the star.
e) Do you think the least squares regression is a good model?
A-No
B-Yes
8) For the following data set (a) Draw a scatter diagram, (b) by hands compute the correlation
coefficient, and (c) Comment on the type of relation that appears to exist between x and y.
X
1 4
8
8 9
Y
1.3 1.8 2.3 2.2 2.6
(b) r= (…..) (Round to four decimal place as needed)
(c) What type of relation appears to exist between x and y?
A- There appears to be a positive linear association
B- There is a little or no evidence of a linear association
C- There appears to be a negative linear association.
9) The data in the accompanying table represent the rate of return of a certain company stock for
11 months, compare with the rate of return of a certain index of 500 stocks. Both are in percent.
Complete parts (a) through (b)
Month
Rate of return of the index, x
Rate of return of company stock, y
Apr 07
4.23
3.38
May07
3.25
5.09
Jun 07
-1.78
0.54
Jul 07
-3.20
2.88
Aug07
1.29
2.69
Sep 07
3.58
7.41
Oct 07
1.48
-4.83
Nov 07
-4.40
-2.38
Dec 07
-0.86
2.37
Jan 08
-6.12
-4.27
Feb 08
-3.48
-3.77
a) Treating the rate of return of the index as the explanatory variable, x, determine the
estimates of β0 and β1.
The estimate of β0 is (…..)(Round to four decimal places as needed)
The estimate of β1 is (…..)(Round to four decimal places as needed)
b) Compute the standard error of the estimate. The standard error of the estimate is (….)(
Round to four decimal places as needed).
c) Determine whether the residuals are normally distributed. Choose the correct answer
below
A-The residuals are normally distributed
B-The residuals are not normally distributed
d) If the residuals are normally distributed, determine Sb1. Choose the correct answer
below.
A-Sb1=0.2649
B-Sb1= 0.2884
C-Sb1= 0.4115
D-The residuals are not normally distributed.
e) If the residuals are normally distributed, test weather a linear relation exists between the
rate of return of the index, x, and the rate of return for the company stock, y at the a=
0.1level of significant. State the appropriate conclusion. Choose the correct answer
below.
A-There is a sufficient evidence to conclude that a linear relation exist between the rate
of return of the index and the rate of return of the company.
B-The residuals are not normally distributed.
C-There is not sufficient evidence to conclude that a linear relation exist between the rate
of the index and the rate of return of the company.
f) If the residuals are normally distributed, construct a 90% confidence interval for the slop
of the true last squares regression line. Choose the correct answer below.
A-The residuals are not normally distributed
B-The 90% confidence interval is (0.2072, 1.2986)
C-The 90% confidence interval is (0.2072, 1.4546)
D-The 90% confidence interval is (0.2412, 1.2986)
g) What is the mean rate of return for the company stock if the rate of return of the index is
3.15%?
The mean rate of return for the company stock if the rate of return of the index is 3.15%
is (…..) (Round to three decimal places as needed)
10) Supposed that the response variable y is related to the explanatory variable x1 and x2 by the
regression equation parts (a) through (f)
Y= 2+0.5x1 – 1.4x2
(a) Construct a graph showing the relationship between the expected value of y and x1 for x2
= 10, 20, and 30.
(b) Construct a graph showing the relationship between the expected value of y and x2 for
x1= 40, 50, and 60.
(c) How can we tell from the graphs alone that there is on interaction between x1 and x2?
A-The lines are not parallel.
B-The line are parallel
C-The slopes of the lines are positive
D-The slops of the line are negative
(d) Construct a graph showing the relationship between the expected value y and x1 for x2 =
10, 20, and 30 with the interaction term 0.05x1x2 added to the regression equation.
(e) Construct a graph showing a relationship between the expected value of y and x2 for x1 =
40, 50, and 60 with the interaction term 0.05x1x2 added to the regression equation.
(f) How do the graphs from parts (d) and (e) differ from the graphs in parts (a) and (b)?
A-The lines are now parallel in the graphs from parts (d) and (e)
B-The y intercepts of the lines are now equal in the graphs from parts (d) and (e)
C-The slopes of the lines are now negative in the graphs from parts (d) and (e)
D-The lines are no longer parallel in the graphs from parts (d) and (e)
11) Researchers initiated a long term study of population America black bears. One aspect of the
study was to develop a model that could be used to predict a bear’s weight (since it is not
practical to weight bears in the field). One variable thought to be related to weigh is the length of
the bear. The accompanying data represent the lengths and weights of 12 American black bears.
Complete part (a) through (d)
Table length cm
Weight (kg)
138.0
110
138.0
60
130.0
90
120.5
60
149.0
95
141.0
105
141.0
110
150.0
85
166.0
155
151.5
140
129.5
105
150.0
110
(a) Which variable is explanatory variable based on the goals of the research?
A-The number of bears
B-The weight of the bear
C-The length of the bear
(b) Draw a scatter diagram of the data.
(c) Determine the linear correlation coefficient between weight and height
The linear correlation coefficient between weight and height is r= (….) (Round to three decimal
places as needed)
(d) Does a linear relation exist between the weight of the bear and its height?
A- No, there is no linear relation between the weight of the bear and it height.
B- Yes, there is a negative linear relation between the weight of the bear and its height
C- Yes, there is a positive linear relation between the weight of the bear and its height
12) The multiple regression equation y = - 5 – x1 + 8x2 is obtained from a set of sample data.
Complete parts (a) through (e).
(a) Interpret the slope coefficient for x1. Choose the correct answer below
A-The slope coefficient of x1 is -1. This indicates that y will decrease 1 unit for everyone unit
increase in x1, x2 remain constant.
B- The slope coefficient of x1 is -5. This indicates that y will decrease 5 unit for everyone unit
increase in x1, x2 remain constant.
C- The slope coefficient of x1 is -1. This indicates that y will increase 1 unit for everyone unit
increase in x1, x2 remain constant.
Interpret the slope coefficient for x2. Choose the correct answer below.
A- The slope coefficient of x2 is 8. This indicates that y will increase 5 units, for everyone
unit increase in x2, x1 remains constant.
B- The slope coefficient of x1 is -5. This indicates that y will increase 1/8 units, for everyone
unit increase in x1, x2 remains constant.
C- The slope coefficient of x2 is 8. This indicates that y will increase 8 units, for everyone
unit increase in x2, x1 remains constant.
(b) Determine the regression equation with x1 = 10. Choose the correct answer below.
A- y= -5 + 8x2
B- y= -15 + 8x2
C-y= -5 +8x1
D-y= 30 + 8x2
Graph the regression equation with x1 = 10
(c) Determine the regression equation with x1 = 15
A- y= -20 + 8x2
B- y= -5 + 8x1
C-y= 32 + 8x2
D-y= -5 + 8x2
Graph the regression equation with x1 = 15.
(d) Determine the regression equation with x1 = 20
A-y= -5 + 8x2
B-y= 45+ 8x2
C-y= -5 + 8x1
D-y= -25 + 8x2
Graph the regression equation with x1 = 20.
(e) What is the effect of changing the value x1 on the graph of the regression equation?
A- No changes
B- Changes in the slope
C-Changes in the y interception
Download