252y0312

advertisement
252y0312 3/10/03
ECO252 QBA2
FIRST HOUR EXAM
February 20, 21 2003
Name ______Key__________
Hour of class registered _____
Class attended if different ____
Show your work! Make Diagrams!
I. (8 points) Do all the following.
x ~ N 11,3
16  11 

1. Px  16   P z 
  Pz  1.67   Pz  0  P0  z  1.67   .5  .4525  .0475
3 

16  11 
 11  11
z
  P0  z  1.67   .4525
3 
 3
2. P11  x  16   P
3. F 10  (The cumulative probability up to 10) F 10   Px  10 
10  11 

 P z 
  Pz  0.33  Pz  0  P 0.33  z  0  .5  .1293  .3707
3 

4.
x.125 This is a value of x with .125 above it. We first find the correct value for z . Make a diagram
showing a Normal curve centered at zero. The probability below zero is .5. The probability above z .125 is
.1250. So the probability between zero and z .125 is .3750. P0  z  z.125   .3750 . The closest we can
come is P0  z  1.15   .3749 . So z.125  1.15 and x    z  11  1.153  14 .45.
14 .45  11 

Check:  Px  14.45   P z 
  Pz  1.15   Pz  0  P0  z  1.15   .5  .3749  .1251
3


Note that if you made diagrams for x the mean is 11 but if you made diagrams for z the mean is
zero. I’m very annoyed that most of you knew that problem 4 above would be on the exam but failed
to prepare for it.
252y0312 3/10/03
II. (6 points-2 point penalty for not trying part a.)
A random sample is taken of the endowments of private colleges in the US. The following data is
found.
College
1
2
3
4
Endowment
($Millions)
60
47
235
3909
a. Compute the sample standard deviation, s , of the endowments. Show your work! (3)
b. Compute a 99% confidence interval for the mean endowment,  .(3)
Solution: a)
Index
x
x2
1
47
2209
2
60
3600
3
235
55225
4
3909
15280281
Total
4251
15341315
x 4251
x
 1062 .75 , s 
n  4, x   
15341315  41062 .75 2
n
4
n 1
3
s
1899 .354
10823564 .75

 949 .718

 3607854 .92 . So s  3607854.92  1899.354 and s x 
3
n
4
b) The formula for a confidence interval is   x  t n1 s .   .01, so use t 3  5.841. The confidence
2
2
 nx 2

2

x
.005
interval is   1062 .75  5.841949 .718   1062 .75  5547 .30. Make a diagram including the upper and
lower limits for the mean or note that P4485    6610   .99 .
2
252y0312 3/10/03
III. Do all of the following Problems (17 points) Show your work except in multiple choice questions.
1. Which of the following would be an appropriate null hypothesis (1)?
a) The population proportion is less than 0.65.
b) The sample proportion is less than 0.65.
c) *The population proportion is no less than 0.65.
d) The sample proportion is no less than 0.65.
Note that a hypothesis (i) contains equalities and (ii) deals with a population.
Question: How many of you looked at ‘Things You Should Never Do on an Exam or Anywhere Else’
before this exam? It should alert you to problems like the first two here.
TABLE 9-1
Microsoft Excel was used on a set of data involving the number of parasites
found on 46 Monarch butterflies captured in Pismo Beach State Park. A
biologist wants to know if the mean number of parasites per butterfly is
over 21. She will make her decision using a test with a level of 0.10. The
following information was extracted from the Microsoft Excel output for the
sample of 46 Monarch butterflies:
n = 46; Arithmetic Mean = 29.00; Standard Deviation = 25.92; Standard
Error = 3.82;
Null Hypothesis: H 0 :   21 .000 ;  = 0.10; df = 45; T Test Statistic =
2.09;
One-Tailed Test Upper Critical Value = 1.301; p-value = 0.021; Decision
= Reject.
2.
Referring to Table 9-1, the parameter the biologist is interested in is (1):
a) the mean number of butterflies in Pismo Beach State Park.
b) the mean number of parasites on these 46 butterflies.
c) *the mean number of parasites on Monarch butterflies in Pismo Beach State Park.
d) the proportion of butterflies with parasites.
Note that a parameter refers to a population.
3.
Referring to Table 9-1, Mark the following true or false (3):
a) The null hypothesis would be rejected if a 5% probability of committing a Type I error is
allowed. T
b) The null hypothesis would be rejected if a 2% probability of committing a Type I error is
allowed. F
c) The null hypothesis would be rejected if a 1% probability of committing a Type I error is
allowed. F
Repeat after me: If a p-value is below the significance level, reject the null hypothesis.
4. Referring to Table 9-1, state the alternate hypothesis for this study (1).
Answer: H1 :   21 .000
3
252y0312 3/10/03
5. The marketing manager for an automobile manufacturer is interested
in determining the proportion of new compact-car owners who would
have purchased a passenger-side inflatable air bag if it had been
available for an additional cost of $290. The manager believes from
previous information that the proportion is 0.30. Suppose that a
survey of 200 new compact-car owners is selected and 76 indicate that
they would have purchased the inflatable air bags.
If you were to conduct a test to determine whether there is evidence that the proportion is different
from 0.30, which test would you use? (1)
2
a)  -test of population variance.
b) z-test of a population mean
c) *z-test of a population proportion
d) t-test of population mean
6. In problem 5, using a 5% significance level when appropriate:
H : p  .30
a) State the null and alternative hypotheses (2) Solution:  0
H 1 : p  .30
b) Do the problem using a test ratio (3) Solution: From the formula table we have:
Interval for
Confidence
Hypotheses
Test Ratio
Critical Value
Interval
Proportion
p  p0
p  p  z 2 s p
pcv  p0  z 2  p
H 0 : p  p0
z

H1 : p  p0
p
pq
p0 q0
sp 
p 
n
n
q  1 p
q  1 p
0
  .05 . n  200 .  p 
z
p  p0
p

0
p0 q0
x
76
.30 .70 
 .380 .

 .00105  .02340 . p  
n 200
n
200
.380  .30
 3.42 . This is a 2-sided test. Make a diagram. Show a Normal curve with a mean
.02340
at zero and rejection zones (shaded) above z .025  1.960 and below z .025  1.960 . Since your value of
z is above 1.960, reject H 0 .
c) Find a p-value for the test ratio (1) Solution: pval  2Pz  3.42 
 2Pz  0  P0  z  3.42   2.5  .4997   .0006 . Note that a p-value must be between zero and
one.
pq
.380 .620 

d) Do the problem using a confidence interval (2) Solution: s p 
n
200
 .001178  .03432. p  p  z 2 s p  .380  1.960.03432  .380  .067 , or .313 to .447. Since p 0  .30
is not in this interval, reject H 0 .
e) Do the problem using a critical value for a proportion (2) Solution: pcv  p0  z  p ,
2
 p  .02340 p cv  .30  1.960 .02340   .30  .046 , or .253 to .346. Since p  .380. is not in this interval,
reject H 0 . Or Make a diagram. Show a Normal curve with a mean at .30 and rejection zones (shaded)
above .346 and below .253. Since p  .395. is above .253, reject H 0 .
f) (Extra credit) Remember what you did on page 1 and do e) again using a 25%
significance level (2) Solution: pcv  p0  z  p .   .25 . z  z.125  1.15 .
2
2
p cv  .30  1.15.02340   .30  .027 , or .273 to .327. We still reject H 0 .
4
252y0312 3/10/03
7. We believe that the standard deviation for household income per year in Hooverville is $2900. We
take a sample of n  150 households and find a sample standard deviation of $2600.
a) Test the hypothesis that the population standard deviation is $2900 assuming that the
underlying distribution is Normal. (2)
b) (Extra credit) Do a confidence interval for the population standard deviation.(2)
c) (Extra credit) Test the hypothesis that the population standard deviation is less than
$2900. (2)
Solution: From the formula table (but the outline is better)
Interval for
Confidence
Hypotheses
Test Ratio
Critical Value
Interval
VarianceH 0 :  2   02
n  1s 2
n  1s 2
n  1s 2
2
2
2






Small Sample
cv
 02
 .25.5 2 
 .25 .5 2 
H1: :  2   02
s 2DF 
z 
VarianceH 0 :  2   02
 
Large Sample
 z  2DF 
2  2  2DF   1
H1 :  2   02
2
We use the lower group of formulas because of the large value of n  150 . Assume   .05 .
 H 0 :   2900
n  1s 2  149 2600 2  119 .7669 .
a) Only the test ratio method is normally used. 
2 
 02
2900 2
 H 1 :   2900
DF  n  1  149 . z  2  2  2 DF  1  2119 .7669   2149   1  15 .4768  17 .2337  1.757 .
Make a diagram. Show a Normal curve with a mean at zero and rejection zones (shaded) above
z .025  1.960 and below z .025  1.960 . Since your value of z is not below -1.960, do not reject H 0 .
b)
2 DF  2149   17 .2627

s 2 DF
 z  2  2 DF

2600 17 .2627 
. Since
 1.960  17 .2627
2600 17 .2627 
2600 17 .2627 
 2334 .9 and
 2933 .0 , we can say 2334 .9    2933 .0 .
 1.960  17 .2627
 1.960  17 .2627
 H 0 :   2900
c) Oh, let’s go for broke. I’ll do it 3 ways. 
OK, I’ve never seen it done 3 ways before.
 H 1 :   2900
Test ratio method: Use the value of z that you computed above. Small values of s result in small values
of z , so make a diagram. Show a Normal curve with a mean at zero and a rejection zones (shaded) below
z .05  1.645 . Since your value of z is below -1.645, reject H 0 .
2

Critical value method:  cv
s reversed. s cv 
n  1s 2
 .25.5 2 
 2 DF
 z  2  2 DF
won’t work, so try the confidence interval formula with  and
We want a lower critical value, so use s cv 
 2 DF
 z   2 DF
2900 17 .2627 
 2648 .3 Make a diagram. Show a Normal curve with a mean at 2900 and a rejection
1.645  17 .2627
zones (shaded) below 2648.3. Since your value of s is below 2648.3, reject H 0 .
5
252y0312 3/10/03
 H :   2900
s 2 DF
Confidence interval method:  0
We have  
. If we look at the alternate
H
:


2900
 z  2  2 DF
 1
hypothesis, we want a confidence interval that gives an upper limit to  .

s 2 DF
 z  2  2 DF

2600 17 .2627 
 2873 .9 . Since it is impossible that   2873 .9 and   2900
 1.645  17 .2627
are both true, reject H 0 .
It’s remarkable how many of you decided that the H 0 in the proportions problem was   .30 and in the
standard deviation problem was   3000 . Were you asleep?
6
252y0312 3/10/03
ECO252 QBA2
FIRST EXAM
February 20, 21 2003
TAKE HOME SECTION
Name: _________________________
Social Security Number: _________________________
IV. Do the first two problems (at least 10 each) (or do sections adding to at least 20 points - Anything extra
you do helps, and grades wrap around) . Show your work! State H 0 and H 1 where appropriate. You
have not done a hypothesis test unless you have stated your hypotheses, run the numbers and stated
your conclusion. Use a 95% confidence level unless another level is specified.
1. (Kazmier) You want to be sure that the set-up time for new equipment is not more than 9 minutes for
each hour of operation. From a random sample of 40 hours selected from the records of another company
that has already bought the equipment, we find a sample mean of 10.09 minutes. Assume a population
standard deviation of 3.00 minutes. Use a 99% confidence level.
a) State your null and alternative hypotheses. (1)
b) Find a critical value for the sample mean and specify where your ‘reject’ region is (a diagram
is suggested) (1)
c) Do you reject the null hypothesis? Show why. (1)
d) Create a power curve for this test. (6)
e) Do a 2-sided confidence interval for the mean (1)
f) (Extra credit) It is possible to determine the required sample size in a one-sided test for the
mean given certain levels of the probability of type I and a type II error. If  0 is the
population mean from the null hypothesis and we want to be able to keep the probability of a
type II error to  when the mean is actually 1 , use the following sample size:
n
z  z  2  2
1   0 2
. Assume that   .01 ,   .05 and that 1  9.1. Show that we
will need a much larger sample size than was proposed. (3)
Solution:
a) Our hypotheses were H 0 :   9 and H1 :   9 . The problem said n  40 , x  10 .09 ,   3.00 and
  .01 , we can use z  z.01  2.327 .
b) From the formula table we have:
Interval for
Confidence
Interval
Mean (
  x  z 2  x
known)
Hypotheses
Test Ratio
H0 :   0
z
H1 :    0
x  0
x
Critical Value
xcv    z  2  x
This is a one-sided test, so we are only worried about values of the sample mean above 9 . We use

3.00
x 

 0.4743 . Because of the alternate hypothesis, we wanted a critical value above 9, so we
n
40
used xcv    z  x  9  2.327 0.4743   10.10.
c) Make a diagram with a Normal curve centered at 9 and a reject zone above 10.10. Since x  10.09 is
not in the ‘reject’ zone, we do not reject H 0 . Lots of you said not to reject H 0 because xcv  10.10 is
above 9. Given H1 :   9 , the critical value will be above  0 whether H 0 is true or false.
7
252y0312 3/10/03
d) Half of the distance between 9 and 10.10 is 0.55, so I add this to  0  9 four times and find the power at
1  9, 1  9.55, 1  10 .10, 1  10 .65, and 1  11 .20 . Remember that Power  1   , and


that   P Not rejecting H 0 H 0 is false . We do not reject H 0 if x is less than or equal to the critical

x  1 
10 .10  1 

value of 10.10, so   Px  10 .10   1   P  z  cv
We get the
.
  Pz 
0.4743 
x 


following:
 10 .10  9 
1  9 :   P z 
  Pz  2.32  .5  .4898  .99 . power  1    .01. You don’t have to
0.4743 

do this one since you learned in class that if 1   0 ,   1   and power   .
1  9.55 :

  Pz 

10 .10  9.55 
  Pz  1.16  .5  .3770  .8770 . power  1    .1230
0.4743 
 10.10  10.10 
1  10 .10 :   P z 
  Pz  0  .5. You don’t have to do this one since you learned in
0.4743 

class that if 1  xcv ,   .5 and power  .5.
 10.10  10.65 
1  10 .65 :   Pz 
  Pz  1.16  .5  .3770  .1230 . power  1    .8770
0.4743 

 10.10  11.20 
1  11 .20 :   Pz 
  Pz  2.32  .5  .4898  .0102 . power  1    .9898.
0.4743 

You now have the power for 5 points above or equal to 9. Make a diagram. Put zero through one on the yaxis and 9 to 11.5 on the y-axis.
e) The confidence interval has the formula   x  z s x . Since   .01 , we know that
2
z 2  z.005  2.576. Recall that x  10.09 and  x 


3.00
 0.4743 . So
n
40
  10.09  2.576 0.4743   10.09  1.222 or 8.87 to 11.29. Many of you seem to think that we were
testing the hypothesis H 0 :   10 .09 . This is impossible because 10.09 is a sample mean. Using the
formula for critical values for this hypothesis gets you the right answer for a very wrong reason and is only
worth half credit.
f) (Extra credit) It is possible to determine the required sample size in a one-sided test for the
mean given certain levels of the probability of a type I and a type II error. If  0 is the
population mean from the null hypothesis and we want to be able to keep the probability of a
type II error to  when the mean is actually 1 , use the following sample size:
n
z  z  2  2
1   0 2
. Assume that   .01 ,   .05 and that 1  9.1.
Solution: The original problem says that   3.00 and that  0  9. This part says that   .01 ,   .05
and that 1  9.1. So z  z.01  2.327 and z   z.05  1.645 .
n
z  z  2  2 2.327  1.645 2 32

1   0 2
9.1  92
 14199 .1 . So our sample size is 14200.
8
252y0312 3/10/03
2. (Kazmier) A company takes a random sample of 100 men in a large community and finds that 42% prefer
its blades.
a) Create a 95% confidence interval for the proportion that favor the blade. (2)
b) How large a sample would we need if the proportion must be known within .05 ? (3)
c) Assume that the company is testing the hypothesis that the proportion is greater than or equal
to 45%, find a critical value for the sample proportion and use it to test the hypothesis using the
data at the beginning of this problem. (2)
d) What would the p-value be in for your hypothesis in c) if (i) the sample proportion was .44 and
(ii) the sample proportion was .46? (3)
e) (Extra credit) Assume that the actual proportion is 43%, what is the power of the test in c)? (3)
Solution: From the formula table we have:
Interval for
Confidence
Hypotheses
Interval
Proportion
p  p  z 2 s p
H 0 : p  p0
pq
n
q  1 p
sp 
H1 : p  p0
a) The problem says that   .05, n  100 , and p 
sp 
Test Ratio
z
p  p0
p
Critical Value
pcv  p0  z 2  p
p0 q0
n
q0  1  p0
p 
x
 .42 . q  1  p  .58. z 2  z.025  1.960.
n
pq
.42 .58 

 .002436  .0494 . p  p  z 2 s p  .42  1.960.0494  .42  .097 or .323 to
n
100
.517.
b) The sample size formula appears in the outline and says n 
is .42 or .5. n 
.42 .58 1.960 2
.05 2
pqz 2
e2
It doesn’t make much difference if p
 374 .3 , so we use n  375 . With .5 we get 385.
 H 0 : p  .45
x
c) 
q0  1  p0  1  .45  .55 .   .05, n  100 , and p   .42 .
n
 H 1 : p  .45
p0 q0
.45.55 

 .002475  .0497 . The critical value formula is pcv  p0  z  p , but since
2
n
100
the alternate hypothesis alleges that the proportion is below .45, we use
pcv  p0  z  p  .45  1.645.0497  .45  .082  .368. Make a diagram showing a Normal curve with
p 
a mean at .45 and a ‘reject’ region below .368. Since .42 does not fall in the ‘reject’ region, do not reject
H 0 . Many of you said to accept or reject H 0 because p cv is below .45. But the critical value would be
below .45 whether the null hypothesis was true or false.
9
252y0312 3/10/03
p  p0
. We already know that  p  .0497 . Since the alternate hypothesis
p
alleges that the proportion is below .45, we are only worried about values of p below .45. Such values will
make z negative. We are thus interested in the probability of getting the value of p that we actually get,
d) The test ratio formula is z 
or something smaller. (i) If p  .44, z 
p  p0
p

.44  .45
 .20 , so that
.0497
pval  P p  .44   Pz  .20   .5  .0793  .4207 .
(ii) If p  .46, z 
p  p0
p

.46  .45
 .20 , so
.0497
that pval  P p  .46   Pz  .20   .5  .0793  .5793 .
e) In c) we said to make a diagram showing a Normal curve with a mean at .45 and a ‘reject’ region below
.368. Power  1   , and that   P Not rejecting H 0 H 0 is false . We do not reject H 0 if p is less than


or equal to the critical value of .368. The problem says to assume that the actual proportion is 43%, so
p1  .43 and   P p  .368 p  .43 If we assume that p  .43, we must have


p0 q0
.43.58 

 .002494  .0499 , though using .047 shouldn’t make much difference.
n
100
.368  .43 

  Pp  .368 p  .43  P z 
  Pz  1.24   .5  .3925  .1075 .
.0499 

Power  1    1  .1075  .8925.
p 
10
252y0312 3/10/03
3. A new product assembly system is introduced and we are trying to find out if the median number of units
assembled per workshift is larger than the 80 units per workshift we got under the old system. The following
data is assembled on the number of units assembled in a workshift.
75, 85, 92, 80, 94, 90, 91, 76, 88, 82, 96, 83
a) Test the hypothesis that the median is above 80 at the 5% level. (4)
b) (Extra credit) Use the second highest number and the second lowest number to create a twosided confidence interval and find its significance level. (3)
c) Lets say that you had a sample of 150 numbers and that 100 were above 80. Repeat a). (2)
d) (Extra credit) If you have 150 numbers and took two numbers at an equal distance (in Order)
from the highest and lowest to create an approximately 95% confidence interval for the median,
what would they be?
Solution: It’s remarkable how many of you thought that this question concerned the mean. There is
no point in answering a question that you haven’t read!
H 0 :   80
a) The numbers in order are (75, 76, 80, 82, 83, 85, 88, 90, 91, 92, 94, 96). 
. Let p be the
H 1 :   80
 H : p  .5
proportion above 80. then, according to the outline, the hypotheses become  0
. n  12 and,
 H 1 : p  .5
under the null hypothesis, we expect half, or less than half of the numbers to be above 80. We should drop
the 80 and revise n so that n  11 . There are x  9 numbers above 80 and
pval  Px  9  1  Px  8  1  .96729  .0371 . Since   .05 and pval   , we reject H 0 .
b) It is questionable whether to drop the 80 in this case. If we do not drop the 80, the interval is 76    94
. This interval is wrong if eleven or more numbers are above the median, or eleven or more numbers are
below the median. For n  12 , we have   2Px  11  21  Px  10 
 21  .99976  2.00024  0048 . If we choose to drop the 80, n  11 and we have
  2Px  10   21  Px  9  21  .99951   .00980 .
 H 0 : p  .5
x
c) Again 
, but, this time n  150 and x  100 . Then p   .6667 . Because of the large
H
:
p

.
5
n
 1
sample, no continuity correction is needed and  p 
z
p  p0
p

.5.5
pq

 .040825 . We use
n
150
.6666  .5
 4.07 . Since this is larger than any value of z from the bottom of the t-table, we
.040825
reject H 0 .
d) We can copy this answer from the outline. The interval is x63  x  x88 .
11
Download