113 - uwcentre

advertisement
BUS-122 Applied Statistics Assignment
Instructions:
Answer all the questions on an A4 size paper, all answers must be in HANDWRITTEN form.
Attach the Cover sheet on top of the A4 answer sheets.
All staples must only be stapled on the top left hand corner.
Answer the questions in the number order as given and label them clearly.
1. If a real estate market is strong there will be a close relationship between the asking
price for homes and the selling price. Suppose that one analyst believes that the mean
difference between asking price and selling price for homes in a particular market area
is less than $2,000. To test this using an alpha level equal to 0.05, a random sample of n
= 15 homes that have sold recently was selected. The sample differences between
asking price and selling price are in the following table:
Based on these sample data, what is the critical value expressed in dollars?
(10 marks)
ANSWER:
The analyst claims that the mean difference will be less than $2,000. Given this research
hypothesis, we can form the following null and alternative hypotheses to test statistically:
H o :   $2,000
H a :   $2,000
The test is a one-tail, lower-tail test with 0.05 area in the rejection region. Since we do not
know the population standard deviation and the sample size is relatively small, if we assume
that the population is normally distributed, the t-distribution can be used. The critical t-value
for degrees of freedom 15-1 = 14 and 0.05 in one tail is –1.7613. We find the critical value as
follows:
 t
s
n
= 2000  1.7613
s
.
n
We compute the sample standard deviation, s, from the data as follows:
s
 (x  x)
n 1
2
 $506.59 .
Then the critical value is 2000  1.7613
506.59
15
 $1,769.62
Thus, if x < $1,769.62 reject the null hypothesis; otherwise do not reject.
follows:
x
x
n

$27, 466
 $1,831.067 .
We find x as
Since $1,831.067 > $1,769.62, we do
15
not reject the null hypothesis.
2. A major U.S. oil company has developed two blends of gasoline. Managers are
interested in determining whether a difference in mean gasoline mileage will be
obtained from using the two blends. As part of their study, they have decided to run a
test using the Chevrolet Impala automobile with automatic transmissions. They
selected a random sample of 100 Impalas using Blend 1 and another 100 Impalas using
Blend 2. Each car was first emptied of all the gasoline in its tank and then filled with the
designated blend of the new gasoline. The car was then driven 200 miles on a
specified route involving both city and highway roads. The cars were then filled and
the actual miles per gallon were recorded. The following summary data were
recorded:
Sample Size
Sample Mean
Sample St. Dev.
Blend 1
100
23.4 mpg
4.0 mpg
Blend 2
100
25.7 mpg
4.2 mpg
Based on the sample data, using a 0.05 level of significance, what conclusion should the
company reach about whether the population mean mpg is the same or different for the
two blends? Use the test statistic approach to test the null hypothesis. (8 marks)
ANSWER:
The hypothesis test involves a test for the difference between population means and is
based on large samples from each population. The appropriate null and alternative hypotheses
are:
H o : 1   2
H a : 1   2
.
This will be a two-tailed test since we are interested in testing whether there is a difference
between the two population means with respect to miles per gallon and neither blend of
gasoline is predicted to be superior to the other.
The critical value for a two-tailed test with a significance level of 0.05 is found in the Standard
Normal table to be z = +1.96. The test statistic in this case is computed using:
z
x1  x2  0
s12 s 22

n1 n2

(23.4  25.7)  0
16 17.64

100 100

 2.3
 3.9655 .
.58
Since z = -3.9655 < -1.96, we reject the null hypothesis and conclude that the population means
are not equal. Based on the samples, we infer that Blend 2 will provide higher mean mileage.
3. A PC company uses two suppliers for rechargeable batteries for their notebook
computers. Two factors are important quality features of the batteries: mean use time
and variation. It is desirable that the mean use time be high and the variability be low.
Recently, the PC maker conducted a test on batteries from the two suppliers. In the
test, 9 randomly selected batteries from Supplier 1 were tested and 12 randomly
selected batteries from Supplier 2 were tested. The following results were observed:
Supplier 1
n1 = 9
Supplier 2
n2 = 12
x1  67.25 min.
x2  72.4 min.
s1 = 11.2 min
s2 = 9.9 min.
Based on these sample results, can the PC maker conclude that a difference exists
between the two batteries with respect to the population mean use time? Test using a
0.10 level of significance. (12 marks)
ANSWER:
This test calls for us to test whether the population means are equal. Since the sample
sizes are small and the population variances are unknown, the t-distribution should be
employed if we assume that the populations are normally distributed and the populations have
equal variances.
We can test the following null and alternative hypotheses:
H o : 1   2
H a : 1   2
The critical value for a t-test for a two-tailed test is found from the t-distribution with degrees of
freedom equal to n1 + n2 – 2 = 9 + 12 – 2 = 19. Given a 0.10 significance level, the critical
value from the t-distribution with degrees of freedom equal to 19 is equal to +1.7291.
The test statistic is computed using:
t
( x1  x 2 )  0
sp
where sp is the pooled standard deviation and is computed as:
1
1

n1 n2
sp 
(n1  1) s12  (n2  1) s 22

n1  n2  2
(8)11.2 2  (11)9.9 2
 10.4671 .
19
Then the test statistic is:
t
( x1  x 2 )  0
sp
1
1

n1 n2

(67.25  72.4)  0
1 1
10.4671 
9 12
 1.1158 .
Since t = -1.1158 > –1.7291, we do not reject the null hypothesis. Thus, based on the sample
data, we do not have evidence to indicate that the two batteries differ in mean use time.
Moderate
Keywords: t-test, mean difference, independent
4. The National Football League (NFL) is interested in testing to see whether there is a
difference in the proportion of male fans that prefer instant replay to review officials’
calls and the proportion of female fans that prefer instant replay. It is believed that
males tend to favor the practice to a higher degree than do females. To test this,
random samples of male fans and female fans were selected and the following results
were obtained:
Sample Size
Number Preferring
Male Fans
300
234
Female Fans
100
57
Using a significance level equal to 0.05, what conclusion should be reached based on
the sample data? (10 marks)
ANSWER:
The parameter of interest in this situation is the difference in population proportions.
The research hypothesis states that the proportion of males favoring instant replay exceeds the
proportion of females who favor it. Thus, we can formulate the following null and alternative
hypotheses:
Ho :1   2
H a :1   2
.
This means that we are dealing with a one-tailed hypothesis test. The critical value for a
one-tailed test for the difference between population proportions with a 0.05 significance level
is z = 1.645 from the standard normal distribution. The test statistic is computed using:
( p1  p 2 )  0
z
p (1  p )(
where p 
1
1
 )
n1 n2
x1  x2
234  57

 .7275 and
n1  n2 300  100
p1 
x1 234

 .78 and
n1 300
p2 
x2
57

 .57 .
n2 100
Then, the test statistic is:
z
( p1 p 2 )  0
p (1  p )(
1
n1


(.78  .57 )  0
1
1
)
n2
(.7275 )(.2725 )(
300
 4.0846 .

1
)
100
Since 4.0846 > 1.645, we reject the null hypothesis and conclude that based on the sample data,
the male fans do favor instant replay by a higher proportion than do female fans.
5. The Department of Weights and Measures in a southern state has the responsibility
for making sure that all commercial weighing and measuring devices are working
properly. For example, when a gasoline pump indicates that 1 gallon has been
pumped, it is expected that 1 gallon of gasoline will actually have been pumped. The
problem is that there is variation in the filling process. The state’s standards call for
the mean amount of gasoline to be 1.0 gallon with a standard deviation not to exceed
0.010 gallons. Recently, the department came to a gasoline station and filled 10 cans
until the pump read 1.0 gallon. They then measured precisely the amount of gasoline
in each can. The following data were recorded:
Based on these data what should the Department of Weights and Measures conclude if
they wish to test whether the standard deviation exceeds 0.010 gallons or not, using a
0.05 level of significance? (8 marks)
ANSWER:
The variation requirement specifies that the standard deviation can not exceed 0.01
gallon, we need to use a test for a single population variance. The appropriate null and
alternative hypotheses are:
H o :  2  .0001
H a :  2  .0001
.
Note that we have converted from standard deviation to variances since there is no test for
standard deviation directly. In this situation, the test statistic is a chi-square value computed as
follows:
2 
(n  1) s 2
2
=
(10  1).0388 2
= 135.49.
.012
This test statistic can be compared to the chi-square value from the table with 10 - 1 = 9
degrees of freedom and a one-tail area of 0.05. That value is 16.9190. Since
 2  135.49  16.9190 , the null hypothesis is rejected. This means that the variation
standard is being exceeded based on these data. The gasoline station will need to get
the pump checked out and the variation of fill around the mean reduced.
6. A PC company uses two suppliers for rechargeable batteries for their notebook
computers. Two factors are important quality features of the batteries: mean use time
and variation. It is desirable that the mean use time be high and the variability be low.
Recently, the PC maker conducted a test on batteries from the two suppliers. In the
test, 9 randomly selected batteries from Supplier 1 were tested and 12 randomly
selected batteries from Supplier 2 were tested. The following results were observed:
Supplier 1
n1 = 9
Supplier 2
n2 = 12
x1  67.25 min.
x2  72.4 min.
s1 = 11.2 min
s2 = 9.9 min.
Based on these sample results, can the PC maker conclude that a difference exists
between the two batteries with respect to the population standard deviations? Test
using a 0.10 level of significance. (12 marks)
ANSWER:
Ideally, we would like to test whether the two population standard deviations are equal.
However, there is no test available for doing this test directly. Instead, we must convert to
variances and perform the F-test. The following null and alternative hypotheses would be
appropriate in this situation:
H o :  12   22
H a :  12   22
This will be a two-tailed test since we are testing to see whether a difference exists and are not
predicting which battery will have more or less variability. In a two-tailed test of variances,
the test statistic is an F-value that is formed by the ratio of the two sample variances and by
placing the larger sample variance in the numerator:
F
s12 11.2 2 125.44


 1.2799 .
98.01
s 22
9.9 2
We then compare this value to a critical value from the F-distribution table. Since this is a
twotail test with alpha = 0.10, we use the table with 0.05 in the upper-tail of the F-distribution.
Two sets of degrees of freedom are used with the F-distribution. Across the top of the F-table
we look for degrees of freedom corresponding to n-1 where n is the sample size associated with
the sample variance that was placed in the numerator of the F-test statistic. In our case, n = 9,
so the degrees of freedom is 9-1 = 8 for the numerator. The degrees of freedom down the side
in the F-table is n-1 where n corresponds to the sample size for the sample variance in the
denominator of the F-test statistic. In our case, that would be n = 12. Thus, the denominator
degrees of freedom are 12-1 = 11. Then the critical F from the 0.05 table with 8 and 11 degrees
of freedom is 2.948. The decision rule is:
If F calculated > 2.948 reject Ho, otherwise do not reject.
Since our calculated F-test statistic is F = 1.2799 < 2.948, we do not reject the null hypothesis.
Thus, based on the sample information, we have no basis for believing that there is a difference
in the two batteries with respect to variability in use time.
Download