Hypothesis Testing

advertisement
Hypothesis Testing
Dhon G. Dungca, PIE, M.Eng’g.
Hypothesis Testing
• Hypothesis – is an assumption about a
population or an assertion about the
possible value of a population parameter.
• Hypothesis Testing – is a statistical process
of determining whether a hypothesis made
is reasonable or not, based upon sample
evidence.
Hypothesis Testing
• Example:
– Hypothesis : “The mean of a population is 50”
– Test: We decide to take 40 samples and
determine x to test if our hypothesis is valid.
Steps in Hypothesis Testing
• Step 1: Formulate the Null Hypothesis
– Denoted by H0
– Is the hypothesis we hope to reject
– This is the hypothesis used for testing and is the starting
point of the testing process
– Must always express the idea of a no significant
difference or relationship
– It is a precise statement of equality, such as =10.
– We always first consider that the null hypothesis is true
unless proven otherwise by sample evidence.
Steps in Hypothesis Testing
• Assuming k to be any constant, we have the
following possible forms of the null hypothesis:
– H0 :  = K
– H0 :  = K
• As a precaution, we never make hypothesis tests
regarding the statistic. Hence, the null hypothesis
H0 : x = K is not acceptable. We only hypothesize
about the value of the population.
Steps in Hypothesis Testing
• Step 2: Formulate the Alternative
Hypothesis
– Denoted by H1
– Is opposite the null hypothesis
– It specifies existence of a difference or a
relationship
– The acceptance of H1 would mean that the
difference between the statistic and population
parameter hypothesized is too great for us to
allow the acceptance of H0.
Steps in Hypothesis Testing
• Assuming k to be any constant, we have the
following possible forms of the alternative
hypothesis:
–
–
–
–
H1 :  = K
H1 :  = K
H1 :  < K or H1 :  > K
H1 :  < K or H1 :  > K
• If H1 is in the form of H1 :  = K, we have a
two-tailed test.
Steps in Hypothesis Testing
H0:  = K
H1:  = K
Two-tail Test
H0 is rejected

H0 is not rejected
H0 is rejected
Steps in Hypothesis Testing
H0:  = K
H1:  < K
One-tail Test

H0 is rejected
H0 is not rejected
Steps in Hypothesis Testing
H0:  = K
H1:  > K
One-tail Test

H0 is not rejected
H0 is rejected
Steps in Hypothesis Testing
• Step 3: Specify the level of significance, 
–
–
–
–
Specifies the area within H1 is accepted.
1- is the Level of confidence
 is the Level of significance
 divides the graph into 2 regions, the region of
the acceptance of H0 and the region of the
acceptance of H1 (critical region).
– It is customary to use an  of 0.05 or 0.01.
Level of Significance
H0:  = K
H1:  = K
Two-tail Test
1-
/2
/2
H0 is rejected
H0 is not rejected
H0 is rejected
Level of Significance
H0:  = K
H1:  < K
One-tail Test
1-

H0 is rejected
H0 is not rejected
Level of Significance
H0:  = K
H1:  > K
One-tail Test
1-

H0 is not rejected
H0 is rejected
Steps in Hypothesis Testing
• Step 4: Decide which sampling distribution
to choose
– Determine the appropriate test statistic whose
sampling distribution is known under the
assumption that H0 is true.
– Parameter Tested
Test Statistic Used
•
z or t
•
x2
Steps in Hypothesis Testing
• Step 5: Determine the critical value and the
critical region
– The critical value is the dividing point between
the acceptance and rejection points.
– The critical region is the region within which
H0 is rejected.
– The critical region (CR) depends upon the
value of the level of significance selected in
Step 3.
Critical Regions
CR: Z < -Z /2
H0:  = K
Z > Z 1-/2
H1:  = K
Two-tail Test
1-
/2
/2
H0 is rejected
-Z /2
H0 is not rejected
Z 1-/2
H0 is rejected
Critical Regions
CR: Z < -Z 
H0:  = K
H1:  < K
One-tail Test
1-

H0 is rejected
-Z 
H0 is not rejected
Critical Regions
CR: Z > Z 1-
H0:  = K
H1:  > K
One-tail Test
1-

H0 is not rejected
Z 1-
H0 is rejected
Steps in Hypothesis Testing
• Step 6: Compute the value of the statistic
– Using the data from a sample of size n,
compute the value of the test statistic z, t or x2
whichever was selected in Step 4 using the
appropriate formula for each case.
• Step 7: State the conclusion and make the
decision
– Determine whether the test statistic is in the
critical region or not. If the test statistic is in
the CR, we accept H1. Otherwise, H0 is
selected.
Test of Means
Gives us an indication of the true
average of a population.
Test of Means (Summary)
H0:  = K
H1:  = K,  < K or  > K
 : 0
CR: Z < -Z /2 or
Z < -Z 
or
Z > Z 1-
or
t> t
Z > Z 1-/2
t < -t /2
t > t /2
or
t < -t 
Test of Means (Summary)
Computations:
Used if n  30
z=
x-
/ n
Used if n < 30
t=
x-
s/ n
Conclusion:
or
z= x-
s/ n
Example 1:
• A new process will be installed if its mean
processing time is at most 20 minutes. The
new procedure was tried. In a random
sample of 50 trials, an average processing
time of 22.2 minutes with a standard
deviation of 4.3 minutes was obtained. At a
level of significance =0.05, should the
new process be installed?
Answer:
H0:  = 20 (install the new process)
H1:  > 20 (do not install)
: 0.05
CR: z > 1.645
Computations:
z = 22.2 – 20
H0 is not rejected
H0 is rejected
4.3 / 50
1.645
3.62
= 3.62
Conclusion:
Since z is greater than 1.645, we reject H0. We
should not install the new process.
Example 2:
• A company that makes chocolates claims
that the mean weight of a bag of chocolates
is 240 grams with a standard deviation of
20.5 grams. Using a 0.05 significance level,
would you agree with the company if a
random sample of 50 bags of chocolates
was found to have a mean weight of 230
grams?
Answer:
H0:  = 240
H1:   240
: 0.05
CR: z < -1.96
H1 is accepted
H1 is accepted
z > 1.96
H0 is accepted
Computations:
z = 230-240
-1.96
1.96
20.5 / 50
-3.45
= -3.45
Conclusion:
Reject Ho. The claim of the manufacturer is not true
Example 3:
• The Edison Electric Institute has published figures
on the annual number of kilowatt-hours expended
by various home appliances. It is claimed that a
vacuum cleaner expends an average of 46
kilowatt-hours per year. If a random sample of 12
homes included in a planned study indicates that
vacuum cleaners expend an average of 42
kilowatt-hours with a standard deviation of 11.9
kilowatt-hours, does this suggest at the 0.05 level
of significance that vacuum cleaners expend, on
the average, less than 46 kilowatt-hours annually?
Answer:
H0:  = 46
H1:  < 46
: 0.05
CR: t < -1.796 w/ v = n-1
degrees of freedom
Computations:
t = 42 - 46
11.9 / 12 H1 is accepted
H0 is accepted
= -1.16
Conclusion:
-1.796
-1.16
Do not reject Ho.
Example 4:
• A new process for producing synthetic diamonds
can be operated at a profitable level only if the
average weight of the diamonds is greater than
0.5K. To evaluate the profitability of the process,
6 diamonds are generated with a mean and a
standard deviation of 0.53 and 0.0559
respectively. Do the 6 diamonds’ measurements
present sufficient evidence to indicate that the
average weight of the diamond produced by the
process is in excess of 0.5K?
Answer:
H0:  = 0.5
H1:  > 0.5
: 0.05
CR: t > 2.015 w/ v = 5
degrees of freedom
Computations:
t = 0.53 – 0.5
.0559 / 6
= 1.32
Conclusion:
Do not reject Ho.
H1 is accepted
H0 is accepted
1.32
2.015
Test of Variance
Gives us the degree of dispersion of
the population in consideration
Test of Variance (Summary)
H 0 : 2 = K
H1: 2 = K, 2 < K or 2 > K
 : 0
CR: x2 < x2 1-/2 or
x2 < x21-
or
x2 > x2
x2 > x2/2
Computations:
x2 =
(n-1) S2
2
Conclusion:
Accept Ho
Test of Variance (Summary)
x2 > x2 
x2 < x21-
Accept H1
Accept Ho
Accept Ho
x21-
Accept H1
x2
x2 < x2 1-/2
x2 > x2/2
Accept H1
Accept Ho
x2 1-/2
x2 /2
Accept H1
Example 5:
• In the past, scores in DIFFCAL test have been
found to be normally distributed with a variance of
100. A class of 29 students now taught by an
outstanding teacher whose method of teaching is
expected to have a wider dispersion of test scores.
Suppose that the class can be considered to be a
random sample and suppose that the variance of
student scores in the final examinations is found to
be 150; can we conclude that this instructor’s
method of teaching results in a wider dispersion at
the 5% level of significance?
Answer:
H0: 2 = 100 (no wider dispersion)
H1: 2 > 100 (there is wider dispersion)
: 0.05
CR: x2 > 41.337 (x20.05 at v = 29-1 = 28)
degrees of freedom
Computations:
x2 = (29-1)(150)
Accept H1
100
Accept Ho
= 42
Conclusion:
41.337
42
Reject Ho. Teaching method results in a wider
dispersion
Example 6:
• The volume of containers of a particular
lubricant is known to be normally
distributed with a variance of 0.03 liter.
Test the hypothesis that 2 = 0.03 against
the alternative that 2  0.03 for the random
sample of 10 containers if their variance is
0.06. Use 0.01 level of significance.
Answer:
H0: 2 = 0.03
H1: 2  0.03
: 0.01
CR: x2 < 1.735 (x20.995 at v = 10-1 = 9)
x2 > 23.589 (x20.005 at v = 10-1 = 9)
Computations:
x2 = (10-1)(0.06)
0.03
Accept Ho
= 18
Accept H1
Accept H1
Conclusion:
1.735
23.589
Do not reject Ho.
18
Example 7:
• Past experience indicates that the time for
high school seniors to complete a
standardized test is normal random variable
with a standard deviation of 6 minutes. Test
the hypothesis that  = 6 against the
alternative that  < 6 if a random sample of
20 high school seniors has a standard
deviation s = 4.51. Use a 0.05 level of
significance.
Answer:
H0: 2 = 36
H1: 2 < 36
: 0.05
CR: x2 < 10.117 (x20.95 at v = 20-1 = 19)
Computations:
x2 = (20-1)(20.34)
36
Accept Ho
= 10.74
Accept H1
Conclusion:
10.117
Do not reject Ho.
10.74
Extra Challenge 1:
• For some years now, a certain hospital has been using
Formula  for patients known to have a certain type of
disease. It has been proven that the mean recovery time of
patients under Formula  is equal to 20 days with a
standard deviation of 4 days. A new type of treatment is
now available. This new type of treatment is branded as
Formula . A random sample of 25 patients was created
with this new formula and showed a mean recovery time of
18 days with a standard deviation of 3.5 days. The hospital
is not willing to gamble on new formula unless it is proven
to be better than the old formula as far as mean recovery
time and variability are concerned. Decide whether the
hospital should employ the new formula at the 5% level of
significance. (Hint: 2 tests)
Extra Challenge 2:
• The specifications for a component of a particular
item provide that the mean length should not be
less than 4 cm with a standard deviation 0.45 cm.
A random sample of 50 components drawn from a
very large shipment showed a mean length of 3.95
cm and a standard deviation of 0.45 cm. If you
were the manufacturer of this item who had
received this shipment, would you accept it?
Assume 5% level of significance. What if the
standard deviation is 0.55 cm?
Download