Hypotheses and Test Procedures

advertisement
Hypotheses and Test Procedures
Some statements:
a. The mean strength of the pipe welds used in a nuclear power plant is
less than or equal to 100 lb/in2 .
a’. The mean strength of the pipe welds used in a nuclear power plant is
larger than 100 lb/in2 .
b. The proportion of certain power supply units need service during the
warranty period is at most 10%.
b’. The proportion of certain power supply units need service during the
warranty period is more than 10%.
c. The discharge-water temperature of the water used for cooling by a
power plant is normally distributed.
c’. The discharge-water temperature of the water used for cooling by a
power plant is exponentially distributed.
Liang Zhang (UofU)
Applied Statistics I
July 21, 2008
1 / 10
Hypotheses and Test Procedures
Definition
The null hypothesis, denoted by H0 , is the claim that is initially assumed
to be true (the “prior belief” claim). The alternative hypothesis,
denoted by Ha , is the assertion that is contradictory to H0 .
The null hypothesis will be rejected in favor of the alternative hypothesis
only if sample evidence suggests that H0 is false. If the sample does not
strongly contradict H0 , we will continue to believe in the truth of the null
hypothesis.
Remark:
There are only two possible conclusions from a hypothesis testing: reject
H0 or fail to reject H0 ; or corresponding accept Ha or fail to
accept Ha .
We NEVER reject Ha .
Liang Zhang (UofU)
Applied Statistics I
July 21, 2008
2 / 10
Hypotheses and Test Procedures
Problem of interest: if the strength of the pipe welds used in a nuclear
power plant is assumed to be normally distributed , whether the mean
value is less than or equal to 100 lb/in2 .
H0 : µ ≤ 100 v.s. Ha : µ > 100.
After an experiment, we recorded 10 sample data:
101.9 100.4 101.2 100.9 101.7
with X = 101.10.
101.5 100.9 100.1 101.6 100.8
If we believe the sample mean should not exceed 100, then we should
reject H0 from this sample data.
However if we believe the sample mean could be a little bit larger than 100
but not exceed 102, then we should not reject H0 from this sample data.
Liang Zhang (UofU)
Applied Statistics I
July 21, 2008
3 / 10
Hypotheses and Test Procedures
Problem of interest: whether the proportion of certain power supply units
need service during the warranty period is at most 10%.
H0 : p ≤ .1 v.s. Ha : p > .1.
After an experiment, we find that 3 units need service during the warranty
period among 20 samples. If the null hypothesis is true, then the average
number of units need service for this sample should not exceed
n · p = 20 · (.1) = 2.
If we believe the sample mean should not exceed 2, then we should reject
H0 from this sample data.
However if we believe the sample mean should not exceed 3, then we
should not reject H0 from this sample data.
Liang Zhang (UofU)
Applied Statistics I
July 21, 2008
4 / 10
Hypotheses and Test Procedures
A test procedure is specified by the following:
1. A test statistic, a function of the sample data on which the decision
(reject H0 or do not rejece H0 ) is to be based;
2. A rejection region, the set of all test statistic values for which H0
will be rejected.
The null hypothesis will then be rejected if and only if the observed or
computed test statistic value falls in the rejection region.
Remark: The rejection region is more subjective.
Liang Zhang (UofU)
Applied Statistics I
July 21, 2008
5 / 10
Hypotheses and Test Procedures
Definition
A type I error consists of rejecting the null hypothesis H0 when it is true,
traditionally denoted the probability of making type I error by α.
A type II error involves not rejecting H0 when it is false, traditionally
denoted the probability of making type I error by β.
Mathematically,
α = P(Type I Error) = P(reject H0 | H0 )
β = P(Type II Error) = P(fail to reject H0 | Ha )
Remark:
There are no error-free test procedures unless we have the information for
the whole population.
Therefore, we look for those test for which the probability of making either
type of error is small.
Liang Zhang (UofU)
Applied Statistics I
July 21, 2008
6 / 10
Hypotheses and Test Procedures
Problem of interest: if the strength of the pipe welds used in a nuclear
power plant is assumed to be normally distributed , whether the mean
value is less than or equal to 100 lb/in2 .
H0 : µ = 100 v.s. Ha : µ > 100.
After an experiment, we recorded 10 sample data:
101.9 100.4 101.2 100.9 101.7
with X = 101.10.
101.5 100.9 100.1 101.6 100.8
If we choose the cut-off value to be 100, i.e., to reject H0 when X > 100,
what is the probability of making type I error and type II error?
However if we choose the cut-off value to be 102, i.e., to reject H0 when
X > 102, what is the probability of making type I error and type II error?
Liang Zhang (UofU)
Applied Statistics I
July 21, 2008
7 / 10
Hypotheses and Test Procedures
Problem of interest: whether the proportion of certain power supply units
need service during the warranty period is at most 10%.
H0 : p = .1 v.s. Ha : p > .1.
After an experiment, we find that 3 units need service during the warranty
period among 20 samples. If the null hypothesis is true, then the average
number of units need service for this sample should not exceed
n · p = 20 · (.1) = 2.
If we choose the cut-off value to be 2, i.e., to reject H0 when X > 2, what
is the probability of making type I error and type II error?
However if we choose the cut-off value to be 3, i.e., to reject H0 when
X > 3, what is the probability of making type I error and type II error?
Liang Zhang (UofU)
Applied Statistics I
July 21, 2008
8 / 10
Hypotheses and Test Procedures
Proposition
Suppose an experiment and a sample size are fixed and a test statistic is
chosen. Then decreasing the size of the rejection region to obtain a
smaller value of α results in a larger value of β for any particular
parameter value consistent with Ha .
Remark:
1. In other words, there is no rejection region that will simultaneously
make both α and all βs small.
2. First specify the largest value of α that can be tolerated and then find
a rejection region having that value of α rather than than anything
smaller. The resulting value of α is often referred to as the
significance level of the test.
Liang Zhang (UofU)
Applied Statistics I
July 21, 2008
9 / 10
Hypotheses and Test Procedures
Problem of interest: if the strength of the pipe welds used in a nuclear
power plant is assumed to be normally distributed , whether the mean
value is less than or equal to 100 lb/in2 .
H0 : µ = 100 v.s. Ha : µ > 100.
After an experiment, we recorded 10 sample data:
101.9 100.4 101.2 100.9 101.7
with X = 101.10.
101.5 100.9 100.1 101.6 100.8
What is a level 5% test?
Liang Zhang (UofU)
Applied Statistics I
July 21, 2008
10 / 10
Download