Uploaded by Alena Dimayacyac

Inferential Statistics & Hypothesis Testing Lecture Notes

advertisement
INEN 201: STATISTICAL ANALYSIS FOR INDUSTRIAL ENGINEERING 2
INFERENTIAL STATISTICS – HYPOTHESIS TESTING
I.
II.
III.
IV.
V.
VI.
VII.
VIII.
IX.
X.
XI.
XII.
XIII.
XIV.
XV.
OUTLINE
Inferential Statistics
Hypothesis Testing
What is Hypothesis?
The Null and Alternative Hypothesis
One-tailed and Two-tailed Tests
Critical Value
Statistical Significance
Type I Error and Type II Error
Central Limit Theorem
Steps in Hypothesis Testing
Critical Value Using Z-Test
Selecting the Appropriate Statistical Test
One Sample Mean Test (Z-Test)
One Sample Mean Test (T-Test)
Simple Tests of Hypothesis
●
A Hypothesis is a tentative explanation for
certain events, phenomena, and behavior.
●
A Hypothesis is a statement of prediction of the
relationship between or among variables.
●
A Hypothesis is a statement about the concept
that refers to observable phenomena which
may be judge as true or false and is subject to
empirical testing.
●
A Hypothesis is testable which means that the
relationship between these variables can be
put into test by means of the application of
appropriate statistical test on the data
gathered about variables.
●
A Hypothesis a statement that require us
(Industrial Engineers) to decide whether to
accept or reject about some statistical
parameter.
●
INFERENTIAL STATISTICS
●
The theory of statistical inference consists of
methods by which one makes inference or
generalizations about a population.
TWO MAJOR AREAS OF STATISTICAL INFERENCE
1.
2.
●
Point Estimate: A single value used to
estimate a population parameter.
A Hypothesis is a statement or tentative theory
or assertion or conjecture which aims to explain
facts about the real world.
●
Interval Estimate: A range of values used
to estimate a population parameter.
TWO TYPES OF STATISTICAL HYPOTHESES
ESTIMATION OF PARAMETERS
TESTS OF HYPOTHESIS
●
1.
Testing assumptions or claims about a
population parameter based on sample
data.
NULL HYPOTHESIS (𝐇𝟎 )
●
A null hypothesis must always express
the idea of non-significance of
difference or the idea of no relationship.
●
It is the hypothesis to be tested and it
represents what the investigation
doubts to be true.
●
It is the hypothesis that we hope to
reject.
●
It is the starting point of the testing
process.
HYPOTHESIS TESTING
●
●
Engr. Romeo Ilagan|1|PRELIM
Hypothesis testing is the process of
determining the probability or acceptability
of a hypothesis as derived from a theory
through scientific data collection and through
application of appropriate statistical test.
This is the decision-making procedure about
the hypothesis.
2.
ALTERNATIVE HYPOTHESIS (𝐇𝟏 )
●
An alternative hypothesis is the opposite
of the null hypothesis.
●
It generally represents the hypothetical
statement that the researcher wants to
prove or claim the manufacturer eager to
substantiate.
●
It is the operational statement of the
theory that the experimenter or
researcher believe to be true and wishes
to prove.
●
Alternative hypotheses are classified as
either non-directional (two-tailed) or
directional hypotheses (one-tailed).
PURPOSE OF STATISTICAL HYPOTHESIS TEST
●
To permit generalizations (about population
parameters) from a sample to the population
which it came.
●
This process involves:
○
assumptions about the population
○
the use of probabilities to estimate the
likelihood of the results obtained in the
sample
○
a random sample has been properly selected
WHAT IS HYPOTHESIS?
●
A Hypothesis is basically a statement about the
target population. This is formulated as a result
of years of observation and researches.
INFERENTIAL STATISTICS – HYPOTHESIS TESTING
Non-directional (two-tailed): Suggests that there is
a difference but does not specify the direction of the
effect.
Directional (one-tailed): Indicates the expected
direction of the effect (e.g., an increase or decrease).
BSIE 2-1 | ALJANE
INEN 201: STATISTICAL ANALYSIS FOR INDUSTRIAL ENGINEERING 2
Engr. Romeo Ilagan|1|PRELIM
ALTERNATIVE HYPOTHESES
A.
B.
ONE TAILED AND TWO TAILED TESTS
NON-DIRECTIONAL HYPOTHESIS (𝐇𝒂 )
ONE-TAILED TEST (DIRECTIONAL)
●
One which asserts that one value is
different from another (or others).
●
The rejection region which is equal to alpha (α
) is a place on side of the distribution.
●
It is an assertion that there is a
significant difference or significant
relationship between two statistical
measures (or among three or more
summary measures).
●
A one tailed test has one critical value, either
positive or negative.
●
It implies non-equality (≠).
●
Non-directional hypotheses are
called two-sided hypotheses.
TWO-TAILED TEST (NON-DIRECTIONAL)
●
The two tails of the distribution are considered
the rejection region and a value of alpha (α) is
equally divided into these two tailed.
●
A two tailed test has two critical values, one
positive and the other negative.
●
The critical value of the test statistics is the
value that divides the distribution of sample
means into rejection and acceptance regions.
●
It corresponds to the vertical line in the above
figure.
●
A
test
which
involves
non-directional
alternative hypothesis has two shaded regions.
also
DIRECTIONAL HYPOTHESIS (𝐇𝟏 )
●
It is sometimes
hypothesis.
called
predictive
●
It is an assertion that one measure is less
than (or greater than) another measure
of similar nature.
●
A directional hypothesis involves one of
the order relations, “less than” (<) or
“greater than” (>).
●
Directional hypotheses are also known as
one-sided hypotheses.
THE NULL AND ALTERNATIVE HYPOTHESIS
CRITICAL VALUE
DECISION TO REJECT OR TO ACCEPT
THE NULL HYPOTHESIS
THE NULL AND ALTERNATIVE HYPOTHESIS
●
If the null hypothesis is rejected as a result of
the sample evidence, then the alternative
hypothesis is the conclusion.
●
If there is no sufficient evidence to reject the
null hypothesis, it is retained, but not
accepted.
○
In this case, the null hypothesis is not necessarily
true, but it just cannot be rejected from the
current evidence from the sample data.
INFERENTIAL STATISTICS – HYPOTHESIS TESTING
●
When the computed value of the test statistic
falls within the acceptance region, the null
hypothesis is accepted while if it falls within the
rejection region, the null hypothesis is
rejected.
●
When the absolute value of its critical value test
statistic is less than or equal the absolute
value of its critical value, accept the null
hypothesis; otherwise, reject the null
hypothesis
BSIE 2-1 | ALJANE
INEN 201: STATISTICAL ANALYSIS FOR INDUSTRIAL ENGINEERING 2
Engr. Romeo Ilagan|1|PRELIM
STATISTICAL SIGNIFICANCE
●
The difference in the sample means may exist
for one of two reasons:
1.
NO ACTUAL DIFFERENCE (CHANCE)
●
2.
There is no actual difference between the
groups because both samples were
taken from the same population – the
observed difference is just a chance
occurrence due to the error involved in
the sampling.
CENTRAL LIMIT THEOREM
●
It states that as the sample size (n) increases,
the distribution of the sample means
approaches a normal distribution, regardless
of the original shape of the population
distribution (skewed, flat, normal, or any other
shape).
●
The distribution of the sample means will
become more normally distributed as n
becomes large.
IMPORTANCE
A REAL DIFFERENCE EXISTS
●
Difference actually exists because each
sample
came
from
a
different
population and the difference is
therefore real.
●
When the sample is sufficiently large, you can
assume that the distribution of sample means is
normal.
●
It is therefore easy to compare the samples you
have collected to the theoretical distribution of
sample means
Null Hypothesis: Claims that no real difference exists
between the sample means (any observed difference is
due to chance).
Alternative Hypothesis: Asserts that a real difference
exists between the sample means.
FACTORS INFLUENCING
STATISTICAL SIGNIFICANCE
●
the number of samples
●
the variances of the population and the sample
●
the size of the risk (confidence) taken by the
researches that their conclusion will be wrong
STEPS IN HYPOTHESIS TESTING
1.
2.
3.
4.
5.
6.
TYPE I ERROR AND TYPE II ERROR
●
Data contained in published and unpublished
documents, reports, statistics, manuscripts,
letters, diaries.
State the null hypothesis.
Select an appropriate alternative hypothesis
(H1 ).
Choose the appropriate statistical test.
Select the desired level of significance to be
used. The most common level is 0.05, but 0.01
is also widely used. Sometimes, other alpha(α)
levels such as 0.001, 0.025, or 0.10 are also used.
Compute the calculated value and determine
the critical test value.
Make the decision. REJECT the null hypothesis
if the calculated value is greater than the
critical value, otherwise “do not reject the null
hypothesis”.
CRITICAL VALUE USING Z-TEST
TYPE I ERROR
●
Sometimes called alpha error (α).
●
If the researcher decides to reject the null
hypothesis when it is actually true.
TYPE II ERROR
●
sometimes called beta error (𝜷).
●
If the researcher fails to reject the null
hypothesis when it is actually false.
SELECTING THE APPROPRIATE STATISTICAL TEST
Z-TEST
LEVEL OF SIGNIFICANCE (𝜶)
●
It is the probability of committing a type 1 error.
●
This is the value of 1-α.
LEVEL OF CONFIDENCE (𝜷)
INFERENTIAL STATISTICS – HYPOTHESIS TESTING
●
If the sample size is large (n ≥ 30).
●
When the population standard deviation is
known.
●
If the population standard deviation is
unknown, a z-test may still be used for large
samples (n≥30) due to the Central Limit
Theorem.
BSIE 2-1 | ALJANE
INEN 201: STATISTICAL ANALYSIS FOR INDUSTRIAL ENGINEERING 2
Engr. Romeo Ilagan|1|PRELIM
T-TEST
●
If the sample size is small (n < 30).
●
When the population standard deviation is
unknown.
ONE TAILED OR TWO-TAILED TEST
●
One-tailed test: Used when the alternative
hypothesis is directional, meaning it predicts a
specific direction of effect.
●
Two-tailed test: Used when the alternative
hypothesis is non-directional, meaning it does
not predict a specific direction of effect.
●
The one-sample mean z-test is used to
compare a sample mean (X) to a known
population mean (µ) to determine if there is a
significant difference between them.
ONE SAMPLE MEAN TEST (Z-TEST)
𝒁=
(𝑿 − 𝝁) • (√𝒏)
𝝈
INFERENTIAL STATISTICS – HYPOTHESIS TESTING
BSIE 2-1 | ALJANE
INEN 201: STATISTICAL ANALYSIS FOR INDUSTRIAL ENGINEERING 2
ONE SAMPLE MEAN TEST (T-TEST)
●
Engr. Romeo Ilagan|1|PRELIM
SIMPLE TESTS OF HYPOTHESIS
The one-sample t-test is used for testing a
hypothesis about a single mean when the
sample size is small (n < 30) and the
population
standard
deviation
(σ)
is
unknown.
𝒕=
(𝑿 − 𝝁) • (√𝒏)
𝒔
Exercises:
INFERENTIAL STATISTICS – HYPOTHESIS TESTING
BSIE 2-1 | ALJANE
INEN 201: STATISTICAL ANALYSIS FOR INDUSTRIAL ENGINEERING 2
INFERENTIAL STATISTICS – HYPOTHESIS TESTING
Engr. Romeo Ilagan|1|PRELIM
BSIE 2-1 | ALJANE
Download