6. Determine if the data support the theory

advertisement
6. Determine if the data support the theory
More details about hypotheses
 A priori,
 Null must be falsifiable.
 Null should give simplest possible explanation – e.g. the
observed difference is because of sampling error (normal
variation that exists for a random variable).
 Alternative is the next simplest – the observed difference is
because of something other than randomness.
 Hypotheses are about parameters ( or p or ρ), the null
contains the equal sign; the alternative contains the
inequalities.
Complete and submit In-Class Activity – Writing Hypotheses
Page 269
The reasoning process for deciding which hypothesis the data
supports is the same for any parameter (p or  or ρ).
1. Assume the null hypothesis is true.
2. Gather data and calculate the statistic.
3. Determine the likelihood of selecting the data that produced the
statistic or could produce a more extreme statistic.
4. If the data are likely, they support the null hypothesis.
However, if they are unlikely, they support the alternative
hypothesis.
Likely and unlikely statistics are found through theoretical
sampling distributions
Suppose your hypotheses were:
H0: p = 0.25
H1: p > 0.25
Assuming the null hypothesis is true, does it seem likely or
unlikely that the value, or more extreme values, would be selected
if the sample proportion is: 0.24? 0.28? 0.30? 0.36? 0.40?
The direction of extreme is the direction on the number line that
will lead to selecting the alternative.
Sampling distribution if p = 0.25.
Notice the distribution and the center of the distribution.
.
Sampling distribution if p = 0.5
Sampling distribution if p = 0.6
In-class Activity Sampling Distributions Page 271
Compare all three sampling distributions when they are on the
same graph.
Suppose our question is: Does a majority of people in the US
believe drilling for oil in the Arctic Ocean should be allowed?
If our hypotheses were
H0: p = 0.5
H1: p > 0.5
If we represent the alternative hypothesis with the p = 0.6
distribution, from which distribution are we most likely to get a
pˆ  0.52 or higher value?
What about
pˆ  0.58 or
higher value?
Now consider the following in which the alternative is based on
p = 0.55.
What if pˆ  0.52 .
We can make errors without doing anything wrong!
Type I error - the data supports the alternative hypothesis
when the null hypothesis is true.
Type II errors –the data supports the null hypothesis when
the alternative is true.
Probability is our tool for understanding errors and making
decisions.
We want to know the probability of making these errors.
Discuss probability briefly.
Probability is always a fraction or a decimal between 0 and 1. This
is shown generically as 0  P(A)  1 where P(A) represents the
probability of event A.
Probability of a Type I error is α, called the level of significance.
The probability of a Type II error is β. The probability of correctly
selecting the alternative is called power (1 – β).
The Evidence
upon which
the decision is
based.
The Data
Supports H0
The Data
Supports H1
The True Hypothesis
H0 Is True
H1 Is True
No Error
Type II Error
Probability: β
Type I Error
No Error
Probability: α Probability:
Power
Because some values clearly support the null hypothesis,
others clearly support the alternative hypothesis but some
do not clearly support either, then a decision has to be
made, before data is ever collected (a priori), as to the
probability of making a Type I error that is acceptable to
the researcher.
In our current example, if a level of significance of 0.05 is
used, a decision rule line is placed on the distribution that
separates the likely, from the 5% of unlikely values,
dependent upon the direction of the extreme.
β
5% of the null values are higher than 0.54 so there is a 5%
chance we would make a Type I error.
19% of the values of the alternative distribution are below
0.54, so there is a 19% chance of making a Type II error.
Power is 81%.
0.54 is called the critical value.
How to Make Statistical Decisions (p-value, alpha, significant)
The choice of which hypothesis is to be supported must be
based on evidence. The evidence is provided by statistics
calculated from the selected data.
The probability of getting the statistic, or more extreme statistics
assuming the null hypothesis is true, is found. This probability is
called a p-value.
If the p-value is less than or equal to α, the data supports the
alternative hypothesis. If the p-value is greater than α, the data
supports the null hypothesis.
When the data supports the alternative hypothesis, the data are said
to be significant. When the data supports the null hypothesis, the
data are not significant.
Demonstrate p-value if
pˆ  0.55 .
Then if
pˆ  0.52 .
In-Class Activity (p-value and alpha) page 272
Conclusions
Conclusions in this course (book and exams) will be expressed in a
way that is similar to that used by researchers who publish in
scholarly journals.
If the data was = 0.55, we write: At the 0.05 level of
significance, the proportion of the population that believes drilling
in the Arctic Ocean should be allowed is significantly greater than
0.50 (p = 0.02, n = 500).
If the data was = 0.52, we write: At the 0.05 level of
significance, the proportion of the population that believes drilling
in the Arctic Ocean should be allowed is not significantly greater
than 0.50 (p = 0.22, n = 500).
Level of Significance
The most commonly used values for α are 0.05, 0.01 and 0.10.
Draw a number line showing alphas and p-values and the zones of
significance.
α:
0 0.01 0.05
0.10
1
The choice for a level of significance should be based on several
factors.
1. If the power of the test is low because of small sample
sizes or weak experimental design, a larger level of significance
should be used.
2. Keep in mind the ultimate objective of research – “to
understand which hypotheses about the universe are correct.
Ultimately these are yes and no decisions.” (Scheiner, 2001) p8.
Statistical tests should lead to one of three results. One result is
that the hypothesis is almost certainly correct. The second result
is that the hypothesis is almost certainly incorrect. The third
result is that further research is justified. P-values within the
interval (0.01,0.10) may warrant continued research, although
these values are as arbitrary as the commonly used levels of
significance.
3. If we are attempting to build a theory, we should use
more liberal (higher) values of α, whereas if we are attempting to
validate a theory, we should use more conservative (lower) values
of α.
The choice of hypothesis that is supported is based on both
alpha and p-value. The researcher determines the former before
data is collected (a priori) and the later is determined from the
evidence. The level of significance cannot be changed if you are
unhappy with the result. This is like “throwing darts against the
wall and then drawing the target around them”.
Notation:
– In the study of statistics there are several words that start with
the letter p and use p as a variable.
– The list of words includes parameters, population,
proportion, sample proportion, probability and p-value.
o The words parameter and population are never
represented with a p.
o Probability is represented with notation that is similar to
function notation you learned in algebra, f(x), which is
read f of x. For probability, we write P(A) which is
read the probability of event A.
o To distinguish between the use of p for proportion and
p for p-value, pay attention to the location of the p.
 When p is used in hypotheses, such as H0: p =
0.5, H1: p > 0.5, it means the proportion of the
population.
 When p is used in the conclusion, such as the
proportion is significantly greater than 0.5 (p =
0.02, n = 500), then it is interpreted as a p-value.
o If the sample proportion is given, it is represented as
= 0.55.
7. Make an evidence-based decision
The meaning of “not significant” - be careful with your words.
Ultimately decisions are stated as being significant (low p-value,
alternative is chosen) or not significant (high p-value, null is
chosen).
 Care must be taken when interpreting the phrase “not
significantly greater than 0.50”.
o This phrasing lets the reader know the null
hypothesis was selected but it does not provide
conclusive evidence that the null hypothesis is
true. It only provides evidence that the alternative
hypothesis is not true.
o The wording can be very tricky. Some authors
are willing to say that the null hypothesis is
accepted. Others say the null hypothesis is not
rejected. Others may say that the alternative
hypothesis is not accepted.
 OLI:
"The data provide enough evidence to reject
claim 1 and accept claim 2"; or
"The data do not provide enough evidence to
reject claim 1."
In particular, note that in the second type of conclusion we
did not say: "I accept claim 1," but only "I don't have enough
evidence to reject claim 1."”
o Claiming that the data support the null hypothesis
means that the data had a high probability of
being selected if the null hypothesis is true.
However, that data could also have been selected
from a population with a parameter that is slightly
different from the one in the null hypothesis.
 For example, a researcher may have the null
hypothesis of H0: p = 0.50 that is not
rejected but the reality could be that the true
proportion is actually p = 0.502.
o By saying we accept the null hypothesis, we
would be saying we really believe the proportion
is 0.500.
o By saying we don’t reject the null hypothesis or
that we support the null hypothesis, we are
concluding that 0.5 is a possible proportion but
acknowledge the true proportion could be some
slightly different value.
o The reality is that if the hypothesis test results are
being used to make a decision, then significant
data leads to the decision that is justified by the
alternative hypothesis while non-significant data
leads to the decision that is justified by the null
hypothesis.
Fracking Video
Example 1 First Attempts, Page 273 - Arsenic
Briefing: Arsenic is a naturally occurring element and also a human produced element
(e.g. fracking, combustion of coal) that can be found in ground water. It causes a
variety of health problems and can lead to death. The EPA limit is 10 ppb, meaning 10
ppb or higher is unsafe.
Problem: A natural gas company wants to put a fracking well on your property. They
will pay you royalties. They promise that the technology they use will not contaminate
the aquifer from which you and others in your community get your water. A year later,
sickness in the community leads health department officials to test your water to
determine if it is contaminated with arsenic. The official will take 5 samples of water
over the next 2 months and decide whether you have safe water or unsafe water based
on the average of these samples. Assume these are the two possible distributions that
exist.
H0: Water is Safe H1: Water is Unsafe
α = 0.30.
What is the direction of the extreme?
Show the decision rule line on both distributions.
What is the critical value?
Label α, β, and power
What is the probability of α?
What is the probability of β?
What is the power?
What is the consequence of a Type I error?
What is the consequence of a Type II error?
Data: What you select from the container that was passed around the classroom
What is the p-value for your data?
Write a concluding sentence:
What decision do you make about your house and water supply?
Example 2: Can a summer job pay for a year’s tuition?
Briefing: There was a time when a student could work a summer job and pay for a
year’s tuition. But that was also a time when the state covered a greater percentage (up
to 90%) of college costs. State funding of higher education has not kept up, so a summer
job is no longer sufficient for paying tuition and students have to rely on loans, which
mean post-graduation debt. The generation that got help with their tuition does not
appear to want to help the next generation.1 Currently, state support is about 30%, but
you think it is less than that.
Is the proportion of tuition paid by tax dollars less than .3 (30%)? If it is, you will
become actively involved in lobbying legislators. If it isn’t, you won’t get involved.
H0: p = 0.30
H1: p < 0.30
α = 0.07.
Show the decision rule line on both distributions.
What is the direction of the extreme?
Label α, β, and power
What is the probability of α?
What is the probability of β?
What is the power?
What is the consequence of a Type I error?
What is the consequence of a Type II error?
Data: A sample of 50 colleges showed that state support was 25%.
Write a concluding sentence:
Based on your decision rule, will you lobby legislators?
1
http://seattletimes.com/html/localnews/2021250505_westneat23xml.html?syndication=rss&utm_source
=feedly
Download