P-value - Department of Statistics and Probability

advertisement
STATISTICAL INFERENCES
• SAMPLING DISTRIBUTIONS FOR
PROPORTIONS AND MEANS
• CONFIDENCE INTERVALS FOR
PROPORTIONS AND MEANS
• HYPOTHESIS TESTING FOR
PROPORTIONS AND MEANS
1
SAMPLING DISTRIBUTION MODELS
• SAMPLING DISTRIBUTION MODEL FOR A
PROPORTION
PROBLEM FORMULATION: SUPPOSE THAT p IS AN
UNKNOWN PROPORTION OF ELEMENTS OF A
CERTAIN TYPE S IN A POPULATION.
EXAMPLES
• PROPORTION OF LEFT - HANDED PEOPLE;
• PROPORTION OF HIGH SCHOOL STUDENTS WHO
ARE FAILING A READING TEST;
• PROPORTION OF VOTERS WHO WILL VOTE FOR
MR. X.
2
ESTIMATION OF p
• TO ESTIMATE p, WE SELECT A SIMPLE RANDOM
SAMPLE (SRS), OF SIZE SAY, n = 1000, AND
COMPUTE THE SAMPLE PROPORTION.
• SUPPOSE THE NUMBER OF THE TYPE WE ARE
INTERESTED IN, IN THIS SAMPLE OF n = 1000 IS x
= 437. THEN THE SAMPLE PROPORTION
p̂
IS COMPUTED USING THE FORMULA
x
pˆ 
n
3
IN THE EXAMPLE ABOVE
437
pˆ 
 43.7%
1000
4
WHAT IS THE ERROR OF ESTIMATION?
• THAT IS, WHAT IS
ˆ  p?
p
•
WHAT MODEL CAN HELP US FIND THE
BEST ESTIMATE OF THE TRUE
PROPORTION OF p?
• LET’S START THE ANALYSIS BY FIRST
ANSWERING THE SECOND QUESTION.
5
APPROACH
• SUPPOSE THAT WE TAKE A SECOND
SAMPLE OF SIZE 1000 AND COMPUTE
P(HAT); CLEARLY, THE NEW ESTIMATE
WILL BE DIFFERENT FROM 0.437. NOW,
TAKE A THIRD SAMPLE, A FOURTH
SAMPLE, UNTIL THE TWO THOUSANDTH
(2000 –TH) SAMPLE, EACH OF SIZE 1000. IT
IS OBVIOUS THAT WE WILL LIKELY OBTAIN
TWO THOUSAND DIFFERENT P(HATS) AS
ILLUSTRATED IN THE TABLE BELOW.
6
TABLE OF 2000 SAMPLES OF SIZE EACH n=1000, AND
THEIR CORRESPONDING P(HATS)
SAMPLES OF SIZE n
P(HATS)
n1
p̂1
n2
p̂2
…
…
n2000
p̂2000
7
WHAT DO WE DO WITH THE DATA FOR
P(HATS)?
• WE CONSTRUCT A HISTOGRAM OF
THESE 2000 P(HATS).
# OF
SAMPLES
p
P(HATS)
8
Remarks On Observing the Histogram
• THE HISTOGRAM ABOVE IS AN EXAMPLE
OF WHAT WE WOULD GET IF WE COULD
SEE ALL THE PROPORTIONS FROM ALL
POSSIBLE SAMPLES. THAT DISTRIBUTION
HAS A SPECIAL NAME. IT IS CALLED THE
SAMPLING DISTRIBUTION OF THE
PROPORTIONS.
• OBSERVE THAT THE HISTOGRAM IS
UNIMODAL, ROUGHLY SYMMETRIC, AND
IT’S CENTERED AT P.
9
WHAT DOES THE SHAPE OF THE HISTOGRAM
REMIND US ABOUT A MODEL THAT MAY JUST BE
THE RIGHT ONE FOR SAMPLE PROPORTIONS?
• ANSWER: IT IS AMAZING AND FORTUNATE
THAT A NORMAL MODEL IS JUST THE
RIGHT ONE FOR THE HISTOGRAMS OF
SAMPLE PROPORTIONS.
• HOW GOOD IS THE NORMAL MODEL?
– IT IS GOOD IF THE FOLLOWING
ASSUMPTIONS AND CONDITIONS HOLD.
10
ASSUMPTIONS AND CONDITIONS
• ASSUMPTIONS
• INDEPENDENCE ASSUMPTION: THE
SAMPLED VALUES MUST BE INDEPENDENT
OF EACH OTHER.
• SAMPLE SIZE ASSUMPTION: THE SAMPLE
SIZE, n, MUST BE LARGE ENOUGH
• REMARK: ASSUMPTIONS ARE HARD – OFTEN
IMPOSSIBLE TO CHECK. THAT’S WHY WE ASSUME
THEM. GLADLY, SOME CONDITIONS MAY PROVIDE
INFORMATION ABOUT THE ASSUMPTIONS.
11
CONDITIONS
• RANDOMIZATION CONDITION: THE DATA VALUES MUST BE
SAMPLED RANDOMLY. IF POSSIBLE, USE SIMPLE RANDOM
SAMPLING DESIGN TO SAMPLE THE POPULATION OF
INTEREST.
• 10% CONDITION: THE SAMPLE SIZE, n, MUST BE NO LARGER
THAN 10% OF THE POPULATION OF INTEREST.
• SUCCESS/FAILURE CONDITION: THE SAMPLE SIZE HAS TO
BE BIG ENOUGH SO THAT WE EXPECT AT LEAST 10
SUCCESSES AND AT LEAST 10 FAILLURES. THAT IS,
np  10 ( SUCCESS)
nq  10 ( FAILLURE )
12
THE CENTRAL LIMIT THEOREM FOR THE
SAMPLING DISTRIBUTION OF A PROPORTION
• FOR A LARGE SAMPLE SIZE n, THE SAMPLING
DISTRIBUTION OF P(HAT) IS APPROXIMATELY

N  p,

pq
n




THAT IS, P(HAT) IS NORMAL WITH
MEAN  E ( pˆ )  p
STANDARD DEVIATION   ( pˆ ) 
pq
n
13
Standard Deviation Versus
Standard Error
14
EXAMPLE
• ASSUME THAT 30% OF STUDENTS AT A
UNIVERSITY WEAR CONTACT LENSES
• (A) WE RANDOMLY PICK 100 STUDENTS. LET
P(HAT) REPRESENT THE PROPORTION OF
STUDENTS IN THIS SAMPLE WHO WEAR
CONTACTS. WHAT’S THE APPROPRIATE MODEL
FOR THE DISTRIBUTION OF P(HAT)? SPECIFY THE
NAME OF THE DISTRIBUTION, THE MEAN, AND THE
STANDARD DEVIATION. BE SURE TO VERIFY THAT
THE CONDITIONS ARE MET.
• (B) WHAT’S THE APPROXIMATE PROBABILITY
THAT MORE THAN ONE THIRD OF THIS SAMPLE
WEAR CONTACTS?
15
SOLUTION TO EXAMPLE
16
EXAMPLE
• INFORMATION ON A PACKET OF SEEDS CLAIMS
THAT THE GERMINATION RATE IS 92%. WHAT’S
THE PROBABILITY THAT MORE THAN 95% OF THE
160 SEEDS IN THE PACKET WILL GERMINATE? BE
SURE TO DISCUSS YOUR ASSUMPTIONS AND
CHECK THE CONDITIONS THAT SUPPORT YOUR
MODEL.
• SOLUTION
17
Sampling Distribution For The Difference
Of Two Sample Proportions
18
Sampling Distribution For The Difference
Of Two Sample Proportions
19
SAMPLING DISTRIBUTION OF THE SAMPLE
MEAN X
RECALL THAT
x1  x2  ...  xn
x
n
APPROACH FOR ESTIMATING
X
SAME AS FOR SAMPLING DISTRIBUTION FOR
PROPORTIONS ILLUSTRATED ABOVE
20
ASSUMPTIONS AND CONDITIONS
• ASSUMPTIONS
• INDEPENDENCE ASSUMPTION: THE SAMPLED
VALUES MUST BE INDEPENDENT OF EACH OTHER
• SAMPLE SIZE ASSUMPTION: THE SAMPLE SIZE
MUST BE SUFFICIENTLY LARGE.
• REMARK: WE CANNOT CHECK THESE DIRECTLY,
BUT WE CAN THINK ABOUT WHETHER THE
INDEPENDENCE ASSUMPTION IS PLAUSIBLE.
21
CONDITIONS
• RANDOMIZATION CONDITION: THE DATA VALUES MUST BE
SAMPLED RANDOMLY, OR THE CONCEPT OF A SAMPLING
DISTRIBUTION MAKES NO SENSE. IF POSSIBLE, USE SIMPLE
RANDOM SAMPLING DESIGN TO ABTAIN THE SAMPLE.
• 10% CONDITION: WHEN THE SAMPLE IS DRAWN WITHOUT
REPLACEMENT (AS IS USUALLY THE CASE), THE SAMPLE
SIZE, n, SHOULD BE NO MORE THAN 10% OF THE
POPULATION.
• LARGE ENOUGH SAMPLE CONDITION: IF THE POPULATION
IS UNIMODAL AND SYMMETRIC, EVEN A FAIRLY SMALL
SAMPLE IS OKAY. IF THE POPULATION IS STRONGLY
SKEWED, IT CAN TAKE A PRETTY LARGE SAMPLE TO
ALLOW USE OF A NORMAL MODEL TO DESCRIBE THE
DISTRIBUTION OF SAMPLE MEANS
22
CENTRAL LIMIT THEOREM FOR THE
SAMPLING DISTRIBUTION FOR MEANS
• FOR A LARGE ENOUGH SAMPLE SIZE, n, THE SAMPLING
DISTRIBUTION OF THE SAMPLE MEAN X IS
APPROXIMATELY
 

N  ,

n

• THAT IS, NORMAL WITH
MEAN  E ( x )    population mean
STANDARD DEVIATION   ( x ) 

n
  population s tan dard deviation
23
Standard Deviation Versus
Standard Error
24
EXAMPLE
• SUPPOSE THE MEAN ADULT WEIGHT, , IS 175
POUNDS WITH STANDARD DEVIATION, , OF 25
POUNDS. AN ELEVATOR HAS A WEIGHT LIMIT OF
10 PERSONS OR 2000 POUNDS. WHAT IS THE
PROBABILITY THAT 10 PEOPLE WHO GET ON THE
ELEVATOR OVERLOAD ITS WEIGHT LIMIT?
• SOLUTION
25
EXAMPLE
• STATISTICS FROM CORNELL’S NORTHEAST REGIONAL
CLIMATE CENTER INDICATE THAT ITHACA, NY, GETS AN
AVERAGE OF 35.4 INCHES OF RAIN EACH YEAR, WITH A
STANDARD DEVIATION OF 4.2 INCHES. ASSUME THAT A
NORMAL MODEL APPLIES.
• (A) DURING WHAT PERCENTAGE OF YEARS DOES ITHACA
GET MORE THAN 40 INCHES OF RAIN?
• (B) LESS THAN HOW MUCH RAIN FALLS IN THE DRIEST 20%
OF ALL YEARS?
• (C) A CORNELL UNIVERSITY STUDENT IS IN ITHACA FOR 4
YEARS. LET y (bar) REPRESENT THE MEAN AMOUNT OF RAIN
FOR THOSE 4 YEARS. DESCRIBE THE SAMPLING
DISTRIBUTION MODEL OF THIS SAMPLE MEAN, y (bar).
• (D) WHAT’S THE PROBABILITY THAT THOSE 4 YEARS
AVERAGE LESS THAN 30 INCHES OF RAIN?
26
SOLUTION TO EXAMPLE
27
Sampling Distribution For The Difference
Of Two Sample Means
28
Sampling Distribution For The Difference
Of Two Sample Means
29
CONFIDENCE INTERVALS FOR PROPORTIONS
ESTIMATION
POINT ESTIMATION PRODUCES A
NUMBER (AN ESTIMATE) WHICH IS
BELIEVED TO BE CLOSE TO THE
VALUE OF UNKNOWN PARAMETER.
FOR EXAMPLE: A CONCLUSION MAYBE
THAT “PROPORTION P OF LEFTHANDED STUDENTS IN MSU IS
APPROXIMATELY O.46”
30
SOME POINT ESTIMATORS
PARAMETER
PROPORTION
MEAN
STANDARD
DEVIATION
P


ESTIMATOR
P̂
X
S
31
INTERVAL ESTIMATION
• PRODUCES AN INTERVAL THAT CONTAINS
THE ESTIMATED PARAMETER WITH A
PRESCRIBED CONFIDENCE.
• A CONFIDENCE INTERVAL OFTEN HAS THE
FORM:
POINT ESTIMATE  MARGIN OF ERROR (ME )
32
DEFINITION
• GIVEN A CONFIDENCE LEVEL C%, THE
*
CRITICAL VALUE C IS THE NUMBER SO
THAT THE AREA UNDER THE PROPER
*
*
CURVE AND BETWEEN  C AND C IS C
(IN DECIMALS).
33
SOME CRITICAL VALUES FOR STANDARD
NORMAL DISTRIBUTION
C % CONFIDENCE
LEVEL
80%
CRITICAL VALUE
90%
1.645
95%
1.960
98%
2.326
99%
2.576
Z*
1.282
34
WHAT DOES C% CONFIDENCE REALLY
MEAN?
• FORMALLY, WHAT WE MEAN IS THAT C% OF
SAMPLES OF THIS SIZE WILL PRODUCE
CONFIDENCE INTERVALS THAT CAPTURE THE
TRUE PROPORTION.
• C% CONFIDENCE MEANS THAT ON AVERAGE, IN C
OUT OF 100 ESTIMATIONS, THE INTERVAL WILL
CONTAIN THE TRUE ESTIMATED PARAMETER.
• E.G. A 95% CONFIDENCE MEANS THAT ON THE
AVERAGE, IN 95 OUT OF 100 ESTIMATIONS, THE
INTERVAL WILL CONTAIN THE TRUE ESTIMATED
PARAMETER.
35
CONFIDENCE INTERVAL FOR PROPORTION P
[ONE-PROPORTION Z-INTERVAL]
•
ASSUMPTIONS AND CONDITIONS
RANDOMIZATION CONDITION
•
10% CONDITION
•
SAMPLE SIZE ASSUMPTION OR
SUCCESS/FAILURE CONDITION
•
•
INDEPENDENCE ASSUMPTION
NOTE: PROPER RANDOMIZATION CAN HELP
ENSURE INDEPENDENCE.
36
CONSTRUCTING CONFIDENCE
INTERVALS
ESTIMATOR
SAMPLE PROPORTION
P̂
STANDARD ERROR
C% MARGIN OF
ERROR
C% CONFIDENCE
INTERVAL
SE ( Pˆ ) 
pˆ qˆ
n
ME ( pˆ )  z SE ( pˆ )
*
pˆ  ME ( pˆ )
37
SAMPLE SIZE NEEDED TO PRODUCE A CONFIDENCE
INTERVAL WITH A GIVEN MARGIN OF ERROR, ME
ˆ)  z
ME ( p
SOLVING FOR n GIVES
*
ˆ qˆ
p
n
ˆ qˆ
(z ) p
n
2
( ME )
* 2
ˆ AND qˆ IS A REASONABLE GUESS. IF WE
WHERE p
CANNOT MAKE A GUESS, WE TAKE p
ˆ  qˆ  0.5
38
EXAMPLE
A MAY 2002 GALLUP POLL FOUND THAT ONLY 8% OF A
RANDOM SAMPLE OF 1012 ADULTS APPROVED OF
ATTEMPTS TO CLONE A HUMAN.
(A)
(B)
(C)
(D)
(E)
FIND THE MARGIN OF ERROR FOR THIS POLL IF WE WANT
95% CONFIDENCE IN OUR ESTIMATE OF THE PERCENT OF
AMERICAN ADULTS WHO APPROVE OF CLONING HUMANS.
EXPLAIN WHAT THAT MARGIN OF ERROR MEANS.
IF WE ONLY NEED TO BE 90% CONFIDENT, WILL THE
MARGIN OF ERROR BE LARGER OR SMALLER? EXPLAIN.
FIND THAT MARGIN OF ERROR.
IN GENERAL, IF ALL OTHER ASPECTS OF THE SITUATION
REMAIN THE SAME, WOULD SMALLER SAMPLES PRODUCE
SMALLER OR LARGER MARGINS OF ERROR?
39
SOLUTION
40
EXAMPLE
DIRECT MAIL ADVERTISERS SEND SOLICITATIONS (a.k.a. “junk
mail”) TO THOUSANDS OF POTENTIAL CUSTOMERS IN THE
HOPE THAT SOME WILL BUY THE COMPANY’S PRODUCT.
THE RESPONSE RATE IS USUALLY QUITE LOW. SUPPOSE
A COMPANY WANTS TO TEST THE RESPONSE TO A NEW
FLYER, AND SENDS IT TO 1000 PEOPLE RANDOMLY
SELECTED FROM THEIR MAILING LIST OF OVER 200,000
PEOPLE. THEY GET ORDERS FROM 123 OF THE
RECIPIENTS.
(A) CREATE A 90% CONFIDENCE INTERVAL FOR THE
PERCENTAGE OF PEOPLE THE COMPANY CONTACTS WHO
MAY BUY SOMETHING.
(B) EXPLAIN WHAT THIS INTERVAL MEANS.
(C) EXPLAIN WHAT “90% CONFIDENCE” MEANS.
(D) THE COMPANY MUST DECIDE WHETHER TO NOW DO A
MASS MAILING. THE MAILING WON’T BE COST-EFFECTIVE
UNLESS IT PRODUCES AT LEAST A 5% RETURN. WHAT
DOES YOUR CONFIDENCE INTERVAL SUGGEST? EXPLAIN.
41
SOLUTION
42
EXAMPLE
IN 1998 A SAN DIEGO REPRODUCTIVE CLINIC
REPORTED 49 BIRTHS TO 207 WOMEN UNDER
THE AGE OF 40 WHO HAD PREVIOUSLY BEEN
UNABLE TO CONCEIVE.
(A) FIND A 90% CONFIDENCE INTERVAL FOR THE
SUCCESS RATE AT THIS CLINIC.
(B) INTERPRET YOUR INTERVAL IN THIS CONTEXT.
(C) EXPLAIN WHAT “90 CONFIDENCE” MEANS.
(D) WOULD IT BE MISLEADING FOR THE CLINIC TO
ADVERTISE A 25% SUCCESS RATE? EXPLAIN.
(E) THE CLINIC WANTS TO CUT THE STATED
MARGIN OF ERROR IN HALF. HOW MANY
PATIENTS’ RESULTS MUST BE USED?
(F) DO YOU HAVE ANY CONCERNS ABOUT THIS
SAMPLE? EXPLAIN.
43
SOLUTION
44
Confidence Intervals For The Difference
In Two Sample Proportions
45
Confidence Intervals For The Difference
In Two Sample Proportions
46
INFERENCES ABOUT MEANS
47
ASSUMPTIONS AND CONDITIONS
• INDEPENDENCE ASSUMPTION: THE DATA VALUES
SHOULD BE INDEPENDENT. THERE’S REALLY NO
WAY TO CHECK INDEPENDENCE OF THE DATA BY
LOOKING AT THE SAMPLE, BUT WE SHOULD
THINK ABOUT WHETHER THE ASSUMPTION IS
REASONABLE.
• RANDOMIZATION CONDITION: THE DATA SHOULD
ARISE FROM A RANDOM SAMPLE OR SUITABLY A
RANDOMIZED EXPERIMENT.
48
ASSUMPTIONS AND CONDITIONS
• 10% CONDITION: THE SAMPLE IS NO MORE
THAN 10% OF THE POPULATION.
• NORMAL POPULATION ASSUMPTION OR
NEARLY NORMAL CONDITION: THE DATA
COME FROM A DISTRIBUTION THAT IS
UNIMODAL AND SYMMETRIC. REMARK:
CHECK THIS CONDITION BY MAKING A
HISTOGRAM OR NORMAL PROBABILITY
PLOT.
49
CONSTRUCTING CONFIDENCE INTERVALS
FOR MEANS
• POINT ESTIMATOR:
• STANDARD ERROR:
• C% MARGIN OF ERROR:
50
WHERE tn-1* IS A CRITICAL VALUE FOR STUDENT’S t
– MODEL WITH n – 1 DEGREES OF FREEDOM THAT
CORRESPONDS TO C% CONFIDENCE LEVEL.
n
(t
*
n1
2
) s
2
ME
2
51
REMARK
52
ILLUSTRATIVE PICTURE
53
FINDING CRITICAL t - VALUES
• Using t tables (Table T) and/or calculator, find or
estimate the
• 1. critical value t7* for 90% confidence level if
number of degrees of freedom is 7
• 2. one tail probability if t = 2.56 and number of
degrees of freedom is 7
• 3. two tail probability if t = 2.56 and number of
degrees of freedom is 7
• NOTE: If t has a Student's t-distribution with
degrees of freedom, df, then TI-83 function
tcdf(a,b,df) , computes the area under the t-curve
and between a and b.
54
EXAMPLES FROM PRACTICE
SHEET
55
Confidence Intervals For The Difference
In Two Population Means
56
Confidence Intervals For The Difference
In Two Population Means
57
TESTING HYPOTHESES ABOUT
PROPORTIONS
• PROBLEM
• SUPPOSE WE TOSSED A COIN 100 TIMES
AND WE OBTAINED 38 HEADS AND 62
TAILS. IS THE COIN BIASED?
• THERE IS NO WAY TO SAY YES OR NO WITH
100% CERTAINTY. BUT WE MAY EVALUATE
THE STRENGTH OF SUPPORT TO THE
HYPOTHESIS THAT “THE COIN IS BIASED.”
58
TESTING
• HYPOTHESES
NULL HYPOTHESIS H 0
– ESTABLISHED FACT;
– A STATEMENT THAT WE EXPECT DATA TO
CONTRADICT;
– NO CHANGE OF PARAMETERS.
ALTERNATIVE HYPOTHESIS H A
– NEW CONJECTURE;
– YOUR CLAIM;
– A STATEMENT THAT NEEDS A STRONG
SUPPORT FROM DATA TO CLAIM IT;
– CHANGE OF PARAMETERS
59
IN OUR PROBLEM
H 0 : COIN IS FAIR;
p  0.5
H A : COIN IS BIASED;
p  0.5
WHERE p IS THE PROBABILIT Y THAT
THE COIN TURNS " HEADS ."
60
EXAMPLE
• WRITE THE NULL AND ALTERNATIVE HYPOTHESES
YOU WOULD USE TO TEST EACH OF THE
FOLLOWING SITUATIONS.
• (A) IN THE 1950s ONLY ABOUT 40% OF HIGH
SCHOOL GRADUATES WENT ON TO COLLEGE.
HAS THE PERCENTAGE CHANGED?
• (B) 20% OF CARS OF A CERTAIN MODEL HAVE
NEEDED COSTLY TRANSMISSION WORK AFTER
BEING DRIVEN BETWEEN 50,000 AND 100,000
MILES. THE MANUFACTURER HOPES THAT
REDESIGN OF A TRANSMISSION COMPONENT HAS
SOLVED THIS PROBLEM.
• (C) WE FIELD TEST A NEW FLAVOR SOFT DRINK,
PLANNING TO MARKET IT ONLY IF WE ARE SURE
THAT OVER 60% OF THE PEOPLE LIKE THE
61
FLAVOR.
ATTITUDE
• ASSUME THAT THE NULL
HYPOTHESIS H 0
IS TRUE AND UPHOLD IT,
UNLESS DATA STRONGLY SPEAKS
AGAINST IT.
62
TEST MECHANIC
• FROM DATA, COMPUTE THE VALUE
OF A PROPER TEST STATISTICS,
THAT IS, THE Z-STATISTICS.
• IF IT IS FAR FROM WHAT IS
EXPECTED UNDER THE NULL
HYPOTHESIS ASSUMPTION, THEN WE
REJECT THE NULL HYPOTHESIS.
63
COMPUTATION OF THE Z – STATISTICS OR
PROPER TEST STATISTICS
pˆ  po
z
SD( pˆ )
SD( pˆ ) 
where,
po .qo
n
64
CONSIDERING THE EXAMPLE AT THE
BEGINNING:
0
.
5
(
0
.
5
)
Pˆ  0.38, PO  0.5, SD( Pˆ ) 
 0.05
100
0.38  0.50
AND zo 
 2.4
0.05
65
THE P – VALUE AND ITS COMPUTATION
• THE PROBABILITY THAT IF THE NULL
HYPOTHESIS IS CORRECT, THE TEST
STATISTIC TAKES THE OBSERVED OR
MORE EXTREME VALUE.
• P – VALUE MEASURES THE STRENGTH OF
EVIDENCE AGAINST THE NULL
HYPOTHESIS. THE SMALLER THE P –
VALUE, THE STRONGER THE EVIDENCE
AGAINST THE NULL HYPOTHESIS.
66
THE WAY THE ALTERNATIVE HYPOTHESIS IS
WRITTEN IS HELPFUL IN COMPUTING THE P - VALUE
HA
p  value
NORMAL
CURVE
H A : p  po P( z  zo )
H A : p  po P( z  zo )
H A : p  po 2P( z  zo )
67
IN OUR EXAMPLE,
• P – VALUE = P( z < - 2.4) = 0.0082
• INTERPRETATION: IF THE COIN IS
FAIR, THEN THE PROBABILITY OF
OBSERVING 38 OR FEWER
HEADS IN 100 TOSSES IS 0.0082
68
CONCLUSION: GIVEN SIGNIFICANCE
LEVEL = 0.05
• WE REJECT THE NULL HYPOTHESIS IF THE
P – VALUE IS LESS THAN THE
SIGNIFICANCE LEVEL OR ALPHA LEVEL.
• WE FAIL TO REJECT THE NULL
HYPOTHESIS (I.E. WE RETAIN THE NULL
HYPOTHESIS) IF THE P – VALUE IS
GREATER THAN THE SIGNIFICANCE LEVEL
OR ALPHA LEVEL.
69
ASSUMPTIONS AND CONDITIONS
• RANDOMIZATION
• INDEPENDENT OBSERVATIONS
• 10% CONDITION
• SUCCESS/FAILURE CONDITION
70
EXAMPLE 1
• THE NATIONAL CENTER FOR EDUCATION
STATISTICS MONITORS MANY ASPECTS OF
ELEMENTARY AND SECONDARY EDUCATION
NATIONWIDE. THEIR 1996 NUMBERS ARE OFTEN
USED AS A BASELINE TO ASSESS CHANGES. IN
1996, 31% OF STUDENTS REPORTED THAT THEIR
MOTHERS HAD GRADUATED FROM COLLEGE. IN
2000, RESPONSES FROM 8368 STUDENTS FOUND
THAT THIS FIGURE HAD GROWN TO 32%. IS THIS
EVIDENCE OF A CHANGE IN EDUCATION LEVEL
AMONG MOTHERS?
71
EXAMPLE 1 CONT’D
• (A) WRITE APPROPRIATE HYPOTHESES.
• (B) CHECK THE ASSUMPTIONS AND CONDITIONS.
• (C) PERFORM THE TEST AND FIND THE P – VALUE.
• (D) STATE YOUR CONCLUSION.
• (E) DO YOU THINK THIS DIFFERENCE IS
MEANINGFUL? EXPLAIN.
72
SOLUTION
73
EXAMPLE 2
• IN THE 1980s IT WAS GENERALLY BELIEVED THAT
CONGENITAL ABNORMALITIES AFFECTED ABOUT
5% OF THE NATION’S CHILDREN. SOME PEOPLE
BELIEVE THAT THE INCREASE IN THE NUMBER OF
CHEMICALS IN THE ENVIRONMENT HAS LED TO AN
INCREASE IN THE INCIDENCE OF ABNORMALITIES.
A RECENT STUDY EXAMINED 384 CHILDREN AND
FOUND THAT 46 OF THEM SHOWED SIGNS OF AN
ABNORMALITY. IS THIS STRONG EVIDENCE THAT
THE RISK HAS INCREASED? ( WE CONSIDER A P –
VALUE OF AROUND 5% TO REPRESENT STRONG
EVIDENCE.)
74
EXAMPLE 2 CONT’D
• (A) WRITE APPROPRIATE HYPOTHESES.
• (B) CHECK THE NECESSARY ASSUMPTIONS.
• (C) PERFORM THE MECHANICS OF THE TEST.
WHAT IS THE P – VALUE?
• (D) EXPLAIN CAREFULLY WHAT THE P – VALUE
MEANS IN THIS CONTEXT.
• (E) WHAT’S YOUR CONCLUSION?
• (F) DO ENVIRONMENTAL CHEMICALS CAUSE
CONGENITAL ABNORMALITIES?
75
SOLUTION
76
INFERENCES ABOUT MEANS
• TESTING HYPOTHESES ABOUT
MEANS
• ONE – SAMPLE t – TEST FOR MEANS
• PROBLEM
• Test HO:  = 0
77
ASSUMPTIONS AND CONDITIONS
• INDEPENDENCE ASSUMPTION
• RANDOMIZATION CONDITION
• 10% CONDITION
• NEARLY NORMAL CONDITION OR
LARGE SAMPLE
78
STEPS IN TESTING
• NULL HYPOTHESIS
• HO:  = 0
•
•
•
•
ALTERNATIVE HYPOTHESIS
HA :  >  0
or HA:  < 0
or HA:  ≠ 0
79
ATTITUDE: Assume that the null hypothesis HO is true
and uphold it, unless data strongly speaks against it.
• STANDARD ERROR
• TEST STATISTICS
• t HAS STUDENT’S t – DISTRIBUTION WITH
n – 1 DEGREES OF FREEDOM.
80
P-value: Let to be the observed value of the
test statistic.
HA
P-value
HA:  > 0
P(t > to)
HA:  < 0
P(t <to)
HA:  ≠ 0
P(t > |to|) +
P(t < -|to|)
NORMAL DISTRIBUTION CURVE
81
CONCLUSION
82
EXAMPLES FROM PRACTICE
SHEET
83
Download