Notes: Comparing Counts - Goodness of Fit Objectives: Identify

advertisement
Notes: Comparing Counts - Goodness of Fit
Objectives:
1) Identify when the Goodness of Fit Test should be used to test a hypothesis.
2) Calculate the GOF test statistic (Chi-Square) and P-value with a graphing calculator.
3) Test a hypothesis using the Goodness of Fit Test.
Types of data you may test:
Does your zodiac sign predict how successful you will be later in life? Fortune magazine collected
the zodiac signs of 256 CEO’s of the largest 400 companies in America. Analyze the following data:
# of Births
23
20
18
23
20
19
18
21
19
22
24
29
Zodiac Sign
Aries (March 21 – April 19)
Taurus (April 20 – May 20)
Gemini (May 21 – June 20)
Cancer (June 21 – July 22)
Leo (July 23 – August 22)
Virgo (August 23 – September 22)
Libra (September 23 – October 22)
Scorpio (October 23 – November 21)
Sagittarius (November 22 – December 21)
Capricorn (December 22 – January 19)
Aquarius (January 20 – February 18)
Pisces (February 19 – March 20)
1. Are the counts independent of
each other?
2. Are the individuals from a
random sample?
3. Are there at least 5 individuals in
each count?
4. Is the sample less than 10% of
the population?
Think about the data:
 We have 256 CEO’s and 12 categories.
 The (ideal) H0 is that birth dates of CEO’s are divided equally among all the zodiac signs.
 That means that (1/12) of 256 = 21.333.
 So, the (ideal) H0 is expecting 21.333 people from each category.
 H1: Birth dates of CEO’s are not divided equally among all the zodiac signs.
 The test statistic looks at how closely the observed data match this ideal situation.
The Goodness of Fit Test (AKA Chi-Square Goodness of Fit Test)
 A way to test a hypothesis when you have many categories
 Scientists use this test to determine significance differences between theoretical data and
experimental data
 This is always a right tailed test
 Degrees of freedom = n – 1 (n is the number of categories)
When can you use the Goodness of Fit test?
1. Counts in the cells are independent of each other
2. Individuals are from a random sample
3. There are at least 5 individuals in each cell
4. The sample is less than 10% of the population
Test Statistic:
X2 
Calculator Steps:
(observed  exp erimental ) 2
exp erimental
Test the claim that birth dates of CEO’s are divided equally among all the zodiac signs.
Practice Problems – Goodness of Fit Test
Identify each H0 and H1. Use a calculator to generate the test-statistic and p-value. Test the
hypothesis. Show all work. Explain your conclusion.
1) Suppose we hypothesize that we have an unbiased six-sided die. To test
this hypothesis, we roll the die 300 times and observe the frequency of occurrence of each of the
faces. Because we hypothesized that the die is unbiased, we expect that the number on each face
will occur 50 times. However, suppose we observe frequencies of occurrence as follows:
Face value
Occurrence
1
42
2
55
3
38
4
57
5
64
6
44
Hints:
H0: The die is fair
H1: The die is not fair
2) In 200 flips of a coin, one would expect 100 heads and 100 tails. But what if 92 heads and 108
tails are observed? Would we reject the hypothesis that the coin is fair?
3) The president of a major university hypothesizes that at least 90 percent of the teaching and
research faculty will favor a new university policy on consulting with private and public agencies
within the state. Thus, for a random sample of 200 faculty members, the president would expect
0.90 x 200 = 180 to favor the new policy and 0.10 x 200 = 20 to oppose it. Suppose, however, for
this sample, 168 faculty members favor the new policy and 32 oppose it. Is the difference between
observed and expected frequencies sufficient to reject the president's hypothesis that 90 percent
would favor the policy?
Download