Skittles Term Project: Confidence interval estimate and Hypothesis

advertisement
BreAnn McKee
Introduction to Statistics 1040
Justin Christensen, T/Th 5:30-6:50pm
Skittles Term Project: Confidence interval estimate and Hypothesis tests.
 The general propose and meaning of a Confidence Interval.
A confidence Interval is a range of values used to estimate the true value of a population
parameter, instead of a single value. A confidence interval gives us a much better sense of how
good an estimate is, and is associated with a confidence level of success rate of the procedure. A
confidence interval can be used to test a claim made about a population proportion, and can
help make an informal judgment based on the result.
 Construct a 99% confidence interval estimate for the true proportion of yellow
candies.

Results
Sample Size: 289
Observed Proportion: 20%
Confidence Level: 99%
Confidence Interval: ±6.06
Range for the true population proportion of yellow candies.
= 13.94% to 26.06%, percent of yellow candies, a proportion estimate in a bag of
Skittles.
 Construct a 95% confidence interval estimate for the true mean number of candies per
bag.
 Results
Sample size =24 bags of candies
Mean= 60.9
Standard deviation= 2.01
Confidence interval of ±0.8
Range for the true population mean = 60.1 – 61.7 for the mean number of candies per
bag of Skittles

Construct a 98% confidence interval estimate for the standard deviation of the
number of candies per bag.

Results

1.537< Ҩ <2.1294, = range mean standard deviation for the number of candies per bag.
With a 98% Confidence interval level, the repeated experiment for the proportion of candies per
bag, skittles would have a confidence interval estimate for the standard deviation such as
intervals contain the estimated population mean would be 98%.
:
 Discussion and interpretation of my results from the three confidence interval estimates.
The results from my three confidence interval estimates are a true value of proportion of the
population for Skittles. The ranges of the width of skittles may vary from the samples I have
collected. If I had a larger sample size the confidence intervals would be a larger range for the
sample size. But my findings have shown that with high confidence levels that the ranges I got
are the best estimate for the true values of Skittles.
Confidence Interval:
±0.8ange
for the population mean:
60.
 The general meaning and purpose of a Hypothesis Tests
In statistics a hypothesis is a claim or statement about a property of a population. A hypothesis
test is a procedure for testing a claim about a property of a population. The use of the word
property correlates with the definition of a population parameter. Using the hypothesis test we
can answer questions with the most effective outcome to a conclusion or a claim that is made.
 Use a 0.05 significance level to test the claim the 20% of all Skittle candies are Red.
 Results
.05 significance level of a claim that 20% of all skittles are Red:
T score = -1154.297
P-value = 0
Since the P-value is < then the significance level of .05, we can reject the null
hypothesis.
µ= 60.9 X= 320 S= 2.01 Ṕ=1461 Sig. level = .05
T- value = 4927.15771, P-value = 2
P-value is > the significance level .05, we cannot reject the null hypothesis.
 Use a 0.01 significance level to test the claim that the mean number of candies in a
bag of Skittles is 55.
 Results
X= 24 µ=60.9 Ҩ=2.01 p=55 Significance level .01
T-score = -136.14812 P-value = 0
P-value is 0 < the sig. level of .01, we can reject the null hypothesis
 Discussion and interpretation of the two hypothesis tests.
The hypothesis is procedure to test the claim made about a value proportions, from my findings
above I have discovered that the claim of more red Skittles is true and the average candies per
bag is not 55. Samples sizes vary so much that the only way to get a broad scope on the average
amount of candies per bag, is to survey a larger population on the amount of skittles per bag.
Then the ranges could be much higher then my findings and a larger mean of candies per bag
could be found.
 Reflection
The conditions and terms for Confidence Intervals and Hypothesis Tests are gathering a sample
size for a claim you would like to verify. With Skittles the idea is to verify that there are is a
specific amount of each color of skittles per bag. The original data collected showed me that this
is always a deviation in amount per bag and amount of color. To get a smaller rage of deviation
we would need to survey the entire student bodies through put SLCC. Then we could get a
better idea of the true amount and a better claim per bag of candy.
There could be several errors made with the calculations for both tests, because of the use of
several different formulas and the rounding off of final counts. Improvements for these types of
problems would be to reduce the amount of formulas need to find ranges or even T-scores, pvalues, and z-scores. There should be an all in one formula and a table to show the additional
information we might need for the claims.
My conclusion to this type of research is the more accurate of findings would be a larger
population needed to survey and a less invasive calculation to find claims. This would be the
best way to get a true idea of a population portion for even the simple things in life, like a bag of
Skittles.
Download