Confidence Intervals with Means What is the purpose of a confidence interval? To estimate an unknown population parameter Formula: Standard deviation of Critical value statistic Confi dence Interval : x z * n statistic Margin of error In a randomized comparative experiment on the effects of calcium on blood pressure, researchers divided 54 healthy, white males at random into two groups, taking calcium or placebo. The paper reports a mean seated systolic blood pressure of 114.9 with standard deviation of 9.3 for the placebo group. Assume systolic blood pressure is normally distributed. Can you find a z-interval for this problem? Why or why not? Student’s t- distribution • Developed by William Gosset • Continuous distribution • Unimodal, symmetrical, bell-shaped density curve • Above the horizontal axis • Area under the curve equals 1 • Based on degrees of freedom df = n - 1 Graph examples of t- curves vs standard normal curve How does the t-distributions compare to the standard normal distribution? • Shorter & more spread out • More area under the tails • As n increases, t-distributions become more like a standard normal distribution Formula: Standard deviation of Standard error – Critical value statistic when you substitute s for . Confidence Interval : s x t * n statistic Margin of error How to find t* • Use B for t distributions CanTable also use invT on the calculator! • Look up confidence level at bottom & Need t* value with 5% is above – df onupper the sides • df = n – 1 so 95% is below invT(p,df) Find these t* 90% confidence when n = 5 95% confidence when n = 15 t* = 2.132 t* = 2.145 Steps for doing a confidence interval: 1) Assumptions – 2) Calculate the interval 3) Write a statement about the interval in the context of the problem. Statement: (memorize!!) We are ________% confident that the true mean context is between ______ and ______. Assumptions for t-inference • Have an SRS from population (or randomly assigned treatments) • unknown • Normal (or approx. normal) distribution – Given – Large sample size – Check graph of data Use only one of these methods to check normality Ex. 1) Find a 95% confidence interval for the true mean systolic blood pressure of the placebo group. Assumptions: • Have randomly assigned males to treatment • Systolic blood pressure is normally distributed (given). • is unknown 9.3 114.9 2.056 (111.22, 118.58) 27 We are 95% confident that the true mean systolic blood pressure is between 111.22 and 118.58. Ex. 2) A medical researcher measured the pulse rate of a random sample of 20 adults and found a mean pulse rate of 72.69 beats per minute with a standard deviation of 3.86 beats per minute. Assume pulse rate is normally distributed. Compute a 95% confidence interval for the true mean pulse rates of adults. We are 95% confident that the true mean pulse rate of adults is between 70.883 & 74.497. Ex 2 continued) Another medical researcher claims that the true mean pulse rate for adults is 72 beats per minute. Does the evidence support or refute this? Explain. The 95% confidence interval contains the claim of 72 beats per minute. Therefore, there is no evidence to doubt the claim. Ex. 3) Consumer Reports tested 14 randomly selected brands of vanilla yogurt and found the following numbers of calories per serving: 160 200 220 230 120 180 140 130 170 190 80 120 100 170 Compute a 98% confidence interval for the average calorie content per serving of vanilla yogurt. We are 98% confident that the true mean calorie content per serving of vanilla yogurt is between 126.16 calories & 189.56 calories. Ex 3 continued) A diet guide claims that you will get 120 calories from a serving Note: confidence intervals tell us of vanilla yogurt. What does this if something is NOT EQUAL – evidence indicate? never less or greater than! Since 120 calories is not contained within the 98% confidence interval, the evidence suggest that the average calories per serving does not equal 120 calories. CI & p-values deal with area in the tails Robust – is the area changed greatly when there is skewness • An inference procedure is ROBUST if the confidence level or p-value doesn’t change much if the normality assumption is violated. Since there is more area in the tails in tdistributions,can then, a distribution has • t-procedures beif used with some some skewness, tail area not skewness, as long the as there areisno greatly affected. outliers. • Larger n can have more skewness. Find a sample size: • If a certain margin of error is wanted, then to find the sample size necessary for that margin of error use: m z * n Always round up to the nearest person! Ex 4) The heights of PWSH male students is normally distributed with = 2.5 inches. How large a sample is necessary to be accurate within + .75 inches with a 95% confidence interval? n = 43 Some Cautions: • The data MUST be a SRS from the population (or randomly assigned treatment) • The formula is not correct for more complex sampling designs, i.e., stratified, etc. • No way to correct for bias in data Cautions continued: • Outliers can have a large effect on confidence interval • Must know to do a z-interval – which is unrealistic in practice