Chapter 7 Confidence Intervals

advertisement
Chapter 7
Confidence Intervals
INTRODUCTION
DERIVATION
NOTATIONS AND GENERAL FORM
1 SAMPLE Z-INTERVAL FOR Μ
1 SAMPLE T-INTERVAL FOR Μ
2 INDEPENDENT SAMPLE T-INTERVAL FOR Μ
2 DEPENDENT SAMPLE T-INTERVAL FOR Μ
Z-INTERVAL FOR P
SAMPLE SIZE
Introduction
In this chapter, we will estimate the population parameter
using an interval.
This interval will ‘capture’ the true population parameter with
a certain measure of precision.
Example: A 95% confidence interval for µ is (13, 18)
Notations and General Form
CI - confidence interval
CV - critical value
ME - margin of error
SE - standard error
SD - standard deviation
pt.est. - point estimate
Zα/2 - normal distribution critical value(use invnorm)
t(n-1,α/2) - students t distribution critical value with n-1
degrees of freedom (use math solver or the invT)
Notations and General Form
General Formula:
Pt .est  CV  SE
Notations and General Form
PointEstimate
Pointestimate
Parameter

X
pˆ
p
Standard Error(SE)
for X, SE  SD/ n
for pˆ , SE 
pˆ (1- pˆ )
n
Notations and General Form
Critical Value
Z-critical value
Z1-α/2 = InvNorm(1 - α/2)
example: 95% Confidence interval
α = 1 – 0.95
= 0.05
Z 1-α /2 = InvNorm(1- α/2)
= InvNorm(1-(.05/2))
= 1.96
Notations and General Form
Critical Value
T-critical value
t (1-α /2, df) = InvT(1-α/2, degrees of freedom)
or
t (1-α /2, df) = L on math solver (for TI-83)
Math -> solver -> tcdf(L,U,D) – A
L = CV
U = 9999
D = degrees of freedom
A = α/2
Notations and General Form
Example: 95% confidence interval based on a sample
size of 20. (α = .05, df = n-1 = 19)
T-critical value
t (1-α /2, df) = InvT(1-(0.05/2), 19)
Math->solver->tcdf(L,U,D) – A
L = CV (highlight and then press alpha enter)
U = 9999
D = 19
A = .025
ANSWER: t (1-0.025, 19) = 2.0930
1 Sample Z-interval for µ
Population Standard Deviation (σ) is known
Formula
X  Z / 2( / n )
where
X  pt.est.
Z1 / 2  Crit icalValue
( / n )  St andard Error
Z1 / 2 ( / n )  Margin of Error
1 Sample Z-interval for µ
Suppose the time allotted for commercials on a primetime
TV program is known to have a normal distribution with a
standard deviation of 1.5 minutes. A study of 25 showings
gave an average commercial time of 11 minutes. Find the
95% confidence interval for the true population mean, μ.
Given:
  1.5 (thisis theknown populationstandarddeviation)
n  25
X  11
Confidence level = 0.95
Critical value = invNorm(1-.05/2) = 1.96
1 Sample Z-interval for µ
A 95% confidence interval for μ is
 pt.est  CV * SE
 X  Z * ( / n )
 11 1.96* (1.5 / 25)
 11 1.96* 0.3
 11 0.588
 (10.412,11.588)
We are 95% confident that the average time in
commercials is between 10.412 and 11.588 minutes
Using 1-sampZInterval in TI-83/84
Stat -> Tests -> ZInterval
Notes on intervals
Effect of confidence level
Higher confidence level results to a longer confidence
interval since CV increases as α decreases.
Example, Z = 1.645 when α = .10 while Z = 1.96 when α = .05
Effect of sample size
Increasing sample size (n) shortens the confidence
interval since SE = SD/sqrt(n).
Note: level of significance (α)
confidence level (1- α)*100%
1 Sample t-interval for µ
Population Standard Deviation (σ) is unknown
Formula
X  t ( / 2, n  1)( SD / n )
where
X  pt.est.
t (1   / 2, n  1)  CriticalValue
SD / n  Standard Error
t (1   / 2, n  1)( SD / n )  Margin of Error
df  n - 1
1 Sample t-interval for µ
A random sample of 12 graduates of a certain secretarial school typed an
average of 79.3 words per minute with a standard deviation of 7.8 words
per minute. Assuming normal distribution for the number of words typed
per minute, find a 95% confidence interval for the average number of
words typed by all graduates of this school.
Given:
  not given
n  12
X  79.3
SD  7.8
Confidence level = 0.95
Critical value = 2.201
1 Sample t-interval for µ
A 95% confidence interval for μ is
 X  t ( SD / n )
 79.3  2.201* (7.8 / 12)
 79.3  2.201* 2.2517
 79.3  4.9559
 (74.3441,84.2559)
We are 95% confident that the average number of words
the graduates type per minute is between 74.3441 and 84.2559
words.
Using 1-samptInterval in TI-83/84
Stat -> Tests -> tInterval
2 Independent Sample t-interval for µ
.
Confidence Interval for µ1 - µ2
Formula
( X 1  X 2 )  t ( / 2, n1  n 2  2)
1 1
S   
 n1 n2 
2
p
where
Sp
2
(n1  1) S12  (n2  1) S 22

n1  n2  2
t( / 2, n1  n 2  2)  CriticalValue
Requirement 1: Both samples where taken from a normal distribution
Requirement 2: The population standard deviations are equal (σ1=σ2=σ)
2 Independent Sample t-interval for µ
Do credit cards with no annual fee charge higher interest rates than
cards that have annual fees? Among 29 cards surveyed, 17 had no
annual fees while 12 charged an annual fee. Among the cards with no
annual fee, the average interest rate was 19% (SD = 8%). Among cards
with an annual fee, the average interest rate was 17% (SD = 3%).
a. What assumptions do you need to get a confidence interval for the
difference in average interest rate?
b. Calculate the estimate of the common standard deviation.
c. Construct a 95% interval estimate for the difference in average
interest rates.
2 Independent Sample t-interval for µ
a. What assumptions do you need to get a confidence interval for
the difference in average interest rate?
The samples were taken from normally distributed populations
and that they have a common standard deviation.
2 Independent Sample t-interval for µ
b. Calculate the estimate of the common standard deviation.
Average ( )
SD (S)
Sample Size (n)
No Annual Fee
0.19
0.08
17
S2p
With Annual Fee
0.17
0.03
12
X WO  X W
0.19 – 0.17 = 0.02
2
(nWO  1) SWO
 (nW  1) SW2

nWO  nW  2
(17  1)0.082  (12  1)0.032

17  12  2
 0.004159
2 Independent Sample t-interval for µ
c. Construct a 95% interval estimate for the difference in average interest
rates.
Level of Confidence (1-α) = 0.95  α = 0.05
Critical value (tα/2,df=n1+n2-2 = t0.05/2,df=27 = t0.025,27) = 2.052
A 95% confidence interval for is calculated as
1
1 

( X WO  X W )  t S p2  
 n1 n2 
1 
 1
0.02  2.052 0.0041592593



17
12


0.02  2.052(0.0243159875
)
0.02  0.0498964063
( 0.0299,0.06989)
Therefore, with 95% confidence, the difference in average interest
rate will lie between -2.99% and 6.99%.
2 Independent Sample t-interval for µ
Using 2-samptInterval in TI-83/84
Average ( )
SD (S)
Sample Size (n)
No Annual Fee
0.19
0.08
17
Stat -> Tests -> 2samptInt
With Annual Fee
0.17
0.03
12
0.19 – 0.17 = 0.02
2 Independent Sample t-interval for µ
Suppose it is of interest to estimate the difference in
the first exam scores of STAT 2160 male and female
students. A random sample of 16 males and 17
females taking the course this semester was drawn
and asked about their scores. Among male students,
the average was found to be 15.28 with a standard
deviation of 2.3, while among females, the average is
16.5 with a standard deviation of 1.6. Assuming the
scores follow a normal distribution with equal
variances, construct a 95% CI for the true difference
in the first exam scores of male and female STAT
2160 students.
2 Independent Sample t-interval for µ
Step 1: Create a table of given values
MALE
FEMALE
n
16
17
Mean
15.28
16.5
Standard Deviation
2.3
1.6
Step 2: Compute for the common variance
2
2
(
n

1
)
S

(
n

1
)
S
1
2
2
S p2  1
n1  n2  2
2 Independent Sample t-interval for µ
2
2
(
16

1
)
2
.
3

(
17

1
)
1
.
6
2
Sp 
16  17  2
S  3.880967742
2
p
2 Independent Sample t-interval for µ
Step 3: Obtain the Critical Value (CV)
95% CI  α = 0.05
CV = invT(1-α/2, n1+n2-2)
= invT(0.975, 31)
= 2.0395
2 Independent Sample t-interval for µ
Step 4: Plug-in the values in the CI formula
( X 1  X 2 )  t ( / 2, n1  n 2  2)
1 1
S   
 n1 n2 
2
p
1
1
(15.28 16.5)  2.0395 3.880967742
  
 16 17 
 1.22 2.0395(0.6861870765
)
 2.6195,0.1795
iClicker
2 Dependent Sample t-interval for µ
There is one sample but measured twice
Formula
X DIFF  t ( / 2, n  1)( SDDIFF / n )
where
X DIFF  averageof thedifferences
t ( / 2, n  1)  Critical value
SD DIFF / n  SE of thedifferences
2 Dependent Sample t-interval for µ
The table below shows the opening and closing prices of a sample of 10 active
stocks on a certain day. Give an estimate of the difference in the average stock
prices? What is the corresponding standard error of this estimate?
Intel Corp
Citigroup Inc
Bank of America Corp
JPMorgan Chase & Co
General Electric Co
Microsoft Corp
Pfizer Inc
Exxon Mobil Corp
Coca-Cola Co
Alcoa Inc
Opening Price
16.09
17.47
25.6
41.52
20.11
23.77
17.03
67.69
46.48
11.54
Closing Price
16.25
16.32
24.67
42.33
19.37
23.44
16.93
62.92
49.23
10.09
2 Dependent Sample t-interval for µ
First, we need to take the difference between the two prices for each
stock.
Intel Corp
Citigroup Inc
Bank of America Corp
JPMorgan Chase & Co
General Electric Co
Microsoft Corp
Pfizer Inc
Exxon Mobil Corp
Coca-Cola Co
Alcoa Inc
Opening Price
Closing Price
Difference
16.09
17.47
25.6
41.52
20.11
23.77
17.03
67.69
46.48
11.54
16.25
16.32
24.67
42.33
19.37
23.44
16.93
62.92
49.23
10.09
-0.16
1.15
0.93
-0.81
0.74
0.33
0.1
4.77
-2.75
1.45
2 Dependent Sample t-interval for µ
Then calculate the mean of the differences and the corresponding
standard deviation then calculate the standard error.
X DIFF  0.575
S DIFF  1.904592403
S DIFF 1.904592403
SE( X DIFF ) 

 0.6022850008
n
10
2 Dependent Sample t-interval for µ
Construct a 95% confidence interval for the
difference in opening and closing stock prices.
Use 1-samptInterval in TI-83/84 since we converted
the two samples into a sample of differences.
Answer: (-0.7875, 1.9375)
Z-interval for P
Formula



p  Z / 2 p(1  p) / n
where
Z / 2  Crit icalValue


p(1  p) / n  Standard Error
Z-interval for P
In a random sample of 500 families owning television sets in the city of
Hamilton, Canada, it was found that 340 subscribed to HBO. Find a 95%
confidence interval for the actual proportion of families in this city who
subscribe to HBO.
Given:
# successesin thesample 340

p 

 0.68
sample size
500
Find: A 95% CI for the population proportion, p.
Level of Confidence (1-α) = 0.95  α = 0.05
Critical value (z1-α/2 = z1-0.05/2 = z0.975) = InvNorm(0.975) = 1.96
Z-interval for P
The confidence interval is calculated as follows:



p  z p(1  p) / n
0.68  1.96
0.68* (1  0.68)
500
0.68  1.96(0.0208614477
)
0.68  0.0408884375
0.6391,0.7209
Therefore, we are 95% confident that the actual proportion of
families in this city who subscribe to HBO is between 64% and
72%.
A function in TI-83/84 is 1propZInt. Stat -> Tests -> 1propZInt
iClicker
Sample Size
In order to be (1-α) x 100% confident that the sample mean is within a
distance ME of the mean μ, choose a sample size equal to
n = z2σ2/M2
For computing sample size for estimating population proportion,
the formula is
n
2

z p (1  p )
M2
Sample Size
A consumer group wishes to estimate the average electric bills for the month of
July for single-family homes in a large city. Based on studies conducted in other
cities, the standard deviation is assumed to be $25. The group wants to estimate
the average bill for July to be within  $5 of the true average with 95% confidence.
a. How many single-family homes should be selected?
b. If the group wants to be correct to within  $10, what sample size is
necessary?
c. If 99% confidence and a sampling error of  $5 are desired, how many
single-family homes are necessary?
Given:
ME = 5
Level of Confidence (1-α) = 0.95  α = 0.05
Critical value (zα/2 = z0.05/2 = z0.025) = InvNorm(0.025) = -1.96
σ = 25
Sample Size
a. How many single-family homes should be selected?
Given:
ME = 5
Level of Confidence (1-α) = 0.95  α = 0.05
Critical value (zα/2 = z0.05/2 = z0.025) = InvNorm(0.025) = -1.96
σ = 25
n

z 2 2
M2
(1.96) 2 (25) 2
(5) 2
 96.04
 97 single- family homes
Sample Size
b. If the group wants to be correct to within  $10, what sample size is
necessary?
n

z 2 2
M2
(1.96) 2 (25) 2
(10) 2
 24.01
 25 single- family homes
Sample Size
c. If 99% confidence and a sampling error of  $5 are desired, how many
single-family homes are necessary?
M=5
Level of Confidence (1-α) = 0.99  α = 0.01
Critical value (zα/2 = z0.01/2 = z0.005) = InvNorm(0.01/2) = -2.576
σ = 25
n

z 2 2
M2
(2.576) 2 (25) 2
(5) 2
 165.89
 166single- family homes
iClicker
Sample Size
In a random sample of 500 families owning television sets in the city of Hamilton,
Canada, it was found that 340 subscribed to HBO. Suppose that a CI for the
proportion of families who subscribe to HBO is computed at 90% confidence level
and within ± 0.05, what will the new sample size be?

Given : p  0.68
Z/2  invNorm(.10/2)  1.645
ME  0.05


z 2 p (1  p )
n
ME 2
2
1.645 * .68* (1  .68)

2
0.05
 235.53
 236
iClicker
Download