Chapter 8: Interval Estimation

advertisement
Chapter 8: Interval Estimation
•
Point Estimation:
is a point estimator of is a point estimator of p
•
Interval Estimation:
: ± Margin of Error
p: ± Margin of Error
known
± ± √
Confidence interval for :
± Margin of Error
unknown
± √
Copyright Reserved
1
8.1 Population mean: known
Example:
Given:
Consider marks that are normally distributed with 10.
Let = sample average = 58 with 16.
Question:
Calculate a 95% confidence interval for :
Answer:
1 – 0.95 = 0.05 = = level of significance
0.05
0.025
2
2
± √
58 ± 1.96 10
√16
58
58 ± 4.9
4.9,58 # 4.9
53.1,62.9
95% confident that is between 53.1 and 62.9.
Copyright Reserved
2
FOCUSING ON THE MARGIN OF ERROR:
± Margin of Error
± √
1.96 10
√16
= 4.9
95% of the time the sampling error will be 4.9 or less.
|
| |
|
OR
There is a 0.95 probability that the sample mean, , will provide a sampling error of 4.9 or less.
Copyright Reserved
3
Example:
Given:
Consider marks that are normally distributed with 10.
Let = sample average = 58 with 16.
Question:
Calculate a 90% confidence interval for :
Answer:
1 – 0.9 = 0.1 = = level of significance
0.1
0.05
2
2
± √
58 ± 1.645 10
√16
58
58 ± 4.1125
4.1125,58 # 4.1125
53.8875,62.1125
90% confident that is between 53.8875 and 62.1125.
Copyright Reserved
4
FOCUSING ON THE MARGIN OF ERROR:
± Margin of Error
± √
1.645 10
√16
= 4.1125
90% of the time the sampling error will be 4.1125 or less.
|
| |
|
OR
There is a 0.90 probability that the sample mean, , will provide a sampling error of 4.1125 or less.
Copyright Reserved
5
Example:
Given:
Consider marks that are normally distributed with 10.
Let = sample average = 58 with 16.
Question:
Calculate a 99% confidence interval for :
Answer:
1 – 0.99 = 0.01 = = level of significance
0.01
0.005
2
2
± √
58 ± 2.576 10
√16
58
58 ± 6.44
6.44,58 # 6.44
51.56,64.44
99% confident that is between 51.56 and 64.44.
Copyright Reserved
6
FOCUSING ON THE MARGIN OF ERROR:
± Margin of Error
± √
2.576 10
√16
= 6.44
99% of the time the sampling error will be 6.44 or less.
|
| |
|
OR
There is a 0.99 probability that the sample mean, , will provide a sampling error of 6.44 or less.
Copyright Reserved
7
Confidence Level
Confidence
coefficient
'
'
(
)'
Margin of Error
90%
0.90
0.10
0.05
1.645
1.645
*
95%
0.95
0.05
0.025
1.960
1.960
*
99%
0.99
0.01
0.005
2.576
2.576
*
(
Note: • = level of significance
• level of significance + confidence coefficient = 1
Graph:
8.2 Population mean: unknown
± √
______________________________________________________________________________
t – distribution:
•
•
•
Symmetric around 0.
Degrees of freedom (df) = n – 1.
Table in textbook.
Copyright Reserved
8
t Distribution
Area or
Probability
0
Degrees
of
freedom
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
t
Area in Upper Tail
0.20
1.376
1.061
0.978
0.941
0.920
0.906
0.896
0.889
0.883
0.879
0.876
0.873
0.870
0.868
0.866
0.865
0.863
0.862
0.861
0.860
0.859
0.858
0.858
0.857
0.856
0.856
0.855
0.855
0.854
0.854
0.853
0.853
0.853
0.852
0.10
3.078
1.886
1.638
1.533
1.476
1.440
1.415
1.397
1.383
1.372
1.363
1.356
1.350
1.345
1.341
1.337
1.333
1.330
1.328
1.325
1.323
1.321
1.319
1.318
1.316
1.315
1.314
1.313
1.311
1.310
1.309
1.309
1.308
1.307
0.05
6.314
2.920
2.353
2.132
2.015
1.943
1.895
1.860
1.833
1.812
1.796
1.782
1.771
1.761
1.753
1.746
1.740
1.734
1.729
1.725
1.721
1.717
1.714
1.711
1.708
1.706
1.703
1.701
1.699
1.697
1.696
1.694
1.692
1.691
0.025
12.706
4.303
3.182
2.776
2.571
2.447
2.365
2.306
2.262
2.228
2.201
2.179
2.160
2.145
2.131
2.120
2.110
2.101
2.093
2.086
2.080
2.074
2.069
2.064
2.060
2.056
2.052
2.048
2.045
2.042
2.040
2.037
2.035
2.032
0.01
31.821
6.965
4.541
3.747
3.365
3.143
2.998
2.896
2.821
2.764
2.718
2.681
2.650
2.624
2.602
2.583
2.567
2.552
2.539
2.528
2.518
2.508
2.500
2.492
2.485
2.479
2.473
2.467
2.462
2.457
2.453
2.449
2.445
2.441
0.005
63.657
9.925
5.841
4.604
4.032
3.707
3.499
3.355
3.250
3.169
3.106
3.055
3.012
2.977
2.947
2.921
2.898
2.878
2.861
2.845
2.831
2.819
2.807
2.797
2.787
2.779
2.771
2.763
2.756
2.750
2.744
2.738
2.733
2.728
Copyright Reserved
9
t distribution (Continued)
Degrees
of
freedom
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
Area in Upper Tail
0.20
0.852
0.852
0.851
0.851
0.851
0.851
0.850
0.850
0.850
0.850
0.850
0.850
0.849
0.849
0.849
0.849
0.849
0.849
0.848
0.848
0.848
0.848
0.848
0.848
0.848
0.848
0.848
0.847
0.847
0.847
0.847
0.847
0.847
0.847
0.847
0.847
0.847
0.847
0.847
0.847
0.846
0.846
0.846
0.846
0.846
0.10
1.306
1.306
1.305
1.304
1.304
1.303
1.303
1.302
1.302
1.301
1.301
1.300
1.300
1.299
1.299
1.299
1.298
1.298
1.298
1.297
1.297
1.297
1.297
1.296
1.296
1.296
1.296
1.295
1.295
1.295
1.295
1.295
1.294
1.294
1.294
1.294
1.294
1.293
1.293
1.293
1.293
1.293
1.293
1.292
1.292
0.05
1.690
1.688
1.687
1.686
1.685
1.684
1.683
1.682
1.681
1.680
1.679
1.679
1.678
1.677
1.677
1.676
1.675
1.675
1.674
1.674
1.673
1.673
1.672
1.672
1.671
1.671
1.670
1.670
1.669
1.669
1.669
1.668
1.668
1.668
1.667
1.667
1.667
1.666
1.666
1.666
1.665
1.665
1.665
1.665
1.664
0.025
2.030
2.028
2.026
2.024
2.023
2.021
2.020
2.018
2.017
2.015
2.014
2.013
2.012
2.011
2.010
2.009
2.008
2.007
2.006
2.005
2.004
2.003
2.002
2.002
2.001
2.000
2.000
1.999
1.998
1.998
1.997
1.997
1.996
1.995
1.995
1.994
1.994
1.993
1.993
1.993
1.992
1.992
1.991
1.991
1.990
0.01
2.438
2.434
2.431
2.429
2.426
2.423
2.421
2.418
2.416
2.414
2.412
2.410
2.408
2.407
2.405
2.403
2.402
2.400
2.399
2.397
2.396
2.395
2.394
2.392
2.391
2.390
2.389
2.388
2.387
2.386
2.385
2.384
2.383
2.382
2.382
2.381
2.380
2.379
2.379
2.378
2.377
2.376
2.376
2.375
2.374
0.005
2.724
2.719
2.715
2.712
2.708
2.704
2.701
2.698
2.695
2.692
2.690
2.687
2.685
2.682
2.680
2.678
2.676
2.674
2.672
2.670
2.668
2.667
2.665
2.663
2.662
2.660
2.659
2.657
2.656
2.655
2.654
2.652
2.651
2.650
2.649
2.648
2.647
2.646
2.645
2.644
2.643
2.642
2.641
2.640
2.639
Copyright Reserved
10
t distribution (Continued)
Degrees
of
freedom
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
∞
Area in Upper Tail
0.20
0.846
0.846
0.846
0.846
0.846
0.846
0.846
0.846
0.846
0.846
0.846
0.846
0.846
0.846
0.845
0.845
0.845
0.845
0.845
0.845
0.845
0.842
0.10
1.292
1.292
1.292
1.292
1.292
1.292
1.291
1.291
1.291
1.291
1.291
1.291
1.291
1.291
1.291
1.291
1.290
1.290
1.290
1.290
1.290
1.282
0.05
1.664
1.664
1.664
1.663
1.663
1.663
1.663
1.663
1.662
1.662
1.662
1.662
1.662
1.661
1.661
1.661
1.661
1.661
1.661
1.660
1.660
1.645
0.025
1.990
1.990
1.989
1.989
1.989
1.988
1.988
1.988
1.987
1.987
1.987
1.986
1.986
1.986
1.986
1.985
1.985
1.985
1.984
1.984
1.984
1.960
0.01
2.374
2.373
2.373
2.372
2.372
2.371
2.370
2.370
2.369
2.369
2.368
2.368
2.368
2.367
2.367
2.366
2.366
2.365
2.365
2.365
2.364
2.326
0.005
2.639
2.638
2.637
2.636
2.636
2.635
2.634
2.634
2.633
2.632
2.632
2.631
2.630
2.630
2.629
2.629
2.628
2.627
2.627
2.626
2.626
2.576
Copyright Reserved
11
Example:
Given: 15, 53.87 and 6.82. Take note: is unknown
Question:
Calculate a 95% confidence interval for :
1 – 0.95 = 0.05 = = level of significance
Answer:
0.05
0.025
2
2
df = n – 1 = 15 – 1 = 14
± √
53.87 ± 2.145 6.82
√15
53.87
53.87 ± 3.78
3.78,53.87 # 3.78
50.09,57.65
95% confident that is between 50.09 and 57.65.
Obtaining + using Excel:
,
= TINV(, df)
= TINV(0.05, 14)
= 2.144787
Copyright Reserved
12
Obtaining the level of significance using Excel:
= TDIST(+ , df, tails)
,
= TDIST(2.145, 14, 2)
= 0.04998 ≈ 0.05
FOCUSING ON THE MARGIN OF ERROR:
± Margin of Error
± √
2.145 6.82
√15
= 3.78
95% of the time the sampling error will be 3.78 or less.
|
| |
|
OR
There is a 0.95 probability that the sample mean, , will provide a sampling error of 3.78 or less.
Copyright Reserved
13
Example:
Given: 15, 53.87 and 6.82. Take note: is unknown
Question:
Calculate a 90% confidence interval for :
1 – 0.9 = 0.1 = = level of significance
Answer:
0.1
0.05
2
2
df = n – 1 = 15 – 1 = 14
± √
53.87 ± 1.761 6.82
√15
53.87
53.87 ± 3.1
3.1,53.87 # 3.1
50.77,56.97
90% confident that is between 50.77 and 56.97.
Obtaining + using Excel:
,
= TINV(, df)
= TINV(0.1, 14)
= 1.76131
Copyright Reserved
14
Obtaining the level of significance using Excel:
= TDIST(+ , df, tails)
,
= TDIST(1.761, 14, 2)
= 0.100054 ≈ 0.1
FOCUSING ON THE MARGIN OF ERROR:
± Margin of Error
± √
1.761 6.82
√15
= 3.1
90% of the time the sampling error will be 3.1 or less.
|
| |
|
OR
There is a 0.90 probability that the sample mean, , will provide a sampling error of 3.1 or less.
Copyright Reserved
15
8.4 Population proportion
p = population proportion
= sample proportion
Interval estimate of a population proportion:
± Margin of error
± .
± /
1
To use this expression to develop an interval estimate of a population proportion, p, the value of p
would have to be known. But, the value of p is what we are trying to estimate, so we simply
substitute the sample proportion for p. Therefore,
± /
1
Example: Women Golfers
Given:
A national survey of 902 women golfers was taken to learn how women golfers view themselves
as being treated at golf courses.
The survey found that 397 if the women golfers felt that they were treated fairly.
Question:
Calculate a 95% confidence interval for p.
Answer:
397
0.4401
902
Copyright Reserved
16
± /
0.4401 ± 1.96/
0.4401
1
0.44011 0.4401
902
0.4401 ± 0.0324
0.0324,0.4401 # 0.0324
0.4077,0.4725
95% confident that p is between 0.4077 and 0.4725.
FOCUSING ON THE MARGIN OF ERROR:
± Margin of error
± /
1.96/
1
0.44011 0.4401
902
= 0.0324
95% of the time the sampling error will be 0.0324 or less.
|
| |
|
OR
There is a 0.95 probability that the sample proportion, , will provide a sampling error of 0.0324 or
less.
Copyright Reserved
17
Question:
Calculate a 90% confidence interval for p.
Answer:
± /
1
0.44011 0.4401
0.4401 ± 1.645/
902
0.4401
0.4401 ± 0.027189
0.027189,0.4401 # 0.027189
0.4129,0.467
90% confident that p is between 0.4129 and 0.467.
FOCUSING ON THE MARGIN OF ERROR:
± Margin of error
± /
1
0.44011 0.4401
1.645/
902
= 0.027189
90% of the time the sampling error will be 0.027189 or less.
|
| |
|
OR
There is a 0.9 probability that the sample proportion, , will provide a sampling error of 0.027189 or
less.
Copyright Reserved
18
Relationships between the z and t – distributions
Characteristics of the t-distribution:
Symmetric around 0.
Has one degree of freedom, namely n – 1.
As the degrees of freedom increase, the t-distribution tends to the standard normal
distribution.
For “large sample cases” ( ≥ 30 the t-distribution approaches the standard normal
distribution.
In Figure 1 it is illustrated that as the degrees of freedom increase, i.e. as n – 1 increases, i.e. as the
sample size n increases, the t-distribution tends to the standard normal distribution.
Figure 1
•
•
•
The red curve represents the t-distribution with 4 degrees of freedom (denoted t(4));
the blue curve represents the t-distribution with 10 degrees of freedom (denoted t(10));
the black curve represents both the t-distribution with df tending to infinity (denoted t(∞))
and the standard normal distribution (denoted z-distribution).
Copyright Reserved
19
The importance of this? Take note of the following:
95% confidence interval
When working with a 95% confidence interval using the standard normal distribution we have:
The + value is obtained using the standard normal table.
,
When working with a 95% confidence interval using the t-distribution where the df tends to
infinity we have:
The + value is obtained using the t-table with area in the upper tail = 0.025 and df = ∞.
,
Note: The + and + values are the same, since the t-distribution tends to the standard normal
,
,
distribution as the df increase.
Copyright Reserved
20
90% confidence interval
When working with a 90% confidence interval using the standard normal distribution we have:
The + value is obtained using the standard normal table.
,
When working with a 90% confidence interval using the t-distribution where the df tends to
infinity we have:
The + value is obtained using the t-table with area in the upper tail =0.05 and df = ∞.
,
Note: The + and + values are the same, since the t-distribution tends to the standard normal
,
,
distribution as the df increase.
Copyright Reserved
21
99% confidence interval
When working with a 99% confidence interval using the standard normal distribution we have:
The + value is obtained using the standard normal table.
,
When working with a 99% confidence interval using the t-distribution where the df tends to
infinity we have:
The + value is obtained using the t-table with area in the upper tail =0.005 and df = ∞.
,
Note: The + and + values are the same, since the t-distribution tends to the standard normal
,
,
distribution as the df increase.
Copyright Reserved
22
Typical exam questions
Questions 1 and 2 are based on the following information:
The proportion of business travellers who is dissatisfied with the service of an airline is investigated.
A manager selects a systematic sample of 50 business travellers, from which 10 said that they are
dissatisfied with the service.
Let: p = population proportion of dissatisfied business travellers
Question 1
The lower limit of a 95% confidence interval for the population proportion is:
Answer 1
/ 3
.45.
6
0.2
1.963
7.7.8
97
47
0.089 with 97 0.2.
Question 2
If the confidence coefficient of a confidence interval decreases from 0.95 to 0.90 the:
(A) sample size increases.
(B) interval is narrower.
(C) significance level is smaller.
(D) margin of error is larger.
(E) standard error is larger.
Answer 2
If the confidence coefficient of a confidence interval decreases from 0.95 to 0.90, the / value
decreases from 1.96 to 1.645 and, consequently, the interval is narrower. Answer = B.
Question 3 is based on the following information:
A Business travel magazine rates the service of airlines on a regular basis (the rating scale with a low
score of 0 and a high score of 10 was used). It is known that 1.05. An airline was rated by 30
randomly selected business travellers which provided a sample mean of 7.5.
Question 3
The upper limit of a 99% confidence interval for the population mean is:
Answer 3
:
4.79
# / 6 7.5 # 2.576
7.99
√
√;7
Questions 4 to 10 are based on the following information:
The management team of a soccer stadium wants to estimate the average amount (in Rand) spent on
snacks and cool drinks per spectator. It is known that the amount (in Rand) is normally distributed.
They are also interested in the method of payment used for the purchase namely, credit card or cash.
Let the population mean of the amount (in Rand) spent on snacks and cool drinks per spectator.
the population proportion of spectators who paid with cash.
̅ the average amount (in Rand) spent on snacks and cool drinks.
Consider the following results in Excel:
Copyright Reserved
23
Formula worksheet:
Note: Rows 10 to 40 are hidden.
Value worksheet:
Note: Rows 10 to 40 are hidden
Question 4
The point estimate for the population mean of the amount (in Rand) spent is:
Answer 4
∑ >
6
494?.;@
A7
37.93
Question 5
The point estimate of the population proportion is:
Answer 5
In cell D3 of Excel the =COUNTIF(B2:B41, B3) function counts the number of payments made
using a credit card, i.e. 30 out of 40 payments were made using a credit card. Therefore, 40 – 30 = 10
payments were made using cash.
47
A7 0.25
Copyright Reserved
24
Question 6
When is used to estimate , the interval estimate for the population mean is based on the:
(A) standard normal distribution
(B) binomial distribution
(C) normal distribution
(D) B-distribution
(E) uniform distribution
Question 7
The margin of error of a 95% confidence interval for is:
Answer 7
To obtain the value of / we use the TINV function of Excel =TINV(, df) = TINV(0.05, 39) =
2.023 (this is given in cell D6 of Excel) with df = n – 1 = 40 – 1 = 39. To obtain the sample standard
deviation, we take the square root of the variance (this is given in cell D4 of Excel).
The margin of error is equal to /
C
√6
2.023 D
?
E 2.2391.
√A7
Question 8
The lower limit of a 99% confidence interval for the population proportion is:
Answer 8
/ 3
.45.
6
0.25
2.5763
7.97.?9
A7
0.0736
Question 9
The margin of error of a 95% confidence interval for the population proportion is:
Answer 9
/ 3
.45.
6
1.963
7.97.?9
A7
0.1342
Question 10
If the confidence coefficient of a confidence interval for is decreased from 95% to 90%, then:
(A) the standard error decreases, which implies a narrower interval.
(B) the standard error increases, which implies a wider interval.
(C) the sample size increases, which implies a narrower interval.
(D) lower limit increases, which implies a wider interval.
(E) the margin of error decreases, which implies a narrower interval.
Copyright Reserved
25
Download