11-1 Day 3

advertisement
AP STATISTICS
LESSON 11 – 1
(DAY 3)
Matched Pairs t Procedures
ESSENTIAL QUESTION:
How are matched pair designs included
in one-sample data analysis?
Objectives:

To construct t confidence intervals for matched pair
designs.

To construct t significance tests for matched pair
designs.

To define Robustness of t procedures.
Matched Pairs t Procedures
The taste test in Example 11.3 was a matched pairs
study in which the same 10 tasters rated before-andafter sweetness. Comparative studies are more
convincing than single-sample investigations. For that
reason, one-sample inference is less common than
comparative inference.
One common design to compare two treatments makes
use of one-sample procedures. In matched pairs
design, subjects are matched in pairs and each
treatment is given to one subject in each pair.
Matched Pairs t Procedures
(continued…)
To compare the responses to the two treatments in a
matched pairs design, apply the one-sample t
procedures to the observed differences.
The parameter μ in a matched pairs t procedure is the
mean difference in the responses to the two treatments
within matched pairs of subjects in the entire
population.
Example 11.4
Page 629
Floral Scents and Learning



The randomization is important because subjects tend to
improve their times as they work a maze repeatedly.
To analyze these data, subtract the scented time from the
unscented time for each subject. The 21 differences form
a single sample.
Inference procedures for comparing two samples assume
that the samples are selected independently of each other.
This assumption does not hold when the same subjects
are measured twice. The proper analysis depends on the
design used to produce the data.
Example 11.5 Page 632
Is Caffeine Dependence Real?
Construct a 90% confidence interval for the mean change in
depression score.
We are 90% confident that the actual mean difference in depression
score for the population is between 3.579 and 11.149 points. We
estimated that caffeine-dependent individuals would score, on average,
between 3.6 and 11.1 points higher on the Beck Depression Inventory
when the are given placebo instead of caffeine.
This study provides evidence that withholding caffeine from caffeinedependent individuals may lead to depression.
Robustness of t Procedures
The t confidence interval and test are exactly correct
when the distribution of the population is exactly
normal. No real data are exactly normal.
Because of the tails of normal curves drop off quickly,
samples from normal distributions will have very few
outliers . Outliers suggest that your data are not a
sample from a normal population. Like x and s, the t
procedures are strongly influence by outliers.
Robustness Procedures
(continued…)
A confidence interval of significance test is
called robust if the confidence level or P-value
does not change very much when the
assumptions of the procedure are violated.
Example 11.6 Page 635
The Effects of Outliers
One of the 46 engines in our sample emitted an
unusually high amount (2.94 grams/mile) of
NOX. If remove that single data point, What
will happen.
 Fortunately, the t procedures are quite robust
against nonnormality of the population when
there are no outliers, especially when the
distribution is roughly symmetric.
The Effects of Outliers (continued…)



Large samples improve the accuracy of Pvalues and critical values from the t distribution
when the population is not normal. The main
reason for this is the central limit theorem.
Always make a plot to check for skewness and
outliers before you use the one-sample t
procedures when
N ≥ unless an outlier or quite strong skewness
is present.
Using the t procedure




Except in the case of small samples, the assumption
that the data are an SRS from the population of interest
is more important than the assumption that the
population distribution is normal.
Sample size at least 15. Use t procedure if the data
are close to normal. If the data are clearly nonnormal
or if outliers are present, do not use t.
Sample size at least 15. The t procedure can be used
except in the presence of outliers or strong skewness.
Large samples. The t procedures can be used even
for clearly skewed distributions when the sample is
large, roughly n ≥ 40.
Can we use t ?
Example 11.7
Page 636 and 637
 Figure 11.9(a) is a histogram of the percent of each
state’s residents who are over 65.We have data on the
entire population, we can find μ and σ so we do not
confidence intervals or tests.
 Figure 11.9 (b) Shows the time lightning strike each day
in the mountains of Colorado. The data has 70
observations that have a symmetric distribution. You can
use t procedures with confidence.
 Figure 11.9 (c) shows the distribution of word length in
Shakespeare’s plays is skewed to right. You can use the
t procedure for a distribution like this if the sample size is
roughly 40 or larger.
The power of a statistical test measures its
of from
the the
t test
ability toThe
detectpower
deviations
null
hypothesis. In practice we carry out the test in
the hope of showing that the null hypothesis is
false, so higher power is important. The power
of the one-sample t test against a specific
alternative value of the population mean μ is
the probability that the test will reject the null
hypothesis when the mean has this alternative
value. To calculate the power, we assume a
fixed level of significance, usually α = 0.05
Example 11.8
Page 639
Power of Spanish Teachers.
Download