Feb. 27

advertisement
Stat 462 Feb. 27
Interaction (Section 8.2)
In statistics, the word interaction has to do with the combined effect of two predictor
variables (x-variables) on the response variable y. An interaction is said to be between the
two predictor variables.

Specifically, two x-variables, say x1 and x2, interact if the effect of x1 on y
depends upon the specific value present for x2.
In regression, the word “effect” refers to how much E(y) changes when an x is changed
by one unit.
Example 1
An experiment is done in which participants are smokers trying to quit.
x1 = 0 if participant used placebo patch
x1 = 1 if participant used nicotine patch
x2 = 0 if participant did not live with another smoker
x2 = 1 if participant lived with another smoker
Here are success rates (percent not smoking after 8 weeks).
x2 = 0
(No other smoker in home)
x1 = 0 (Placebo patch)
20%
x1 =1 (Nicotine patch)
58%



x2=1
Another smoker in home
20%
31%
When no other smoker is in the home (x2 = 0), the difference in success rates for
nicotine patch versus placebo patch is 58%20% = 38%.
When another smoker is in the home (x2 = 1), the difference in success rates for
nicotine patch versus placebo patch is 31%20% = 11%.
So the effect of x1, which is the difference in success rates for nicotine patch
(x1=1) versus placebo patch (x1=0), depends upon the specific value present for
x2 (whether another smoker is in the home or not). Thus, there is an interaction
between x1 and x2.
Example 2
An experiment is done to determine the maximum torque (circular force) that can be
applied before a mechanism used to spin logs being cut for plywood fails. Logs of two
different diameters are used. The spinning mechanism is inserted into the logs at four
different depths.
Following is a graph of y = torque (before failure) versus Depth with the two different log
sizes marked with different symbols. Notice that the slope appears to differ for two sizes.
.
Thus, the model we should consider is
E(Torque) = 0 + 1 Depth + 2Diameter + 3 (Depth*Diameter)
Minitab output follow. Notice that the interaction term is statistically significant.
The regression equation is
Torque = 13.8 - 8.68 Depth + 0.662 Diameter + 1.97 Interact
Predictor
Constant
Depth
Diameter
Interact
S = 2.697
Coef
13.795
-8.678
0.6625
1.9667
SE Coef
5.817
2.678
0.9405
0.4330
R-Sq = 90.9%
T
2.37
-3.24
0.70
4.54
P
0.028
0.004
0.489
0.000
R-Sq(adj) = 89.6%
Questions:
1. The Diameter term is not significant (p=0.489). What does this mean in this
situation?
2. Suppose we did the regression E(Torque) = 0 + 1 Depth. What would be the
appearance of a plot of residuals versus fits?
3. Suppose we did the regression E(Torque) = 0 + 1 Depth + 2Diameter. What
would be the appearance of a plot of residuals versus fits?
Example 3
College students are asked to report their actual weight and their ideal (desired) weight.
Here’s a graph of y = ideal versus x =actual with different symbols used for males and
females and a Lowess smoother used to help identify a pattern.
There may be a curved pattern and this pattern may differ for males and females. After a
few attempts, I found reasonable success with the model estimated by the following
output. The variable actwt150 = actual150, the variable sexcode = 1 if Female and 0 if
Male, and Lin_inte = sexcode*actwt150.
The regression equation is
ideal = 156 + 0.865 actwt150 - 22.3 sexcode - 0.00334 actwt150sqrd
- 0.350 Lin_interact
Predictor
Constant
acwt150
sexcode
actwt150sqrd
Lin_inte
S = 8.822
Coef
155.567
0.86516
-22.284
-0.0033351
-0.35036
SE Coef
1.575
0.06681
1.933
0.0009648
0.09383
R-Sq = 91.4%
Here’s a graph of the fits:
T
98.75
12.95
-11.53
-3.46
-3.73
P
0.000
0.000
0.000
0.001
0.000
R-Sq(adj) = 91.2%
Download