Comparing Two Mean Vectors Using Independent Samples

advertisement
Two-Sample Multivariate Tests
Mahon & Campbell (1974) recorded data on 200 specimens of Leptograpsus variegatus
crabs on the shore of Western Australia. This occurs in two color forms, blue and orange,
and they collected 50 of each form and sex.
The measurements were: carapace (shell) length CL, carapace width CW, the size the
frontal lobe FL, rear width RW, and body depth BD.
Assess the multivariate normality of the data contained in the data frame Crabs. The
species is contained in the data set. Be sure to take into consideration the fact that there
are two species and two sexes of crabs present in these data.
We will perform tests to contrast the differences between male and female crabs within
species, and also contrast the differences between orange and blue crabs within sex. We
first will form two subsets of the original data. One subset will contain the data on the
blue crabs and theother will contain the data on orange crabs.
> attach(Crabs)
> Blue.Crabs <- Crabs[sp=="B",]
> Orange.Crabs <- Crabs[sp=="O",]
> names(Blue.Crabs)
[1] "cgrp" "sex" "sp" "FL" "RW" "CL" "CW" "BD"
> Blue.Crabs <- Blue.Crabs[,-c(1,3)] # Remove columns for crab group and species
> Orange.Crabs<- Orange.Crabs[,-c(1,3)] # Remove columns for crab group and species
> detach("Crabs")
> attach(Blue.Crabs)
> BC.mat <- Blue.Crabs[,2:6] # Extract the columns containing measurements
> BC.mat[1:4,]
FL RW
CL
CW BD
1 8.1 6.7 16.1 19.0 7.0
2 8.8 7.7 18.1 20.8 7.4
3 9.2 7.8 19.0 22.4 7.7
4 9.6 7.9 20.1 23.1 8.2
> MVttest2(BC.mat,sex,pv=T,box=T,lcp=T)
Perform two-sample T2 test. The required arguments to the function are the data matrix
and the variable used to classify the two groups. In this case BC.mat contains the
measurements and sex contains and indicator for the sex of the crab
(F = female, M = male).
The other options are not required. Setting pv = T indicates that you wish the two sample
variance covariance matrices to be printed (default = F). Setting box = T creates
comparative boxplots for each of the variables (default = F). Setting lcp = T plots the
values of the maximizing linear combination for both groups (default = F). Look for a
great deal of separation between the linear combination scores when the null hypothesis
is rejected.
Output from MVttest2 function (Blue Crabs)
Difference in mean vectors
==================================================================
FL
RW
CL
CW
BD
1.572 -0.42 3.912 4.186 1.534
Sample Variance-Covariance Matrices
===========================================
S1 ( M )
======================================================================
FL
RW
CL
CW
BD
FL 10.26 6.54 23.30 26.61 10.17
RW 6.54 4.46 15.08 17.26 6.55
CL 23.30 15.08 53.42 60.98 23.24
CW 26.61 17.26 60.98 69.78 26.59
BD 10.17 6.55 23.24 26.59 10.24
S2 ( F )
======================================================================
FL
RW
CL
CW
BD
FL 6.91 6.28 15.47 17.80 7.10
RW 6.28 5.95 14.25 16.39 6.56
CL 15.47 14.25 35.04 40.21 16.14
CW 17.80 16.39 40.21 46.29 18.53
BD 7.10 6.56 16.14 18.53 7.58
Test for Equality of Means
=======================================================================
Hotelling's T2 = 241.969 Critical Value = 12.048 p-value= 0
The linear combination most responsible for rejection of the null
hypothesis
lin.comb
FL -2.785
RW -5.205
CL 2.822
CW 0.401
BD -0.552
95% Simultaneous C.I. for Mean Differences (T^2)
===========================================================
LCL
UCL
FL -0.4615290 3.605529
RW -2.0035212 1.163521
CL -0.7048465 8.528846
CW -1.1025639 9.474564
BD -0.5379137 3.605914
95% Simultaneous C.I. for Mean Differences (Bonferroni)
===========================================================
LCL
UCL
FL 0.03299663 3.1110034
RW -1.61843114 0.7784311
CL 0.41790554 7.4060945
CW 0.18354035 8.1884597
BD -0.03405348 3.1020535
Graphs for Selected Options (Blue Crabs)
Comparative Boxplots for Each Variable
Now consider the similar tests for the orange crabs.
>
>
>
>
detach("Blue.Crabs")
attach(Orange.Crabs)
OC.mat <- Orange.crabs[,2:6]
Mvttest2(OC.mat,sex,pv=T,box=T,lcp=T)
Output from MVttest2 function (Orange Crabs)
Difference in mean vectors
==================================================================
FL
RW
CL
CW
BD
-0.968 -2.574 -0.93 -1.848 -0.308
Sample Variance-Covariance Matrices
=======================================
S1 ( M )
===================================================================
FL
RW
CL
CW
BD
FL 12.36 7.60 26.62 29.31 12.32
RW 7.60 4.82 16.53 18.23 7.65
CL 26.62 16.53 57.93 63.77 26.76
CW 29.31 18.23 63.77 70.35 29.49
BD 12.32 7.65 26.76 29.49 12.44
S2 ( F )
===================================================================
FL
RW
CL
CW
BD
FL 8.84 6.73 17.17 19.25 8.01
RW 6.73 5.52 13.40 15.05 6.21
CL 17.17 13.40 34.07 38.08 15.91
CW 19.25 15.05 38.08 42.80 17.76
BD 8.01 6.21 15.91 17.76 7.58
Test for Equality of Means
===============================================================
Hotelling's T2 = 447.033 Critical Value = 12.048 p-value= 0
The linear combination most responsible for rejection of the null
hypothesis
lin.comb
FL -1.121
RW -7.121
CL 3.968
CW -1.258
BD 0.550
95% Simultaneous C.I. for Mean Differences (T^2)
===========================================================
LCL
UCL
FL -3.228160 1.2921598
RW -4.152142 -0.9958583
CL -5.638423 3.7784227
CW -7.069558 3.3735580
BD -2.504263 1.8882631
95% Simultaneous C.I. for Mean Differences (Bonf)
===========================================================
LCL
UCL
FL -2.678521 0.7425208
RW -3.768360 -1.3796401
CL -4.493401 2.6334006
CW -5.799749 2.1037486
BD -1.970163 1.3541629
Graphs from Selected Options (Orange Crabs)
Comparative Boxplots for each Variable
We now consider testing differences between species within a given sex. To do this we
again first form subsets of our data, this time corresponding to sex.
>
>
>
>
>
detach("Orange.Crabs")
attach(Crabs)
Male.Crabs <- Crabs[sex=="M",]
Female.Crabs <- Crabs[sex=="F",]
detach("Crabs")
> attach(Male.Crabs)
> MC.mat <- Male.Crabs[,4:8]
> MVttest2(MC.mat,sp,pv=T,box=T,lcp=T)
Output from MVttest2 (Male Crabs)
Difference in mean vectors
==================================================================
FL
RW
CL
CW
BD
-1.784 -0.544 -1.674 -0.378 -1.974
Sample Variance-Covariance Matrices
===========================================
S1 ( B )
======================================================================
FL
RW
CL
CW
BD
FL 10.26 6.54 23.30 26.61 10.17
RW 6.54 4.46 15.08 17.26 6.55
CL 23.30 15.08 53.42 60.98 23.24
CW 26.61 17.26 60.98 69.78 26.59
BD 10.17 6.55 23.24 26.59 10.24
S2 ( O )
======================================================================
FL
RW
CL
CW
BD
FL 12.36 7.60 26.62 29.31 12.32
RW 7.60 4.82 16.53 18.23
7.65
CL 26.62 16.53 57.93 63.77 26.76
CW 29.31 18.23 63.77 70.35 29.49
BD 12.32 7.65 26.76 29.49 12.44
Test for Equality of Means
===============================================================
Hotelling's T2 = 916.162 Critical Value = 12.048 p-value= 0
The linear combination most responsible for rejection of the null
hypothesis
lin.comb
FL -5.408
RW -3.766
CL -3.762
CW 11.087
BD -11.573
95% Simultaneous C.I. for Mean Differences (T^2)
===========================================================
LCL
UCL
FL -4.118179 0.5501790
RW -2.039355 0.9513555
CL -6.853814 3.5058136
CW -6.188841 5.4328413
BD -4.311756 0.3637559
95% Simultaneous C.I. for Mean Differences (Bonf)
===========================================================
LCL
UCL
FL -3.550540 -0.01746044
RW -1.675706 0.58770610
CL -5.594156 2.24615589
CW -4.775727 4.01972655
BD -3.743247 -0.20475340
Output for Selected Options (Male Crabs)
Repeat for female crabs.
>
>
>
>
detach("Male.Crabs")
attach(Female.Crabs)
FC.mat <- Female.Crabs[,4:8]
MVttest2(FC.mat,spec,pv=T,box=T,lcp=T)
Output and graphs not shown.
Download