1. Variance of a random vector and variance of vector elements

advertisement
VARIANCE OF MARKER BREEDING VALUES
Xiaochen Sun
December, 2013
1. Variance of a random vector and variance of vector
elements
Assume that
a ∼ (µ, Σσ 2 )
(1)
is an n × 1 random vector, Σ being positive definite matrix. Let V(a) denote
the variance among the n elements in a. It follows that
1 0 0 1 0 1
a − 11 a a − 11 a
V(a) =
n
n
n
1 0
1 0 1 0
= a I − 11 I − 11 a
n
n
n
1 0
1 0
= a I − 11 a.
n
n
1
(2)
(3)
(4)
The expected value of V(a) is
1 E V(a) = µ0 I −
n
1 = µ0 I −
n
1 = µ0 I −
n
1 0
= µ I−
n
1 0
11 µ +
n
1 0
11 µ +
n
1 0
11 µ +
n
1 0
11 µ +
n
σ2 1 0
tr I − 11 Σ
n
n
2
1 0 σ
tr Σ − tr 11 Σ
n
n
2
1 0 σ
tr Σ − tr 1 Σ1
n
n
2
1 0
σ
tr Σ − 1 Σ1 .
n
n
(5)
(6)
(7)
(8)
When µ = 0,
σ2
1 0
E V(a) =
tr Σ − 1 Σ1
n
n
1
σ2
n · diag −
n · diag + n(n − 1) · offdiag
=
n
n
n − 1
=
diag − offdiag σ 2 ,
n
(9)
(10)
(11)
in which diag is the average diagonal elements of Σ, and offdiag is the average off-diagonal elements of Σ.
For example, when a includes breeding values of n individuals and Σ equals
the numerator relationship matrix A, σ 2 is the additive genetic variance in
pedigree founders (independent individuals), denoted as σa2 . If the n individ
uals are independent from each other, E V(a) = σa2 given large n. If the n
individuals are paternal half-sibs, the diagonal elements in A equals 1.0 and
off-diagonal elements equals 0.25, therefore the expected variance of breeding
2
values among half-sibs equals to 0.75σa2 .
2. Variance of marker breeding values
Consider the statistical model for phenotypes
y = a + e,
(12)
a = Mα,
(13)
e ∼ 0, Iσe2 ,
(14)
in which y is an n × 1 vector of phenotypic values of n individuals adjusted
for fixed non-genetic effects, a is an n × 1 vector of marker breeding values,
M is an n × m matrix with row i including the genotypes of m markers for
individual i, α is an m × 1 vector of m marker effects, and e is an n × 1
vector of random residuals. Assuming each single locus in HWE, column j
of M, denoted as mj , includes n independent binomial variables with mij ∼
Bin(2, pj ), in which pj is allele frequency for marker j. Further we have
E mj = 2qj 1,
Var mj = 2pj qj I,
(15)
(16)
in which qj = 1 − pj . We will discuss about the variance of a, Var(a), and
variance of breeding values, V(a), under different assumptions on M and α.
3
2.1. Random α and fixed M (BayesC)
Assume α ∼ (η, Iσα2 ). The variance of marker breeding value Var(a) can be
written as
Var(a) = Var(Mα) = MM0 σα2 .
(17)
n
n − 1 X (i) 0 (i)
1 X (i) 0 (j) 2
V(a) =
m m − 2
m m σα ,
n2 i=1
n i6=j
(18)
According to (11),
in which m(i) is the vector of ith row of M, i.e. marker genotypes for individual i.
4
2.2. Fixed α and random M
In this case, the variance of marker breeding value Var(a) can be written as
Var(a) = Var(Mα)
m
X
= Var
m j αj
(19)
(20)
j=1
=
=
m
X
j=1
m
X
Var(mj αj ) + 2
X
Cov(mi αi , mj αj )
(21)
αi αj Cov(mi , mj )
(22)
i<j
αj2 Var(mj ) + 2
j=1
m
X
=I
X
i<j
2pj qj αj2 + 2
j=1
m
X
= 2I
X
αi αj rij
p
p
2pi qi I · 2pj qj I
(23)
i<j
pj qj αj2 + 4I
j=1
X√
pi qi pj qj αi αj rij ,
(24)
i<j
in which rij is linkage disequilibrium measured as correlation coefficient between genotypes of marker i and marker j.
According to equation (1),
Σ = I,
2
σ =2
(25)
m
X
pj qj αj2 + 4
X√
j=1
i<j
5
pi qi pj qj αi αj rij .
(26)
2.3. Random α and random M
Assume α ∼ (η, Iσα2 ), and α and M are independent. The variance of marker
breeding value Var(a) can be written as
Var(a) = EM Var(a|M) + VarM E(a|M)
= EM MM0 σα2 + VarM Mη
= σα2 EM MM0 + VarM Mη .
(27)
(28)
(29)
The first term in equation (25) can be written as
σα2 E(MM0 )
=
m
X
σα2 E
(30)
(31)
mj m0j
j=1
=
σα2
m
X
E mj m0j
j=1
m n
X
0 o
2
= σα
Var mj + E(mj ) E(mj )
(32)
j=1
=
=
σα2
m
X
2pj qj I +
j=1
m
X
2
2σα I
pj qj
j=1
σα2
m
X
(2qj )1 · (2qj )10
(33)
j=1
+
4σα2 110
m
X
qj2 .
(34)
j=1
The second term in equation (25) has similar form as equation (22) and can
be written as
Var Mη = 2I
m
X
pj qj ηj2 + 4I
j=1
X√
i<j
6
pi qi pj qj ηi ηj rij .
(35)
Assuming η = 0, we have Var Mη = 0. The diagonal elements of Var(a)
equal to
2σα2
m
X
pj qj +
4σα2
j=1
m
X
qj2
=
2σα2
j=1
m
X
qj + qj2 .
(36)
j=1
The off-diagonal elements of Var(a) equal to
4σα2
m
X
qj2 .
(37)
j=1
According to (11), when n is large, the variance of breeding values is
V(a) =
2σα2
m
X
qj +
qj2
−
j=1
4σα2
m
X
j=1
7
qj2
=
2σα2
m
X
j=1
pj q j .
(38)
Download