µ

advertisement
Problem Set 3
1)
Analysis of LD markers and Fitting Polygenic Effects
The purpose of this exercise it to illustrate the concepts of fitting LD markers as fixed or random
effects, along with fitting random animal genetic effects.
Consider the data set that was introduced in the lectures:
Animal
1
2
3
4
5
6
Sire
0
0
0
1
1
1
Dam
0
0
0
2
2
3
Phenotype
10.1
2.2
2.31
6.57
6.06
6.21
SNP alleles
Pat
Mat
0
1
1
1
1
1
0
1
0
1
0
1
a) Use this data set to estimate the additive effect of the marker, by regressing phenotype on the
number of 1 alleles that the individual carries (0/1/2) by setting up and solving the following least
squares matrix equation:  ∧ 
−1
1 '1 1n ' X  1n ' y 
 µ∧  =  n n
 

 g   X'1n X' X   X' y 
 
Compare the results to what you get from using the slope function of Excel.
b) Evaluate what happens to estimates if you regress on the number of 1 alleles minus 1 (i.e. -1/0/1)
c) Now fit a model that also includes a polygenic
set up and solve the following
∧
mixed model equations:
 µ∧  1n '1n
 g  =  X'1
n
∧ 
Use heritability = 0.75.

Z'1
u  
n
 
animal effect, following slide 25 of the notes, i.e.
1n ' X
1n ' Z 
X' X
X' Z 
Z' X Z' Z + A −1 λ 
−1
1n ' y 
 X' y 


 Z' y 
Compare the estimate of the marker effect to that obtained under a)
Evaluate what happens if you change heritability to a value close to zero.
d) Now fit the marker effect as a random effect, following slide 27 of the notes, i.e. set up and solve
the following mixed model equations:
∧
−1
1n ' X
1 n ' Z  1n ' y 
 µ∧  1n '1n
Use a variance component ratio for the  g  =  X'1
X' X + Iλ g
X' Z   X' y 
n



marker effect variance equal to λg = 10. ∧
Z' X
Z' Z + A −1 λ   Z' y 
u   Z'1n
 
Compare results to those obtained under c)
and also evaluate the impact of increasing or decreasing λg. Interpret the results.
1
e) Compute the EBV of each animal based on the model under c) and d) and compare results. EBV
are the sum of the estimate based on the marker (X ĝ ) and the polygenic EBV ( û ) and can be
gˆ 
computed as: [X Z ]  
uˆ 
f) Compute the accuracy of the EBV of each animal based on model d). Accuracies can be computed
based
on
the
prediction
error
variance-covariance
matrix:
C XX
PEV = [X Z] ZX
C
C XZ   X' 2
  σ e
C ZZ   Z'
where CXX is the block of the inverse of the left hand sides corresponding to X’X, etc.:
1n '1n
 X'1
n

 Z'1n
and
σ e2
1n ' X
1n ' Z 
X' X
X' Z 
Z' X Z' Z + A −1 λ 
−1
 C 11

= C X 1
 X Z1

is the environmental variance, which is equal to h
Z 1X
C XX
C ZX
C 1Z 

C XZ 
C ZZ 
σ p2 . Assume σ p2 = 9 .
2
Then, the accuracy of animal i can be computed as: ri = 1 −
PEV ( EBVi )
σ a2
Where PEV(EBVi) is the ith diagonal of the PEV matrix.
2
Download