Supplementary Information 2

Bioinformatics analysis
Bioinformatics analysis was performed to assess the biological relevance of
significant individual SNPs. For non-coding SNPs, evolutionary conservation in 17
vertebrates was examined using the “Vertebrate Multiz Alignment1 & Conservation”
track on the UCSC Genome Browser.2 The conservation track is based on a
phylogenetic hidden Markov model, phastCons.3 We determined whether the
associated SNP fell within a predicted evolutionarily conserved element using the
phastCons conserved elements track.4 Finally, we used the MATCH™ program5 to
determine whether non-coding SNPs located in putative regulatory regions create or
destroy any TRANSFAC®6 predicted transcription factor binding sites. Two
sequences (101bp each), containing the alternate alleles for the SNP of interest,
were submitted for analysis. Beyond the default settings, MATCH was set to
search only the vertebrate matrix groups and to minimize false-positive matches.
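As an illustration of how the two allele-specific input sequences might be prepared, the sketch below builds one 101 bp window per allele. It assumes the SNP sits at the centre of the window (50 bp of flank on each side), which the text does not state explicitly. The function name, the flank strings and the two alleles are hypothetical placeholders, not real genomic sequence.

```python
def allele_windows(upstream, downstream, alleles, flank=50):
    """Return one sequence per allele: `flank` bases + allele + `flank` bases.

    Assumption: the SNP is centred in the window, so flank=50 gives 101 bp.
    """
    if len(upstream) < flank or len(downstream) < flank:
        raise ValueError("need at least `flank` bases on each side of the SNP")
    left = upstream[-flank:]      # the bases immediately 5' of the SNP
    right = downstream[:flank]    # the bases immediately 3' of the SNP
    return {allele: left + allele + right for allele in alleles}


# Placeholder flanks and alleles, for illustration only.
upstream = "ACGT" * 15      # 60 bases
downstream = "TGCA" * 15    # 60 bases
windows = allele_windows(upstream, downstream, alleles=("A", "G"))

for allele, seq in windows.items():
    print(allele, len(seq))
```

The two resulting sequences differ only at the central position, so any difference in predicted binding sites between them can be attributed to the SNP allele.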
The three significant SNPs (56, 126 and 131) were subjected to bioinformatics
analysis to assess their biological significance. Neither SNP 56 nor SNP 126 is
conserved across species, as determined by examination of the Vertebrate Multiz
Alignments and Conservation track on UCSC. SNP 131 does fall within a region of appreciable conservation, but it
does not correspond to any phastCons-predicted conserved elements. For the
vertebrate sequences with which it could be aligned (human, chimp, rhesus, dog,
cow and elephant), the risk allele of SNP 131 (T) is conserved, suggesting that this is
unlikely to be the deleterious allele. Finally, none of the three SNPs appeared to
create or abolish a TRANSFAC predicted transcription factor binding site, as
determined by the MATCH™ program.5
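The allele-conservation reasoning applied to SNP 131 can be expressed as a simple check on one alignment column. This is a minimal sketch, not the procedure actually used, which was visual inspection of the UCSC track. The function name and the dictionary layout are assumptions; the column values restate what the text reports (T in all six alignable species).

```python
def allele_conserved(column, allele):
    """True if every aligned species carries `allele` at the SNP position.

    `column` maps species name -> base; gaps and unknown bases are skipped.
    """
    bases = [b.upper() for b in column.values() if b not in ("-", "N", "n")]
    return bool(bases) and all(b == allele.upper() for b in bases)


# Alignment column at SNP 131 as described in the text.
snp131_column = {
    "human": "T", "chimp": "T", "rhesus": "T",
    "dog": "T", "cow": "T", "elephant": "T",
}
print(allele_conserved(snp131_column, "T"))
```

A risk allele that matches the base carried by every aligned species is likely the ancestral state, which is the basis for the argument that it is unlikely to be deleterious.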
