Structural organization of the malaria mosquito heterochromatin

advertisement
STRUCTURAL
ORGANIZATION
OF
THE
MALARIA
MOSQUITO
HETEROCHROMATIN: FROM CHROMOSOMAL MORPHOLOGY TO GENOME
SEQUENCE
Maria V. Sharakhova1, Phillip George1, Irina V. Brusentsova2, Scotland Leman3,
Christopher D. Smith4, Igor V. Sharakhov1*
1
Department of Entomology, Virginia Tech, Blacksburg, VA, USA; 2 Department of
Molecular and Cellular Biology, Institute of Chemical Biology and Fundamental Medicine
SB RAS, Novosibirsk, Russia; 3Department of Statistics, Virginia Tech, Blacksburg, VA,
USA; 4Department of Biology, San Francisco State University, San Francisco, CA, USA;
e-mail:igor@vt.edu; *corresponding author
Key words: heterochromatin, malaria mosquito, imunostaning, Bayesian statistics
Motivations and aim: Heterochromatin plays an important role in chromosome function and
gene regulation. The goal of this study was to map and characterize the heterochromatic
portion of the malaria mosquito Anopheles gambiae genome.
Methods and Algorithms: Imunostaining of An. gambiae polytene chromosomes was
performed using antibodies against Heterochromatin Protein 1 (HP1) and lamin Dm0
Drosophila melanogaster. Physical mapping was done by Cy3, Cy5 labeled PCR amplified
genes. The bioinformatics analysis was conducted using C program ATCONTENT, Biomart
and SMARTest. We developed novel Bayesian statistical models to analyze genome
landscapes in different types of chromatin. A gene ontology (GO) annotation file was
generated using Interproscan-assigned GO terms and custom Perl scripts. Segmental
duplications were detected using BLAST-based whole-genome assembly comparison limited
to putative segmental duplications represented by pairwise alignments with ≥2.5-kb
and >90% sequence identity.
Results: Based on chromosomal location, two types of heterochromatin have been identified
in An. gambiae chromosomes: pericentromeric and intercalary. Intercalary heterochromatin
was represented by two morphological forms: compact, block-like type and diffuse, meshlike type. Pericentromeric heterochromatin in An. gambiae has a mesh-like structure.
Immunostaining of the An. gambaie chromosomes with antibodies against HP1 and lamin
Dm0 identified the major invariable sites of protein localization in regions of diffuse
heterochromatin and euchromatic region 9C but not in the compact intercalary
heterochromatin. To determine the euchromatin-heterochromatin boundaries in the An.
gambaie genome, we physically mapped genes to the polytene chromosomes: 16.6 Mb of the
mapped portion in the An. gambiae genome has been identified as heterochromatin. Unlike
Drosophila, Anopheles possesses three large regions (0.7, 0.8, and 2.9 megabase pairs long)
of intercalary heterochromatin. The study determined 230 in the mapped portion of the An.
gambiae heterochromatin. GO analysis revealed that heterochromatin is enriched by genes
with DNA-binding and regulatory activities. Bayesian statistical models showed that
heterochromatin and euchromatin differ in gene density and that all chromatin types are
distinguishable from each other by the coverage of transposable elements. Conversely,
segmental duplication, matrix-associated regions, and simple repeats are distributed
regardless of the chromatin type. Finally, we have demonstrated that the 42 Mb assembly of
unmapped scaffolds (“unknown” chromosome) have molecular characteristics of
heterochromatin.
Conclusion: The study revealed the presence of large blocks of intercalary heterochromatin in
An. gambiae genome. We conclude that each chromatin type has a unique combination of
various classes of genes and repetitive DNA.
Download