Transcripts of Unknown Function in Eukaryotic Cells: Lessons

advertisement
Transcripts of Unknown Function in Eukaryotic Cells: Lessons Forgotten and ReLearned
Gingeras, Thomas R., Affymetrix, Inc., Santa Clara, CA 95051
The current status of the functional annotations associated with the human
genome is in an unfinished state. The majority of current genome annotations are heavily
protein coding gene centric. This focus on protein coding genes intrinsically influences
current perceptions of how the genome is structured and regulated. This view of the
genome also has an underlying supposition that transcripts with very little coding
potential are not biologically important. However, recent experiments analyzing the sites
of transcription across large sections of the human genome have led to the conclusion that
the current human genome annotations can not account for the amounts of empirically
detected transcription. (Kapranov, et al. 2002, 2005; Rinn, et al., 2003, Kampa, et al.,
2004, Martone, et al., 2003, Cawley et al., 2004 Bertone, et al., 2004, Cheng, et. al, 2005).
Most of the detected unannotated transcription is composed of RNAs with very little
coding capacity (<100 aa). Except for the significant reduced coding capacity, these
transcripts of unknown function (TUFs) share many properties with well characterized
coding genes including: 1) processing of nuclear precursor transcripts (polyadenylation,
splicing and nuclear to cytosolic transport), 2) cell type expression specificity, 3)
correlation of their 5’ ends with promoter-like sites at which specific regulatory factors
and transcription-correlated chromatin modifications are found and 4) temporally
regulated expression correlated with cellular differentiation. TUFs have been shown to be
nuclear and cytosolic localized as both polyadenylated and non-polyadenylated
transcripts (Cheng, et al 2005). The biological functions of these TUFs have begun to be
explored using several genetic approaches. Initial results of these functional studies will
be described in both human can Drosophila cell systems. These data describe
unannotated populations of poly A+ and A-RNA transcripts which comprise a complex
architecture for the transcribed regions of the genomes and further indicate significantly
more complex strategies for transcriptional regulation than previously anticipated. It may
well be that the multifunctional importance of RNA may not have been lost during the
evolutionary ascendancy of the functional importance of proteins. This appears to be a
lesson to be relearned.
1.
Kapranov, P., S. et. al. (2002) Science 296: 916-919.
2.
Cawley, S.,et al. (2004) Cell 116: 499-510.
3.
Kampa, D., et al. (2004). Genome Res. 14: 331-342.
4.
Martone, R.,.et al.(2003). Proc. Natl. Acad. Sci. USA 100:12247-12252.
5.
Rinn, J.L., et al. (2003) Genes Dev. 17:529-540.
6.
Bertone, P.et al. (2004). Science 306: 2242-2246
7.
Cheng, J, et al. (2005) Science 308: 1149-1154.
8.
Kapranov, P. et al. (2005) Examples of complex architecture of the human transcriptome revealed by
RACE and high density tiling arrays. Genome Res 15: 987-997
.
Download