Kelley David, Rinn John
Genome Biol. 2012 Nov 26;13(11):R107. doi: 10.1186/gb-2012-13-11-r107.
Numerous studies over the past decade have elucidated a large set of long intergenic noncoding RNAs (lincRNAs) in the human genome. Research since has shown that lincRNAs constitute an important layer of genome regulation across a wide spectrum of species. However, the factors governing their evolution and origins remain relatively unexplored. One possible factor driving lincRNA evolution and biological function is transposable element (TE) insertions. Here, we comprehensively characterize the TE content of lincRNAs relative to genomic averages and protein coding transcripts.
Our analysis of the TE composition of 9,241 human lincRNAs revealed that, in sharp contrast to protein coding genes, 83% of lincRNAs contain a TE, and TEs comprise 42% of lincRNA sequence. lincRNA TE composition varies significantly from genomic averages - L1 and Alu elements are depleted and broad classes of endogenous retroviruses are enriched. TEs occur in biased positions and orientations within lincRNAs, particularly at their transcription start sites, suggesting a role in lincRNA transcriptional regulation. Accordingly, we observed a dramatic example of HERVH transcriptional regulatory signals correlating strongly with stem cell-specific expression of lincRNAs. Conversely, lincRNAs devoid of TEs are expressed at greater levels than lincRNAs with TEs in all tissues and cell lines, particularly in the testis.
TEs pervade lincRNAs, dividing them into classes, and may have shaped lincRNA evolution and function by conferring tissue-specific expression from extant transcriptional regulatory signals.
在过去十年中,大量研究已经阐明了人类基因组中一大类长链基因间非编码RNA(lincRNA)。此后的研究表明,lincRNA在广泛的物种中构成了基因组调控的重要层面。然而,控制它们进化和起源的因素仍相对未被探索。驱动lincRNA进化和生物学功能的一个可能因素是转座元件(TE)插入。在这里,我们全面描述了相对于基因组平均值和蛋白质编码转录本的lincRNA的TE含量。
我们对9241个人类lincRNA的TE组成分析表明,与蛋白质编码基因形成鲜明对比的是,83%的lincRNA含有一个TE,且TE占lincRNA序列的42%。lincRNA的TE组成与基因组平均值有显著差异——L1和Alu元件减少,而内源性逆转录病毒的广泛类别增加。TE在lincRNA内的位置和方向存在偏向性,特别是在它们的转录起始位点,这表明其在lincRNA转录调控中发挥作用。相应地,我们观察到一个显著的例子,即HERVH转录调控信号与lincRNA的干细胞特异性表达密切相关。相反,在所有组织和细胞系中,尤其是在睾丸中,不含TE的lincRNA的表达水平高于含有TE的lincRNA。
TE普遍存在于lincRNA中,将它们分为不同类别,并可能通过赋予现有转录调控信号的组织特异性表达来塑造lincRNA的进化和功能。