Ruiz-Orera Jorge, Albà M Mar
Evolutionary Genomics Group, Research Programme on Biomedical Informatics, Hospital del Mar Research Institute, Universitat Pompeu Fabra, Dr Aiguader 88, Barcelona 08003, Spain.
Catalan Institution for Research and Advanced Studies, Passeig Lluís Companys 23, Barcelona 08010, Spain.
NAR Genom Bioinform. 2019 Jul 5;1(1):e2. doi: 10.1093/nargab/lqz002. eCollection 2019 Apr.
The mammalian transcriptome includes thousands of transcripts that do not correspond to annotated protein-coding genes and that are known as long non-coding RNAs (lncRNAs). A handful of lncRNAs have well-characterized regulatory functions, but the biological significance of most of them is not well understood. LncRNAs that are conserved between mice and humans are likely to be enriched in functional sequences. Here, we investigate the presence of different types of ribosome profiling signatures in lncRNAs and how they relate to sequence conservation. We find that conserved lncRNA regions contain three times more ORFs with translation evidence than non-conserved ones, and we identify nine cases that display significant constraints at the amino acid sequence level. The study also reveals that conserved regions in intergenic lncRNAs are significantly enriched in protein-RNA interaction signatures when compared to non-conserved ones; this includes sites in well-characterized lncRNAs, such as and , as well as in tens of lncRNAs of unknown function. This work illustrates how the analysis of ribosome profiling data coupled with evolutionary analysis provides new opportunities to explore the lncRNA functional landscape.