Improving gene function predictions using independent transcriptional components.

Affiliations

Department of Medical Oncology, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands.

The Stratingh Institute for Chemistry, University of Groningen, Groningen, The Netherlands.

Publication information

Nat Commun. 2021 Mar 5;12(1):1464. doi: 10.1038/s41467-021-21671-w.

DOI:10.1038/s41467-021-21671-w

PMID:33674610

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7935959/

Abstract

The interpretation of high-throughput sequencing data is limited by our incomplete functional understanding of coding and non-coding transcripts. Reliably predicting the function of such transcripts can overcome this limitation. Here we report the use of a consensus independent component analysis and guilt-by-association approach to predict over 23,000 functional groups comprising over 55,000 coding and non-coding transcripts using publicly available transcriptomic profiles. We show that, compared to using Principal Component Analysis, Independent Component Analysis-derived transcriptional components enable more confident functionality predictions, improve predictions when new members are added to the gene sets, and are less affected by gene multi-functionality. Predictions generated using human or mouse transcriptomic data are made available for exploration in a publicly available web portal.
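
The guilt-by-association idea mentioned in the abstract can be illustrated with a small sketch. It is not the authors' pipeline: the abstract does not give implementation details, so the Welch t-statistic "barcode", the Pearson-correlation score, the leave-one-out step, and every function name below are assumptions made for illustration. The loadings are made-up toy numbers standing in for gene weights on transcriptional components (such as ICA would produce).

```python
from math import sqrt
from statistics import mean, variance


def welch_t(a, b):
    """Welch's t-statistic for two samples (each needs at least 2 values)."""
    se = sqrt(variance(a) / len(a) + variance(b) / len(b))
    return (mean(a) - mean(b)) / se if se > 0 else 0.0


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    num = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    den = sqrt(sum((xi - mx) ** 2 for xi in x) * sum((yi - my) ** 2 for yi in y))
    return num / den if den > 0 else 0.0


def gene_set_barcode(loadings, members):
    """One value per component: how strongly set members differ from other genes."""
    n_components = len(next(iter(loadings.values())))
    barcode = []
    for k in range(n_components):
        inside = [w[k] for g, w in loadings.items() if g in members]
        outside = [w[k] for g, w in loadings.items() if g not in members]
        barcode.append(welch_t(inside, outside))
    return barcode


def score_genes(loadings, members):
    """Score each gene by how well its loadings match the gene-set barcode."""
    scores = {}
    for gene in loadings:
        # Leave the gene out of its own set so it cannot vote for itself.
        barcode = gene_set_barcode(loadings, members - {gene})
        scores[gene] = pearson(loadings[gene], barcode)
    return scores


# Hypothetical loadings: gene -> weights on three components (toy data).
loadings = {
    "geneA": [2.1, 0.1, -0.2],
    "geneB": [1.8, -0.1, 0.3],
    "geneC": [2.4, 0.2, 0.0],
    "geneD": [0.1, 1.9, -0.1],
    "geneE": [-0.2, 2.2, 0.2],
    "geneF": [0.0, -0.1, 2.0],
    "geneX": [2.0, 0.0, 0.1],  # unannotated gene resembling the set members
}
members = {"geneA", "geneB", "geneC"}  # a hypothetical annotated gene set

scores = score_genes(loadings, members)
# Rank genes that are not yet annotated to the set, best candidate first.
ranked = sorted((g for g in loadings if g not in members),
                key=scores.get, reverse=True)
print(ranked)
```

In this toy setup, genes whose component loadings resemble those of the annotated members receive high scores and become candidates for the same functional group. The same scoring could be run on PCA-derived loadings to compare the two decompositions, which is the comparison the abstract reports.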

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f9e9/7935959/75978761799e/41467_2021_21671_Fig1_HTML.jpg
