European Molecular Biology Laboratory, Meyerhofstrasse 1, 69117 Heidelberg, Germany.
Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA.
Nucleic Acids Res. 2018 Jan 25;46(2):582-592. doi: 10.1093/nar/gkx1165.
Most human genes generate multiple transcript isoforms. The differential expression of these isoforms can help specify cell types. Diverse transcript isoforms arise from the use of alternative transcription start sites, polyadenylation sites and splice sites; however, the relative contribution of these processes to isoform diversity in normal human physiology is unclear. To address this question, we investigated cell type-dependent differences in exon usage of over 18 000 protein-coding genes in 23 cell types from 798 samples of the Genotype-Tissue Expression Project. We found that about half of the expressed genes displayed tissue-dependent transcript isoforms. Alternative transcription start and termination sites, rather than alternative splicing, accounted for the majority of tissue-dependent exon usage. We confirmed the widespread tissue-dependent use of alternative transcription start sites in a second, independent dataset, Cap Analysis of Gene Expression data from the FANTOM consortium. Moreover, our results indicate that most tissue-dependent splicing involves untranslated exons and therefore may not increase proteome complexity. Thus, alternative transcription start and termination sites are the principal drivers of transcript isoform diversity across tissues, and may underlie the majority of cell type specific proteomes and functions.
大多数人类基因产生多种转录本异构体。这些异构体的差异表达有助于确定细胞类型。不同的转录本异构体是通过使用替代转录起始位点、多聚腺苷酸化位点和剪接位点产生的;然而,这些过程对正常人体生理学中异构体多样性的相对贡献尚不清楚。为了解决这个问题,我们研究了 23 种细胞类型中来自 Genotype-Tissue Expression Project 的 798 个样本中超过 18000 个编码蛋白基因的外显子使用的细胞类型依赖性差异。我们发现,大约一半的表达基因显示出组织依赖性的转录本异构体。替代转录起始和终止位点,而不是选择性剪接,解释了大多数组织依赖性外显子的使用。我们在第二个独立数据集 FANTOM 联盟的 Cap Analysis of Gene Expression 数据中证实了广泛存在的组织依赖性替代转录起始位点的使用。此外,我们的结果表明,大多数组织依赖性剪接涉及非翻译外显子,因此可能不会增加蛋白质组的复杂性。因此,替代转录起始和终止位点是跨组织转录本异构体多样性的主要驱动因素,可能是大多数细胞类型特异性蛋白质组和功能的基础。