Woldesemayat Adugna Abdi, Van Heusden Peter, Ndimba Bongani K, Christoffels Alan
South African Medical Research Council Bioinformatics Unit, South African National Bioinformatics Institute, University of the Western Cape, Private Bag X17, Belleville, 7535, South Africa.
Department of Life and Consumer Sciences, College of Agriculture and Environmental Sciences, University of South Africa, UNISA Science Campus, Corner of Christiaan De Wet Road and Pioneer Avenue, Johannesburg, Florida, 1710, South Africa.
BMC Genet. 2017 Dec 22;18(1):119. doi: 10.1186/s12863-017-0584-5.
Drought is the most disastrous abiotic stress that severely affects agricultural productivity worldwide. Understanding the biological basis of drought-regulated traits, requires identification and an in-depth characterization of genetic determinants using model organisms and high-throughput technologies. However, studies on drought tolerance have generally been limited to traditional candidate gene approach that targets only a single gene in a pathway that is related to a trait. In this study, we used sorghum, one of the model crops that is well adapted to arid regions, to mine genes and define determinants for drought tolerance using drought expression libraries and RNA-seq data.
We provide an integrated and comparative in silico candidate gene identification, characterization and annotation approach, with an emphasis on genes playing a prominent role in conferring drought tolerance in sorghum. A total of 470 non-redundant functionally annotated drought responsive genes (DRGs) were identified using experimental data from drought responses by employing pairwise sequence similarity searches, pathway and interpro-domain analysis, expression profiling and orthology relation. Comparison of the genomic locations between these genes and sorghum quantitative trait loci (QTLs) showed that 40% of these genes were co-localized with QTLs known for drought tolerance. The genome reannotation conducted using the Program to Assemble Spliced Alignment (PASA), resulted in 9.6% of existing single gene models being updated. In addition, 210 putative novel genes were identified using AUGUSTUS and PASA based analysis on expression dataset. Among these, 50% were single exonic, 69.5% represented drought responsive and 5.7% were complete gene structure models. Analysis of biochemical metabolism revealed 14 metabolic pathways that are related to drought tolerance and also had a strong biological network, among categories of genes involved. Identification of these pathways, signifies the interplay of biochemical reactions that make up the metabolic network, constituting fundamental interface for sorghum defence mechanism against drought stress.
This study suggests untapped natural variability in sorghum that could be used for developing drought tolerance. The data presented here, may be regarded as an initial reference point in functional and comparative genomics in the Gramineae family.
干旱是最具灾难性的非生物胁迫,严重影响全球农业生产力。要了解干旱调控性状的生物学基础,需要利用模式生物和高通量技术鉴定并深入表征遗传决定因素。然而,耐旱性研究通常局限于传统的候选基因方法,该方法仅针对与某一性状相关途径中的单个基因。在本研究中,我们利用高粱这一适应干旱地区的模式作物之一,通过干旱表达文库和RNA测序数据挖掘耐旱基因并确定其决定因素。
我们提供了一种综合且具比较性的计算机模拟候选基因鉴定、表征及注释方法,重点关注在高粱耐旱性中起重要作用的基因。通过成对序列相似性搜索、途径和结构域间分析、表达谱分析及直系同源关系,利用干旱响应的实验数据共鉴定出470个功能注释的非冗余干旱响应基因(DRG)。这些基因与高粱数量性状位点(QTL)的基因组位置比较表明,其中40%的基因与已知的耐旱QTL共定位。使用拼接比对程序(PASA)进行的基因组重新注释导致9.6%的现有单基因模型得到更新。此外,基于表达数据集利用AUGUSTUS和PASA分析鉴定出210个推定的新基因。其中,50%为单外显子基因,69.5%表现出干旱响应,5.7%为完整基因结构模型。生化代谢分析揭示了14条与耐旱性相关且具有强大生物网络的代谢途径,涉及各类基因。这些途径的鉴定表明构成代谢网络的生化反应之间的相互作用,构成了高粱抗旱胁迫防御机制的基本界面。
本研究表明高粱中存在尚未开发的自然变异,可用于培育耐旱性。本文提供的数据可被视为禾本科功能和比较基因组学的初始参考点。