Centre for Biotechnology, Anna University, Sardar Patel Road, Guindy, Chennai 600025, India.
Evol Bioinform Online. 2011;7:235-55. doi: 10.4137/EBO.S8162. Epub 2011 Nov 10.
The Plasmodium falciparum genome being AT-rich, the presence of GC-rich regions suggests functional significance. Evolution imposes selection pressure to retain functionally important coding and regulatory elements. Hence searching for evolutionarily conserved GC-rich, intergenic regions in an AT-rich genome will help in discovering new coding regions and regulatory elements. We have used elevated GC content in intergenic regions coupled with sequence conservation against P. reichenowi, which is evolutionarily closely related to P. falciparum to identify potential sequences of functional importance. Interestingly, ~30% of the GC-rich, conserved sequences were associated with antigenic proteins encoded by var and rifin genes. The majority of sequences identified in the 5' UTR of var genes are represented by short expressed sequence tags (ESTs) in cDNA libraries signifying that they are transcribed in the parasite. Additionally, 19 sequences were located in the 3' UTR of rifins and 4 also have overlapping ESTs. Further analysis showed that several sequences associated with var genes have the capacity to encode small peptides. A previous report has shown that upstream peptides can regulate the expression of var genes hence we propose that these conserved GC-rich sequences may play roles in regulation of gene expression.
疟原虫基因组富含 A 和 T,GC 含量丰富的区域表明其具有功能意义。进化施加选择压力以保留具有重要功能的编码和调控元件。因此,在富含 A 和 T 的基因组中搜索进化上保守的 GC 丰富的基因间区,将有助于发现新的编码区和调控元件。我们利用基因间区 GC 含量升高并与进化上与疟原虫密切相关的 P. reichenowi 序列保守性,来识别潜在具有功能重要性的序列。有趣的是,约 30%的富含 GC 的保守序列与 var 和 rifin 基因编码的抗原蛋白有关。在 var 基因 5'UTR 中鉴定到的大多数序列都在 cDNA 文库中代表短表达序列标签 (ESTs),这表明它们在寄生虫中转录。此外,19 个序列位于 rifin 的 3'UTR 中,还有 4 个序列也有重叠的 ESTs。进一步分析表明,与 var 基因相关的几个序列具有编码小肽的能力。先前的报告表明,上游肽可以调节 var 基因的表达,因此我们提出这些保守的 GC 丰富序列可能在基因表达调控中发挥作用。