Suppr超能文献

酿酒酵母基因组的回文序列内容。

Palindrome content of the yeast Saccharomyces cerevisiae genome.

作者信息

Lisnić Berislav, Svetec Ivan-Kresimir, Sarić Hrvoje, Nikolić Ivan, Zgaga Zoran

机构信息

Faculty of Food Technology and Biotechnology, University of Zagreb, Pierottijeva 6, 10000, Zagreb, Croatia.

出版信息

Curr Genet. 2005 May;47(5):289-97. doi: 10.1007/s00294-005-0573-5. Epub 2005 Mar 18.

Abstract

Palindromic sequences are important DNA motifs involved in the regulation of different cellular processes, but are also a potential source of genetic instability. In order to initiate a systematic study of palindromes at the whole genome level, we developed a computer program that can identify, locate and count palindromes in a given sequence in a strictly defined way. All palindromes, defined as identical inverted repeats without spacer DNA, can be analyzed and sorted according to their size, frequency, GC content or alphabetically. This program was then used to prepare a catalog of all palindromes present in the chromosomal DNA of the yeast Saccharomyces cerevisiae. For each palindrome size, the observed palindrome counts were significantly different from those in the randomly generated equivalents of the yeast genome. However, while the short palindromes (2-12 bp) were under-represented, the palindromes longer than 12 bp were over-represented, AT-rich and preferentially located in the intergenic regions. The 44-bp palindrome found between the genes CDC53 and LYS21 on chromosome IV was the longest palindrome identified and contained only two C-G base pairs. Avoidance of coding regions was also observed for palindromes of 4-12 bp, but was less pronounced. Dinucleotide analysis indicated a strong bias against palindromic dinucleotides that could explain the observed short palindrome avoidance. We discuss some possible mechanisms that may influence the evolutionary dynamics of palindromic sequences in the yeast genome.

摘要

回文序列是参与不同细胞过程调控的重要DNA基序,但也是遗传不稳定的潜在来源。为了在全基因组水平上启动对回文序列的系统研究,我们开发了一个计算机程序,该程序可以以严格定义的方式识别、定位和计数给定序列中的回文序列。所有定义为没有间隔DNA的相同反向重复序列的回文序列,都可以根据其大小、频率、GC含量或按字母顺序进行分析和分类。然后使用该程序编制了酿酒酵母染色体DNA中存在的所有回文序列的目录。对于每个回文序列大小,观察到的回文序列计数与酵母基因组随机生成的等效序列中的计数显著不同。然而,虽然短回文序列(2-12个碱基对)的数量不足,但长度超过12个碱基对的回文序列数量过多,富含AT且优先位于基因间区域。在第四条染色体上的CDC53和LYS21基因之间发现的44个碱基对的回文序列是鉴定出的最长回文序列,仅包含两个C-G碱基对。对于4-12个碱基对的回文序列也观察到了对编码区域的回避,但不太明显。二核苷酸分析表明对回文二核苷酸有强烈的偏好,这可以解释观察到的对短回文序列的回避。我们讨论了一些可能影响酵母基因组中回文序列进化动态的机制。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验