Giardine Belinda, Elnitski Laura, Riemer Cathy, Makalowska Izabela, Schwartz Scott, Miller Webb, Hardison Ross C
Department of Computer Science and Engineering, The Pennsylvania State University, University Park, Pennsylvania 16802, USA.
Genome Res. 2003 Apr;13(4):732-41. doi: 10.1101/gr.603103.
We have developed a relational database to contain whole genome sequence alignments between human and mouse with extensive annotations of the human sequence. Complex queries are supported on recorded features, both directly and on proximity among them. Searches can reveal a wide variety of relationships, such as finding all genes expressed in a designated tissue that have a highly conserved noncoding sequence 5' to the start site. Other examples are finding single nucleotide polymorphisms that occur in conserved noncoding regions upstream of genes and identifying CpG islands that overlap the 5' ends of divergently transcribed genes. The database is available online at http://globin.cse.psu.edu/ and http://bio.cse.psu.edu/.
我们开发了一个关系数据库,用于存储人类和小鼠之间的全基因组序列比对以及人类序列的广泛注释。支持对记录的特征进行复杂查询,包括直接查询以及基于它们之间的邻近关系进行查询。搜索可以揭示各种各样的关系,例如找到在指定组织中表达且在起始位点5'端具有高度保守非编码序列的所有基因。其他例子包括找到基因上游保守非编码区域中出现的单核苷酸多态性,以及识别与反向转录基因5'端重叠的CpG岛。该数据库可在http://globin.cse.psu.edu/和http://bio.cse.psu.edu/在线获取。