PubMatrix: a tool for multiplex literature mining.

Authors

Becker Kevin G, Hosack Douglas A, Dennis Glynn, Lempicki Richard A, Bright Tiffani J, Cheadle Chris, Engel Jim

Affiliation

Gene Expression and Genomics Unit, National Institutes of Health, Baltimore, MD, USA.

Publication

BMC Bioinformatics. 2003 Dec 10;4:61. doi: 10.1186/1471-2105-4-61.

Abstract

BACKGROUND

Molecular experiments using multiplex strategies such as cDNA microarrays or proteomic approaches generate large datasets requiring biological interpretation. Text-based data mining tools have recently been developed to query large biological datasets of this type. PubMatrix is a web-based tool that allows simple text-based mining of the NCBI literature search service PubMed using any two lists of keyword terms, resulting in a frequency matrix of term co-occurrence.
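To make the idea of a term co-occurrence frequency matrix concrete, the following is a minimal sketch, not PubMatrix's actual implementation. It assumes a hypothetical in-memory list of records (`TOY_RECORDS`) in place of PubMed and a simple case-insensitive substring match; the function names `cooccurrence_matrix` and `toy_count` are illustrative only.

```python
from typing import Callable, Dict, List


def cooccurrence_matrix(
    search_terms: List[str],
    modifier_terms: List[str],
    count_hits: Callable[[str, str], int],
) -> Dict[str, Dict[str, int]]:
    """Return matrix[search][modifier] = number of records mentioning both terms."""
    return {
        s: {m: count_hits(s, m) for m in modifier_terms}
        for s in search_terms
    }


# Hypothetical stand-in for a literature database (invented records, not PubMed data).
TOY_RECORDS = [
    "tp53 mutation in breast cancer",
    "brca1 and breast cancer risk",
    "tp53 regulates apoptosis",
    "brca1 in dna repair",
]


def toy_count(search: str, modifier: str) -> int:
    """Count toy records containing both terms (case-insensitive substring match)."""
    s, m = search.lower(), modifier.lower()
    return sum(1 for rec in TOY_RECORDS if s in rec and m in rec)


matrix = cooccurrence_matrix(["TP53", "BRCA1"], ["cancer", "apoptosis"], toy_count)
for term, row in matrix.items():
    print(term, row)
```

Passing the counting function as a parameter keeps the matrix logic independent of the data source, so the toy counter could be swapped for one that queries a real literature service.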

RESULTS

For example, a simple term selection procedure allows automatic pairwise comparisons of approximately 1-100 search terms versus approximately 1-10 modifier terms, resulting in up to 1,000 pairwise comparisons. The matrix table of pairwise comparisons can then be surveyed, queried individually, and archived. Lists of keywords can include any terms currently capable of being searched in PubMed. In the context of cDNA microarray studies, this may be used for the annotation of gene lists from clusters of genes that are expressed coordinately. An associated PubMatrix public archive provides previous searches that use common, useful lists of keyword terms.
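The pairwise expansion described above (up to 100 search terms against up to 10 modifier terms, giving at most 1,000 queries) might be sketched as follows. This is an illustration under stated assumptions, not necessarily how PubMatrix issues its searches: the `(A) AND (B)` query form, the helper names, and the use of the NCBI E-utilities `esearch` endpoint to obtain a hit count are all choices made for this example. No network request is made here; the code only builds the query strings and URLs.

```python
from itertools import product
from urllib.parse import urlencode

# NCBI E-utilities search endpoint (assumed here as one way to get PubMed hit counts).
ESEARCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"


def pairwise_queries(search_terms, modifier_terms, max_search=100, max_modifiers=10):
    """Return (search, modifier, query) triples for every search/modifier pair.

    The default limits mirror the approximate list sizes quoted in the abstract.
    """
    if len(search_terms) > max_search or len(modifier_terms) > max_modifiers:
        raise ValueError("term list exceeds the assumed size limit")
    return [
        (s, m, f"({s}) AND ({m})")
        for s, m in product(search_terms, modifier_terms)
    ]


def esearch_count_url(query):
    """Build an esearch URL that asks only for the number of matching records."""
    params = {"db": "pubmed", "term": query, "rettype": "count", "retmode": "json"}
    return f"{ESEARCH_URL}?{urlencode(params)}"


queries = pairwise_queries(["IL2", "TNF"], ["inflammation", "aging", "cancer"])
print(len(queries))  # 2 search terms x 3 modifiers = 6 pairs
for search, modifier, query in queries[:2]:
    print(search, modifier, esearch_count_url(query))
```

Each URL corresponds to one cell of the matrix; a real client would also need to respect NCBI's request rate limits when fetching up to 1,000 cells.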

CONCLUSIONS

In this way, lists of terms, such as gene names or functional assignments, can be assigned genetic, biological, or clinical relevance in a rapid, flexible, systematic fashion. http://pubmatrix.grc.nia.nih.gov/

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/964c/317283/5dea6c278775/1471-2105-4-61-1.jpg
