Structural Bioinformatics Lab, Universitat Pompeu Fabra-IMIM, Barcelona Research Park of Biomedicine, Barcelona, Catalonia, Spain.
BMC Bioinformatics. 2010 Jan 27;11:56. doi: 10.1186/1471-2105-11-56.
The analysis and usage of biological data is hindered by the spread of information across multiple repositories and the difficulties posed by different nomenclature systems and storage formats. In particular, there is an important need for data unification in the study and use of protein-protein interactions. Without good integration strategies, it is difficult to analyze the whole set of available data and its properties.
We introduce BIANA (Biologic Interactions and Network Analysis), a tool for biological information integration and network management. BIANA is a Python framework designed to achieve two major goals: i) the integration of multiple sources of biological information, including biological entities and their relationships, and ii) the management of biological information as a network where entities are nodes and relationships are edges. Moreover, BIANA uses properties of proteins and genes to infer latent biomolecular relationships by transferring edges to entities sharing similar properties. BIANA is also provided as a plugin for Cytoscape, which allows users to visualize and interactively manage the data. A web interface to BIANA providing basic functionalities is also available. The software can be downloaded under GNU GPL license from http://sbi.imim.es/web/BIANA.php.
BIANA's approach to data unification solves many of the nomenclature issues common to systems dealing with biological data. BIANA can easily be extended to handle new specific data repositories and new specific data types. The unification protocol allows BIANA to be a flexible tool suitable for different user requirements: non-expert users can use a suggested unification protocol while expert users can define their own specific unification rules.
生物数据的分析和使用受到多个存储库中信息传播的阻碍,以及不同命名系统和存储格式带来的困难。特别是在研究和使用蛋白质-蛋白质相互作用时,数据统一非常重要。如果没有良好的集成策略,就很难分析整套可用数据及其属性。
我们引入了 BIANA(生物相互作用和网络分析),这是一种用于生物信息集成和网络管理的工具。BIANA 是一个 Python 框架,旨在实现两个主要目标:i)整合多个生物信息源,包括生物实体及其关系;ii)将生物信息管理为一个网络,其中实体是节点,关系是边。此外,BIANA 使用蛋白质和基因的属性通过将边转移到具有相似属性的实体上来推断潜在的生物分子关系。BIANA 还作为 Cytoscape 的插件提供,允许用户可视化和交互管理数据。还提供了一个用于 BIANA 的基本功能的 Web 界面。该软件可在 http://sbi.imim.es/web/BIANA.php 下根据 GNU GPL 许可证下载。
BIANA 对数据统一的方法解决了处理生物数据的系统中常见的许多命名问题。BIANA 可以轻松扩展以处理新的特定数据存储库和新的特定数据类型。统一协议允许 BIANA 成为适合不同用户需求的灵活工具:非专家用户可以使用建议的统一协议,而专家用户可以定义自己的特定统一规则。