AlgoRun: a Docker-based packaging system for platform-agnostic implemented algorithms.

Authors

Hosny Abdelrahman, Vera-Licona Paola, Laubenbacher Reinhard, Favre Thibauld

Affiliations

Center for Quantitative Medicine.

Center for Quantitative Medicine, Department of Cell Biology, Institute for Systems Genomics, UConn Health, CT, USA.

Publication information

Bioinformatics. 2016 Aug 1;32(15):2396-8. doi: 10.1093/bioinformatics/btw120. Epub 2016 Mar 2.

Abstract

MOTIVATION

There is a growing need in bioinformatics for easy-to-use software implementations of algorithms that are usable across platforms. At the same time, reproducibility of computational results is critical and often a challenge due to source code changes over time and dependencies.

RESULTS

The approach introduced in this paper addresses both of these needs with AlgoRun, a dedicated packaging system for implemented algorithms, using Docker technology. Implemented algorithms, packaged with AlgoRun, can be executed through a user-friendly interface directly from a web browser or via a standardized RESTful web API to allow easy integration into more complex workflows. The packaged algorithm includes the entire software execution environment, thereby eliminating the common problem of software dependencies and the irreproducibility of computations over time. AlgoRun-packaged algorithms can be published on http://algorun.org, a centralized searchable directory to find existing AlgoRun-packaged algorithms.
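As a rough illustration of the web API route described above, the sketch below submits input to a packaged algorithm that is assumed to be already running as a local container. The abstract does not specify the API, so the `/v1/run` path, the `input` form field, and the port in the usage note are assumptions made for illustration; the helper names `build_run_request` and `run_algorithm` are hypothetical. Only the Python standard library is used.

```python
from urllib import parse, request


def build_run_request(base_url: str, input_data: str) -> request.Request:
    """Build a POST request that submits input data to a packaged algorithm.

    The "/v1/run" path and the "input" form field are assumptions made for
    this sketch; check the documentation of the container being called.
    """
    url = base_url.rstrip("/") + "/v1/run"
    # Form-encode the input so it travels as the POST body.
    body = parse.urlencode({"input": input_data}).encode("utf-8")
    return request.Request(url, data=body, method="POST")


def run_algorithm(base_url: str, input_data: str, timeout: float = 60.0) -> str:
    """Send the request and return the response body as text."""
    req = build_run_request(base_url, input_data)
    with request.urlopen(req, timeout=timeout) as response:
        return response.read().decode("utf-8")
```

With a container listening on, say, `http://localhost:31331` (a hypothetical port mapping), `run_algorithm("http://localhost:31331", "ACGT")` would return whatever the packaged algorithm writes as output. Keeping request construction separate from sending makes it possible to inspect the call without a running container, and a workflow tool could chain several such calls, which is the integration scenario the abstract has in mind.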

AVAILABILITY AND IMPLEMENTATION

AlgoRun is available at http://algorun.org, and the source code, released under the GPL license, is available at https://github.com/algorun.

CONTACT

laubenbacher@uchc.edu

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.
