Suppr超能文献

lefser:宏基因组生物标志物发现工具LEfSe在R语言中的实现

lefser: implementation of metagenomic biomarker discovery tool, LEfSe, in R.

作者信息

Khleborodova Asya, Gamboa-Tuz Samuel D, Ramos Marcel, Segata Nicola, Waldron Levi, Oh Sehyun

机构信息

Institute for Implementation Science in Population Health, City University of New York School of Public Health, New York, NY 10027, United States.

Department of Epidemiology and Biostatistics, City University of New York School of Public Health, New York, NY 10027, United States.

出版信息

Bioinformatics. 2024 Nov 28;40(12). doi: 10.1093/bioinformatics/btae707.

Abstract

SUMMARY

LEfSe is a widely used Python package and Galaxy module for metagenomic biomarker discovery and visualization, utilizing the Kruskal-Wallis test, Wilcoxon Rank-Sum test, and Linear Discriminant Analysis. R/Bioconductor provides a large collection of tools for metagenomic data analysis but has lacked an implementation of this widely used algorithm, hindering benchmarking against other tools and incorporation into R workflows. We present the lefser package to provide comparable functionality within the R/Bioconductor ecosystem of statistical analysis tools, with improvements to the original algorithm for performance, accuracy, and reproducibility. We benchmark the performance of lefser against the original algorithm using human and mouse metagenomic datasets.

AVAILABILITY AND IMPLEMENTATION

Our software, lefser, is distributed through the Bioconductor project (https://www.bioconductor.org/packages/release/bioc/html/lefser.html), and all the source code is available in the GitHub repository https://github.com/waldronlab/lefser.

摘要

摘要

LEfSe是一个广泛使用的Python包和Galaxy模块,用于宏基因组生物标志物的发现和可视化,它利用了Kruskal-Wallis检验、Wilcoxon秩和检验以及线性判别分析。R/Bioconductor提供了大量用于宏基因组数据分析的工具,但缺少这种广泛使用算法的实现,这阻碍了与其他工具的基准测试以及将其纳入R工作流程。我们提出了lefser包,以便在R/Bioconductor统计分析工具生态系统中提供可比的功能,并对原始算法在性能、准确性和可重复性方面进行了改进。我们使用人类和小鼠宏基因组数据集,将lefser的性能与原始算法进行了基准测试。

可用性和实现

我们的软件lefser通过Bioconductor项目(https://www.bioconductor.org/packages/release/bioc/html/lefser.html)进行分发,所有源代码可在GitHub存储库https://github.com/waldronlab/lefser中获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d60/11665633/76f034ecb800/btae707f1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验