Hybracter: enabling scalable, automated, complete and accurate bacterial genome assemblies.

Affiliations

Adelaide Medical School, Faculty of Health and Medical Sciences, The University of Adelaide, Adelaide, Australia.

The Department of Surgery - Otolaryngology Head and Neck Surgery, University of Adelaide and the Basil Hetzel Institute for Translational Health Research, Central Adelaide Local Health Network, Adelaide, South Australia, Australia.

Publication information

Microb Genom. 2024 May;10(5). doi: 10.1099/mgen.0.001244.

Abstract

Improvements in the accuracy and availability of long-read sequencing mean that complete bacterial genomes are now routinely reconstructed using hybrid (i.e. short- and long-read) assembly approaches. Complete genomes allow a deeper understanding of bacterial evolution and genomic variation beyond single nucleotide variants. They are also crucial for identifying plasmids, which often carry medically significant antimicrobial resistance genes. However, small plasmids are often missed or misassembled by long-read assembly algorithms. Here, we present Hybracter, which allows for the fast, automatic and scalable recovery of near-perfect complete bacterial genomes using a long-read-first assembly approach. Hybracter can be run either as a hybrid assembler or as a long-read-only assembler. We compared Hybracter to existing automated hybrid and long-read-only assembly tools using a diverse panel of samples of varying levels of long-read accuracy with manually curated ground truth reference genomes. We demonstrate that Hybracter as a hybrid assembler is more accurate and faster than the existing gold standard automated hybrid assembler, Unicycler. We also show that Hybracter with long reads only is the most accurate long-read-only assembler and is comparable to hybrid methods in accurately recovering small plasmids.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c549/11165638/b84791cc1dd2/mgen-10-01244-g001.jpg
