Suppr超能文献

一种用于分析克隆性暴发的仅基于纳米孔的开源测序工作流程可实现短读长水平的准确性。

An open-source nanopore-only sequencing workflow for analysis of clonal outbreaks delivers short-read level accuracy.

作者信息

Vereecke Nick, Yoon Thomas B, Luo Ting L, Corey Brendan W, Lebreton Francois, Mc Gann Patrick T, Dekker John P

机构信息

Bacterial Pathogenesis and Antimicrobial Resistance Section (BPARS), Laboratory of Clinical Immunology & Microbiology (LCIM), National Institute for Allergy and Infectious Disease (NIAID), National Institutes of Health (NIH), Bethesda, Maryland, USA.

Multidrug-Resistant Organism Repository and Surveillance Network (MRSN), Diagnostics and Countermeasures Branch, Center for Infectious Disease Research (CIDR), Walter Reed Army Institute of Research (WRAIR), Silver Spring, Maryland, USA.

出版信息

J Clin Microbiol. 2025 Jul 18:e0066425. doi: 10.1128/jcm.00664-25.

Abstract

UNLABELLED

In this work, we present an optimized nanopore long-read only sequencing workflow for epidemiologic analysis of clonal outbreaks built with open-source tools. A set of unrelated clinical isolates ( = 10) was chosen for workflow optimization, and sequencing libraries were prepared using a modified rapid barcoding strategy that incorporates temperature ramps to improve performance for high-GC content genomes. Sequencing data were used to benchmark the performance of the dorado suite (v0.9.1), including its basecaller, pre-assembly read error correction, and post-assembly polishing algorithms. All long-read assemblies and core genome multilocus sequence typing (cgMLST) were performed with Flye and pyMLST, respectively. Results were compared with a standard reference Illumina short-read approach, and discordant positions were determined at the core and whole-genome levels. Optimal performance was found with dorado sup@v5.0.0 basecalling with the inclusion of dorado error correction and dorado polish with its bacterial model. This workflow was then validated with four retrospective hospital outbreak isolate sets, including ( = 12), ( = 11), ( = 10), and ( = 10). The nanopore-only assemblies obtained from the optimized pipeline demonstrated fully concordant cgMLST-based minimum spanning trees compared to the Illumina short-read reference. At the whole-genome level, high concordance was also observed, with as few as two discordant positions per genome compared to short-read assemblies. This optimized library preparation and open-source computational workflow enables nanopore-only clonality and outbreak analysis with performance comparable to that of Illumina short-read sequencing and will contribute critically to hospital infection control.

IMPORTANCE

For the past decade, bacterial whole-genome sequencing has been performed using high-accuracy short-read sequencing. More recently, long-read sequencing with Oxford Nanopore Technologies (ONT) instruments has emerged as a potential alternative based on multiple advantages, including lower costs, portability, and speed. However, this platform has suffered from basecall error rates that were too high for many applications in clinical microbiology, including outbreak tracing. With the release of new flow cell chemistries and basecall algorithms, the accuracy has improved dramatically, making this approach feasible for outbreak investigations. In this work, we optimize a streamlined nanopore-only workflow for epidemiologic analysis of bacterial pathogens. The workflow was validated with isolates from four previously identified clinical outbreaks with varying GC content and demonstrated fully concordant cgMLST clustering as compared to short-read references. This workflow will facilitate the broader implementation of ONT-only genomes and cgMLST analysis to assist in hospital outbreaks worldwide.

摘要

未标注

在本研究中,我们展示了一种优化的仅使用纳米孔长读长测序的工作流程,用于使用开源工具对克隆性暴发进行流行病学分析。选择了一组不相关的临床分离株(n = 10)进行工作流程优化,并使用改良的快速条形码策略制备测序文库,该策略引入温度梯度以提高对高GC含量基因组的性能。测序数据用于评估多拉多软件包(v0.9.1)的性能,包括其碱基识别器、组装前读段纠错和组装后优化算法。所有长读长组装和核心基因组多位点序列分型(cgMLST)分别使用Flye和pyMLST进行。将结果与标准的参考Illumina短读长方法进行比较,并在核心和全基因组水平确定不一致的位置。发现使用多拉多sup@v5.0.0碱基识别并结合多拉多纠错和细菌模型的多拉多优化时性能最佳。然后使用四个回顾性医院暴发分离株集对该工作流程进行验证,包括金黄色葡萄球菌(n = 12)、肺炎克雷伯菌(n = 11)、鲍曼不动杆菌(n = 10)和铜绿假单胞菌(n = 10)。从优化流程获得的仅纳米孔组装与Illumina短读长参考相比,基于cgMLST的最小生成树完全一致。在全基因组水平,也观察到高度一致性,与短读长组装相比,每个基因组仅有两个不一致的位置。这种优化的文库制备和开源计算工作流程能够进行仅纳米孔的克隆性和暴发分析,其性能与Illumina短读长测序相当,将对医院感染控制做出重要贡献。

重要性

在过去十年中,细菌全基因组测序一直使用高精度短读长测序进行。最近,基于牛津纳米孔技术(ONT)仪器的长读长测序作为一种潜在的替代方法出现,具有多种优势,包括成本更低、便携性和速度更快。然而,该平台的碱基识别错误率对于临床微生物学中的许多应用(包括暴发追踪)来说过高。随着新的流动槽化学和碱基识别算法的发布,准确性有了显著提高,使得这种方法对于暴发调查可行。在本研究中,我们优化了一种简化的仅纳米孔工作流程用于细菌病原体的流行病学分析。该工作流程用来自四个先前确定的具有不同GC含量的临床暴发的分离株进行了验证,并且与短读长参考相比,显示出完全一致的cgMLST聚类。这种工作流程将促进仅ONT基因组和cgMLST分析的更广泛应用,以协助全球范围内的医院暴发调查。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验