Suppr超能文献

宏基因组学的真相:量化和抵消16S rRNA研究中的偏差

The truth about metagenomics: quantifying and counteracting bias in 16S rRNA studies.

作者信息

Brooks J Paul, Edwards David J, Harwich Michael D, Rivera Maria C, Fettweis Jennifer M, Serrano Myrna G, Reris Robert A, Sheth Nihar U, Huang Bernice, Girerd Philippe, Strauss Jerome F, Jefferson Kimberly K, Buck Gregory A

机构信息

Department of Statistical Sciences and Operations Research, Virginia Commonwealth University, 23284-3083, Richmond, VA, USA.

Center for the Study of Biological Complexity, Virginia Commonwealth University, 23284, Richmond, VA, USA.

出版信息

BMC Microbiol. 2015 Mar 21;15:66. doi: 10.1186/s12866-015-0351-6.

Abstract

BACKGROUND

Characterizing microbial communities via next-generation sequencing is subject to a number of pitfalls involving sample processing. The observed community composition can be a severe distortion of the quantities of bacteria actually present in the microbiome, hampering analysis and threatening the validity of conclusions from metagenomic studies. We introduce an experimental protocol using mock communities for quantifying and characterizing bias introduced in the sample processing pipeline. We used 80 bacterial mock communities comprised of prescribed proportions of cells from seven vaginally-relevant bacterial strains to assess the bias introduced in the sample processing pipeline. We created two additional sets of 80 mock communities by mixing prescribed quantities of DNA and PCR product to quantify the relative contribution to bias of (1) DNA extraction, (2) PCR amplification, and (3) sequencing and taxonomic classification for particular choices of protocols for each step. We developed models to predict the "true" composition of environmental samples based on the observed proportions, and applied them to a set of clinical vaginal samples from a single subject during four visits.

RESULTS

We observed that using different DNA extraction kits can produce dramatically different results but bias is introduced regardless of the choice of kit. We observed error rates from bias of over 85% in some samples, while technical variation was very low at less than 5% for most bacteria. The effects of DNA extraction and PCR amplification for our protocols were much larger than those due to sequencing and classification. The processing steps affected different bacteria in different ways, resulting in amplified and suppressed observed proportions of a community. When predictive models were applied to clinical samples from a subject, the predicted microbiome profiles were better reflections of the physiology and diagnosis of the subject at the visits than the observed community compositions.

CONCLUSIONS

Bias in 16S studies due to DNA extraction and PCR amplification will continue to require attention despite further advances in sequencing technology. Analysis of mock communities can help assess bias and facilitate the interpretation of results from environmental samples.

摘要

背景

通过新一代测序对微生物群落进行特征分析会受到许多与样本处理有关的陷阱影响。观察到的群落组成可能会严重扭曲微生物组中实际存在的细菌数量,从而妨碍分析并威胁宏基因组研究结论的有效性。我们引入了一种使用模拟群落的实验方案,用于量化和表征样本处理流程中引入的偏差。我们使用了80个细菌模拟群落,这些群落由来自七种与阴道相关的细菌菌株的规定比例的细胞组成,以评估样本处理流程中引入的偏差。我们通过混合规定量的DNA和PCR产物创建了另外两组各80个模拟群落,以量化(1)DNA提取、(2)PCR扩增以及(3)针对每个步骤的特定方案选择的测序和分类对偏差的相对贡献。我们开发了模型,根据观察到的比例预测环境样本的“真实”组成,并将其应用于一名受试者在四次就诊期间采集的一组临床阴道样本。

结果

我们观察到使用不同的DNA提取试剂盒会产生截然不同的结果,但无论选择哪种试剂盒都会引入偏差。我们在一些样本中观察到偏差导致的错误率超过85%,而大多数细菌的技术变异非常低,不到5%。我们的方案中DNA提取和PCR扩增的影响远大于测序和分类的影响。处理步骤以不同方式影响不同细菌,导致群落观察比例的放大和抑制。当将预测模型应用于受试者的临床样本时,预测的微生物组谱比观察到的群落组成更能反映受试者在就诊时的生理状况和诊断结果。

结论

尽管测序技术有了进一步发展,但由于DNA提取和PCR扩增导致的16S研究中的偏差仍需持续关注。对模拟群落的分析有助于评估偏差并促进对环境样本结果的解释。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f68a/4433096/765364841b9c/12866_2015_351_Fig1_HTML.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验