Suppr超能文献

基于 PhenoGraph 和二进制编码基因组数据的数据驱动 SARS-CoV-2 亚群鉴定。

Data-driven identification of SARS-CoV-2 subpopulations using PhenoGraph and binary-coded genomic data.

机构信息

Fifth Affiliated Hospital of Guangzhou Medical University, Guangzhou 510700, China.

Guangzhou Nanxin Pharmaceutical Co., Ltd., Guangzhou 510700, China.

出版信息

Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab307.

Abstract

For epidemic prevention and control, the identification of SARS-CoV-2 subpopulations sharing similar micro-epidemiological patterns and evolutionary histories is necessary for a more targeted investigation into the links among COVID-19 outbreaks caused by SARS-CoV-2 with similar genetic backgrounds. Genomic sequencing analysis has demonstrated the ability to uncover viral genetic diversity. However, an objective analysis is necessary for the identification of SARS-CoV-2 subpopulations. Herein, we detected all the mutations in 186 682 SARS-CoV-2 isolates. We found that the GC content of the SARS-CoV-2 genome had evolved to be lower, which may be conducive to viral spread, and the frameshift mutation was rare in the global population. Next, we encoded the genomic mutations in binary form and used an unsupervised learning classifier, namely PhenoGraph, to classify this information. Consequently, PhenoGraph successfully identified 303 SARS-CoV-2 subpopulations, and we found that the PhenoGraph classification was consistent with, but more detailed and precise than the known GISAID clades (S, L, V, G, GH, GR, GV and O). By the change trend analysis, we found that the growth rate of SARS-CoV-2 diversity has slowed down significantly. We also analyzed the temporal, spatial and phylogenetic relationships among the subpopulations and revealed the evolutionary trajectory of SARS-CoV-2 to a certain extent. Hence, our results provide a better understanding of the patterns and trends in the genomic evolution and epidemiology of SARS-CoV-2.

摘要

为了进行疫情防控,有必要识别具有相似微观流行病学模式和进化历史的 SARS-CoV-2 亚群,以便更有针对性地调查具有相似遗传背景的 SARS-CoV-2 引起的 COVID-19 爆发之间的联系。基因组测序分析已经证明了揭示病毒遗传多样性的能力。然而,需要进行客观分析才能识别 SARS-CoV-2 亚群。在此,我们检测了 186682 个 SARS-CoV-2 分离株中的所有突变。我们发现,SARS-CoV-2 基因组的 GC 含量已经进化得更低,这可能有利于病毒传播,而全球人群中的移码突变很少见。接下来,我们将基因组突变编码为二进制形式,并使用无监督学习分类器 PhenoGraph 对该信息进行分类。结果,PhenoGraph 成功地识别出了 303 个 SARS-CoV-2 亚群,我们发现 PhenoGraph 分类与已知的 GISAID 进化枝(S、L、V、G、GH、GR、GV 和 O)一致,但更详细和精确。通过变化趋势分析,我们发现 SARS-CoV-2 多样性的增长率显著放缓。我们还分析了亚群之间的时间、空间和系统发育关系,并在一定程度上揭示了 SARS-CoV-2 的进化轨迹。因此,我们的结果提供了对 SARS-CoV-2 基因组进化和流行病学模式和趋势的更好理解。

相似文献

2
Co-mutation modules capture the evolution and transmission patterns of SARS-CoV-2.
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab222.
3
Phylogenetic classification of the whole-genome sequences of SARS-CoV-2 from India & evolutionary trends.
Indian J Med Res. 2021;153(1 & 2):166-174. doi: 10.4103/ijmr.IJMR_3418_20.
6
Comparative Genomics Reveals Early Emergence and Biased Spatiotemporal Distribution of SARS-CoV-2.
Mol Biol Evol. 2021 May 19;38(6):2547-2565. doi: 10.1093/molbev/msab049.
7
Higher entropy observed in SARS-CoV-2 genomes from the first COVID-19 wave in Pakistan.
PLoS One. 2021 Aug 31;16(8):e0256451. doi: 10.1371/journal.pone.0256451. eCollection 2021.
10
Mutation profile of SARS-CoV-2 genome in a sample from the first year of the pandemic in Colombia.
Infect Genet Evol. 2022 Jan;97:105192. doi: 10.1016/j.meegid.2021.105192. Epub 2021 Dec 18.

引用本文的文献

1
Automated cytometric gating with human-level performance using bivariate segmentation.
Nat Commun. 2025 Feb 12;16(1):1576. doi: 10.1038/s41467-025-56622-2.
2
Automated Cytometric Gating with Human-Level Performance Using Bivariate Segmentation.
bioRxiv. 2024 May 9:2024.05.06.592739. doi: 10.1101/2024.05.06.592739.
3
Nanopore sequencing technology and its applications.
MedComm (2020). 2023 Jul 10;4(4):e316. doi: 10.1002/mco2.316. eCollection 2023 Aug.
4
Towards Efficient and Accurate SARS-CoV-2 Genome Sequence Typing Based on Supervised Learning Approaches.
Microorganisms. 2022 Sep 4;10(9):1785. doi: 10.3390/microorganisms10091785.
5
Genomic diversity of SARS-CoV-2 in Oxford during United Kingdom's first national lockdown.
Sci Rep. 2021 Nov 2;11(1):21484. doi: 10.1038/s41598-021-01022-x.

本文引用的文献

1
Recombinant SARS-CoV-2 genomes circulated at low levels over the first year of the pandemic.
Virus Evol. 2021 Jul 15;7(2):veab059. doi: 10.1093/ve/veab059. eCollection 2021 Sep.
2
On the origin and continuing evolution of SARS-CoV-2.
Natl Sci Rev. 2020 Jun;7(6):1012-1023. doi: 10.1093/nsr/nwaa036. Epub 2020 Mar 3.
3
Quasispecies of SARS-CoV-2 revealed by single nucleotide polymorphisms (SNPs) analysis.
Virulence. 2021 Dec;12(1):1209-1226. doi: 10.1080/21505594.2021.1911477.
4
Rapid detection of inter-clade recombination in SARS-CoV-2 with Bolotie.
Genetics. 2021 Jul 14;218(3). doi: 10.1093/genetics/iyab074.
5
Phylogenetic classification of the whole-genome sequences of SARS-CoV-2 from India & evolutionary trends.
Indian J Med Res. 2021;153(1 & 2):166-174. doi: 10.4103/ijmr.IJMR_3418_20.
6
SARS-CoV-2 variants B.1.351 and P.1 escape from neutralizing antibodies.
Cell. 2021 Apr 29;184(9):2384-2393.e12. doi: 10.1016/j.cell.2021.03.036. Epub 2021 Mar 20.
7
Antibody resistance of SARS-CoV-2 variants B.1.351 and B.1.1.7.
Nature. 2021 May;593(7857):130-135. doi: 10.1038/s41586-021-03398-2. Epub 2021 Mar 8.
8
Resistance of SARS-CoV-2 variants to neutralization by monoclonal and serum-derived polyclonal antibodies.
Nat Med. 2021 Apr;27(4):717-726. doi: 10.1038/s41591-021-01294-w. Epub 2021 Mar 4.
9
Population Bottlenecks and Intra-host Evolution During Human-to-Human Transmission of SARS-CoV-2.
Front Med (Lausanne). 2021 Feb 15;8:585358. doi: 10.3389/fmed.2021.585358. eCollection 2021.
10
Estimated transmissibility and impact of SARS-CoV-2 lineage B.1.1.7 in England.
Science. 2021 Apr 9;372(6538). doi: 10.1126/science.abg3055. Epub 2021 Mar 3.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验