Department of Medical Laboratory Science and Biotechnology, China Medical University, Taichung 40402, Taiwan.
Department of Biotechnology, Asia University, Taichung 41354, Taiwan.
Infect Genet Evol. 2021 Jul;91:104800. doi: 10.1016/j.meegid.2021.104800. Epub 2021 Mar 5.
The pandemic spread of coronavirus disease 2019 (COVID-19) has continued since severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) was identified as the etiologic pathogen in late December 2019. After more than six months of spread, SARS-CoV-2 poses a critical threat to global public health and the economy. Because investigating the evolution and genotyping of its genetic variation is of great importance, the present study characterized the molecular variation of SARS-CoV-2 by analyzing 4230 complete genome sequences from samples collected worldwide during the first six months of the pandemic. Phylogenetic tree analysis with the Neighbor-Joining and Maximum-Parsimony methods indicated that the haplotypes of SARS-CoV-2 genome sequences fell into four clades with unique nucleotide and amino acid changes: T27879C (ORF8 L84S) in clade 1 (25.34%); A23138G (spike D614G) in clade 2 (63.54%); G10818T (nsp6 L37F), C14540T (nsp12 T442I), and G25879T (ORF3a V251F) in clade 3 (2.58%); and miscellaneous changes in clade 4 (8.54%). Interestingly, subclade 2B, with the amino acid changes nsp2 T85I, spike D614G, and ORF3a Q57H, was first reported on March 4, 2020 in the United States of America and became the most frequent sub-haplogroup in the world (36.21%) and in America (45.81%). Subclade 1C, with the amino acid changes nsp13 P504L and ORF8 L84S, became the second most frequent sub-haplogroup in the world (19.91%) and in America (26.29%). Subclade 2A, with the amino acid changes spike D614G and nucleocapsid R203K and G204R, was highly prevalent in Asia (18.82%) and Europe (29.72%). The study highlights the notable clades and subclades with unique mutations, revealing their genetic and geographical relevance six months into the COVID-19 outbreak. By thoroughly characterizing the genetic features of SARS-CoV-2 haplotypes, this study provides an epidemiological trend of COVID-19.
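As a rough illustration of how the clade-defining nucleotide changes listed in the abstract could be used as lookup markers, the following Python sketch assigns a genome to a clade from its list of observed changes. It is not the study's method, which relied on Neighbor-Joining and Maximum-Parsimony trees. The function name `assign_clade`, the rule that every clade 3 marker must be present, and the order in which clades are checked are all assumptions made for this example. Positions follow the numbering used in the abstract.

```python
# Marker nucleotide changes per clade, copied from the abstract.
# Positions use the abstract's own coordinate numbering.
CLADE_MARKERS = {
    "clade 1": {"T27879C"},                        # ORF8 L84S
    "clade 2": {"A23138G"},                        # spike D614G
    "clade 3": {"G10818T", "C14540T", "G25879T"},  # nsp6 L37F, nsp12 T442I, ORF3a V251F
}


def assign_clade(changes):
    """Return the first clade whose markers are all present, else 'clade 4'.

    Assumption: a genome must carry every marker of a clade to be assigned
    to it, and clades are checked in the order listed above. Anything that
    matches no marker set falls into the miscellaneous clade 4.
    """
    observed = set(changes)
    for clade, markers in CLADE_MARKERS.items():
        if markers <= observed:  # all markers of this clade were observed
            return clade
    return "clade 4"


if __name__ == "__main__":
    print(assign_clade(["A23138G", "C1000T"]))
    print(assign_clade(["G10818T"]))
```

A marker lookup like this only approximates tree-based haplotyping; genomes with recurrent or partially missing markers would need the phylogenetic analysis the study describes.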