Moxley Tristan A, Johnson-Leung Jennifer, Seamon Erich, Williams Christopher, Ridenhour Benjamin J
Bioinformatics and Computational Biology Program, University of Idaho, Moscow, ID, United States of America.
Department of Mathematics and Statistical Science, University of Idaho, Moscow, ID, United States of America.
PLoS One. 2024 Jan 26;19(1):e0297065. doi: 10.1371/journal.pone.0297065. eCollection 2024.
COVID-19 has been at the forefront of global concern since its emergence in December of 2019. Determining the social factors that drive case incidence is paramount to mitigating disease spread. We gathered data from the Social Vulnerability Index (SVI) along with Democratic voting percentage to attempt to understand which county-level sociodemographic metrics had a significant correlation with case rate for COVID-19.
We used elastic net regression due to issues with variable collinearity and model overfitting. Our modelling framework included using the ten Health and Human Services regions as submodels for the two time periods 22 March 2020 to 15 June 2021 (prior to the Delta time period) and 15 June 2021 to 1 November 2021 (the Delta time period).
Statistically, elastic net improved prediction when compared to multiple regression, as almost every HHS model consistently had a lower root mean square error (RMSE) and satisfactory R2 coefficients. These analyses show that the percentage of minorities, disabled individuals, individuals living in group quarters, and individuals who voted Democratic correlated significantly with COVID-19 attack rate as determined by Variable Importance Plots (VIPs).
The percentage of minorities per county correlated positively with cases in the earlier time period and negatively in the later time period, which complements previous research. In contrast, higher percentages of disabled individuals per county correlated negatively in the earlier time period. Counties with an above average percentage of group quarters experienced a high attack rate early which then diminished in significance after the primary vaccine rollout. Higher Democratic voting consistently correlated negatively with cases, coinciding with previous findings regarding a partisan divide in COVID-19 cases at the county level. Our findings can assist regional policymakers in distributing resources to more vulnerable counties in future pandemics based on SVI.
自2019年12月新冠病毒出现以来,一直是全球关注的焦点。确定推动病例发生率的社会因素对于减轻疾病传播至关重要。我们收集了社会脆弱性指数(SVI)以及民主党投票率的数据,试图了解哪些县级社会人口统计学指标与新冠病毒病例率存在显著相关性。
由于变量共线性和模型过度拟合问题,我们使用了弹性网络回归。我们的建模框架包括将十个卫生与公众服务部地区用作2020年3月22日至2021年6月15日(德尔塔时期之前)和2021年6月15日至2021年11月1日(德尔塔时期)这两个时间段的子模型。
从统计学角度来看,与多元回归相比,弹性网络改善了预测效果,因为几乎每个卫生与公众服务部模型的均方根误差(RMSE)始终较低,且R2系数令人满意。这些分析表明,根据变量重要性图(VIP)确定,少数族裔、残疾人士、居住在集体宿舍的人员以及投票给民主党的人员的比例与新冠病毒感染率显著相关。
每个县的少数族裔比例在早期与病例呈正相关,在后期呈负相关,这与先前的研究结果相补充。相比之下,每个县残疾人士比例较高在早期呈负相关。集体宿舍比例高于平均水平的县在早期经历了高感染率,在主要疫苗推出后其重要性随后降低。较高的民主党投票率始终与病例呈负相关,这与之前关于县级新冠病毒病例存在党派分歧的研究结果一致。我们的研究结果可以帮助地区政策制定者在未来疫情中根据社会脆弱性指数向更脆弱的县分配资源。