Fitting stochastic epidemic models to gene genealogies using linear noise approximation.

Author information

Tang Mingwei, Dudas Gytis, Bedford Trevor, Minin Vladimir N

Affiliations

Department of Statistics, University of Washington, Seattle.

Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center.

Publication information

Ann Appl Stat. 2023 Mar;17(1):1-22. doi: 10.1214/21-aoas1583. Epub 2023 Jan 24.

Abstract

Phylodynamics is a set of population genetics tools that aim to reconstruct the demographic history of a population from molecular sequences of individuals sampled from that population. One important task in phylodynamics is to estimate changes in (effective) population size. When applied to infectious disease sequences, such estimates of population size trajectories can provide information about changes in the number of infections. To model changes in the number of infected individuals, current phylodynamic methods use non-parametric approaches (e.g., Bayesian curve-fitting based on change-point models or Gaussian process priors), parametric approaches (e.g., based on differential equations), and stochastic modeling in conjunction with likelihood-free Bayesian methods. The first class of methods yields results that are hard to interpret epidemiologically. The second class provides estimates of important epidemiological parameters, such as infection and removal/recovery rates, but ignores variation in the dynamics of infectious disease spread. The third class is the most advantageous statistically, but relies on computationally intensive particle filtering techniques, which limits its applications. We propose a Bayesian model that combines phylodynamic inference and stochastic epidemic models, and achieves computational tractability by using a linear noise approximation (LNA), a technique that allows us to approximate probability densities of stochastic epidemic model trajectories. LNA opens the door to using modern Markov chain Monte Carlo tools to approximate the joint posterior distribution of the disease transmission parameters and of high-dimensional vectors describing unobserved changes in the stochastic epidemic model compartment sizes (e.g., numbers of infectious and susceptible individuals). In a simulation study, we show that our method can successfully recover parameters of stochastic epidemic models. We apply our estimation technique to Ebola genealogies estimated using viral genetic data from the 2014 epidemic in Sierra Leone and Liberia.
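
To make the LNA idea in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It assumes a simple SIR model with infection rate beta*S*I and recovery rate gamma*I, and uses plain Euler steps to integrate the standard LNA ordinary differential equations for the mean and covariance of (S, I). The function name, parameter values, and step count are all hypothetical choices for illustration.

```python
import numpy as np

def sir_lna_moments(s0, i0, beta, gamma, t_end, n_steps=1000):
    """Euler-integrate the LNA mean and covariance ODEs for (S, I).

    Assumed model (illustrative only): infection at rate beta*S*I,
    recovery at rate gamma*I, starting from a known state (s0, i0).
    """
    # Stoichiometry: rows are reactions (infection, recovery),
    # columns are compartments (S, I).
    A = np.array([[-1.0, 1.0],
                  [0.0, -1.0]])
    eta = np.array([s0, i0], dtype=float)  # deterministic mean path
    phi = np.zeros((2, 2))                 # covariance, zero at a known start
    dt = t_end / n_steps
    for _ in range(n_steps):
        s, i = eta
        h = np.array([beta * s * i, gamma * i])  # reaction rates
        # Jacobian of the rates with respect to (S, I).
        dh = np.array([[beta * i, beta * s],
                       [0.0, gamma]])
        F = A.T @ dh                             # Jacobian of the drift
        d_eta = A.T @ h                          # drift of the mean
        d_phi = F @ phi + phi @ F.T + A.T @ np.diag(h) @ A
        eta = eta + dt * d_eta
        phi = phi + dt * d_phi
    return eta, phi

# Hypothetical example: 990 susceptible, 10 infectious, 5 time units.
mean, cov = sir_lna_moments(990, 10, beta=0.0005, gamma=0.2, t_end=5.0)
```

Under these assumptions, the state at time `t_end` is approximated by a bivariate Gaussian with mean `mean` and covariance `cov`. Having a closed-form Gaussian density of this kind is what allows trajectory likelihoods to be evaluated without particle filtering.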
