Almudevar Anthony, McCall Matthew N, McMurray Helene, Land Hartmut
University of Rochester, USA.
Stat Appl Genet Mol Biol. 2011 Oct 5;10(1):/j/sagmb.2011.10.issue-1/1544-6115.1727/1544-6115.1727.xml. doi: 10.2202/1544-6115.1727.
Gene perturbation experiments are commonly used for the reconstruction of gene regulatory networks. Typical experimental methodology imposes persistent changes on the network. The resulting data must therefore be interpreted as a steady state from an altered gene regulatory network, rather than a direct observation of the original network. In this article an implicit modeling methodology is proposed in which the unperturbed network of interest is scored by first modeling the persistent perturbation, then predicting the steady state, which may then be compared to the observed data. This results in a many-to-one inverse problem, so a computational Bayesian approach is used to assess model uncertainty. The methodology is first demonstrated on a number of synthetic networks. It is shown that the Bayesian approach correctly assigns high posterior probability to the network structure and steady state behavior. Further, it is demonstrated that where uncertainty of model features is indicated, the uncertainty may be accurately resolved with further perturbation experiments. The methodology is then applied to the modeling of a gene regulatory network using perturbation data from nine genes which have been shown to respond synergistically to known oncogenic mutations. A hypothetical model emerges which conforms to reported regulatory properties of these genes. Furthermore, the Bayesian methodology is shown to be consistent in the sense that multiple randomized applications of the fitting algorithm converge to an approximately common posterior density on the space of models. Such consistency is generally not feasible for algorithms which report only single models. We conclude that fully Bayesian methods, coupled with models which accurately account for experimental constraints, are a suitable tool for the inference of gene regulatory networks, in terms of accuracy, estimation of model uncertainty, and experimental design.
基因扰动实验通常用于基因调控网络的重建。典型的实验方法会对网络施加持续的变化。因此,所得数据必须被解释为来自改变后的基因调控网络的稳态,而不是对原始网络的直接观察。在本文中,我们提出了一种隐式建模方法,其中通过首先对持续扰动进行建模,然后预测稳态来对感兴趣的未扰动网络进行评分,然后可以将其与观察到的数据进行比较。这导致了一个多对一的逆问题,因此使用计算贝叶斯方法来评估模型不确定性。该方法首先在一些合成网络上进行了演示。结果表明,贝叶斯方法正确地将高后验概率分配给网络结构和稳态行为。此外,结果表明,在指出模型特征的不确定性的情况下,可以通过进一步的扰动实验准确地解决不确定性。然后,该方法应用于使用来自九个基因的扰动数据对基因调控网络进行建模,这些基因已被证明对已知致癌突变有协同反应。出现了一个符合这些基因报道的调控特性的假设模型。此外,贝叶斯方法在拟合算法的多次随机应用在模型空间上收敛到近似共同的后验密度的意义上被证明是一致的。对于只报告单个模型的算法来说,这种一致性通常是不可行的。我们得出结论,就准确性、模型不确定性估计和实验设计而言,完全贝叶斯方法与准确考虑实验约束的模型相结合,是推断基因调控网络的合适工具。