Rougemont Jacques, Naef Felix
Bioinformatics and Biostatistics Core Facility, Swiss Institute of Bioinformatics, Ecole Polytechnique Fédérale (EPFL), Lausanne, Switzerland.
Methods Mol Biol. 2012;786:263-73. doi: 10.1007/978-1-61779-292-2_16.
Chromatin immunoprecipitation experiments followed by ultra-high-throughput sequencing (ChIP-seq) is becoming the method of choice to identify transcription factor binding sites in prokaryotes and eukaryotes in vivo. Here, we review the computational steps that are necessary for analyzing the sequenced chromatin fragments, including mapping of short reads onto reference genomes, normalization of multiple conditions, detection of bona fide peaks or binding sites, annotation of sites, characterization of sequence-specific binding affinities, and relationships with biophysical models for protein-DNA interactions. The goal is that following the indicated steps will help the discovery of novel mechanisms underlying transcription regulation in a broad range of experimental systems.
染色质免疫沉淀实验结合超高通量测序(ChIP-seq)正成为在体内鉴定原核生物和真核生物中转录因子结合位点的首选方法。在此,我们回顾分析测序染色质片段所需的计算步骤,包括将短读段映射到参考基因组、多种条件的标准化、真实峰或结合位点的检测、位点注释、序列特异性结合亲和力的表征以及与蛋白质 - DNA相互作用生物物理模型的关系。目标是遵循所指示的步骤将有助于在广泛的实验系统中发现转录调控的新机制。