An effective processing pipeline for harmonizing DNA methylation data from Illumina's 450K and EPIC platforms for epidemiological studies.

Author affiliations

Department of Biostatistics and Informatics, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, CO, USA.

Department of Epidemiology, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, CO, USA.

Publication information

BMC Res Notes. 2021 Sep 8;14(1):352. doi: 10.1186/s13104-021-05741-2.

DOI:10.1186/s13104-021-05741-2

PMID:34496950

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8424820/

Abstract

OBJECTIVE

Illumina BeadChip arrays are commonly used to generate DNA methylation data for large epidemiological studies. Updates in technology over time create challenges for data harmonization within and between studies, many of which obtained data from both the older 450K and the newer EPIC platforms. The pre-processing pipeline for DNA methylation is not trivial, and it influences downstream analyses. Incorporating different platforms adds a new level of technical variability that recommended pipelines have not yet taken into account. Our study evaluated how well various tools harmonize data across platform versions at each step of the pre-processing pipeline, including quality control (QC), normalization, batch effect adjustment, and correction for genomic inflation. We illustrate our novel approach using 450K and EPIC data from the Diabetes Autoimmunity Study in the Young (DAISY) prospective cohort.
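As a rough illustration of the genomic inflation step named above, the sketch below computes one common definition of the inflation factor, lambda, as the median observed 1-df chi-square statistic divided by its expected median under the null. The abstract does not say which tool or definition the authors used, so this is an assumption, and the function names `genomic_inflation` and `deflate_z` are hypothetical.

```python
from statistics import NormalDist, median

_STD_NORMAL = NormalDist()
# Median of a chi-square distribution with 1 degree of freedom (about 0.455).
CHI2_1DF_MEDIAN = _STD_NORMAL.inv_cdf(0.75) ** 2


def genomic_inflation(p_values):
    """Return lambda: median observed chi-square over the expected null median.

    Assumes two-sided p-values strictly between 0 and 1 (p = 1 is allowed).
    """
    # Convert each two-sided p-value back to a 1-df chi-square statistic.
    chi2 = [_STD_NORMAL.inv_cdf(p / 2.0) ** 2 for p in p_values]
    return median(chi2) / CHI2_1DF_MEDIAN


def deflate_z(z_scores, lam):
    """Classic genomic-control style correction: divide z by sqrt(lambda).

    Statistics are left unchanged when lambda is below 1 (an assumed convention).
    """
    scale = max(lam, 1.0) ** 0.5
    return [z / scale for z in z_scores]
```

A lambda near 1 suggests little inflation; values well above 1 suggest that test statistics are systematically too large, for example because of unmodeled technical or biological structure.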

RESULTS

We found that normalization and probe filtering had the largest effect on data harmonization. Employing a meta-analysis was an effective and easily executable method for accounting for platform variability. Correcting for genomic inflation also helped with harmonization. We present guidelines for studies seeking to harmonize data from the 450K and EPIC platforms, which include using technical replicates to evaluate the pre-processing steps and employing a meta-analysis.
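To make the meta-analysis idea concrete, here is a minimal sketch that assumes each platform has been analyzed separately and that per-probe effect estimates and standard errors are combined with a fixed-effect, inverse-variance weighted model. The abstract does not specify the meta-analysis model or software, so the model choice, the function names `fixed_effect_meta` and `meta_by_probe`, and the input layout are all assumptions for illustration.

```python
from math import sqrt
from statistics import NormalDist


def fixed_effect_meta(estimates):
    """Combine (beta, se) pairs by inverse-variance weighting.

    Returns the pooled beta, its standard error, the z statistic,
    and a two-sided p-value from the standard normal distribution.
    """
    weights = [1.0 / se ** 2 for _, se in estimates]
    total = sum(weights)
    beta = sum(w * b for w, (b, _) in zip(weights, estimates)) / total
    se = sqrt(1.0 / total)
    z = beta / se
    p = 2.0 * NormalDist().cdf(-abs(z))
    return beta, se, z, p


def meta_by_probe(results_450k, results_epic):
    """Meta-analyze probes measured on both platforms.

    Each input maps a probe ID to a (beta, se) tuple from a
    platform-specific analysis; probes on only one array are skipped.
    """
    shared = results_450k.keys() & results_epic.keys()
    return {
        cpg: fixed_effect_meta([results_450k[cpg], results_epic[cpg]])
        for cpg in sorted(shared)
    }
```

Analyzing each platform on its own and pooling afterwards keeps platform-specific technical variation out of the individual models, which is one plausible reading of why a meta-analysis is easy to execute in this setting.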

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b07e/8424820/6b76de7579c7/13104_2021_5741_Fig1_HTML.jpg
