Complete reference genome and pangenome improve genome-wide detection and interpretation of DNA methylation using sequencing and array data
Date
Authors
MacIsaac, Julia L.
Rehkopf, David H.
Rosero Bixby, Luis
Kobor, Michael S.
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
The complete telomere-to-telomere human genome assembly (T2T-CHM13) and the draft human pangenome reference provide unique opportunities to refine DNA methylation (DNAm) studies. Here, we find that T2T-CHM13 calls 7.4% more CpGs genome wide compared to GRCh38 across four widely used short-read DNAm profiling methods and improves the evaluation of probe cross-reactivity and mismatch for Illumina DNAm arrays, yielding new and more reproducible sets of unambiguous probes. The pangenome reference further expands CpG calling by 4.5% in short-read sequencing data and identifies cross-population and population-specific unambiguous probes in DNAm arrays, owing to its improved representation of genetic diversity. These benefits facilitate the discovery of biologically relevant DNAm alterations in epigenome-wide association studies (EWASs). For instance, additional DNAm alterations enriched in cancer-related genes and pathways are identified in cancer EWASs. Together, this study highlights the practical applications of T2T-CHM13 and pangenome for genome biology and provides a basis for expansion of DNAm investigations.
Description
Keywords
GENOMA, GENOMA HUMANO, CANCER, Genética humana