A computational model for correcting intrinsic enzymatic biases in chromatin accessibility profiling data

Chongzhi Zang; Shengen Shawn Hu

A computational model for correcting intrinsic enzymatic biases in chromatin accessibility profiling data

ORAL

Abstract

Genome-wide profiling of chromatin accessibility using high-throughput sequencing-based techniques such as ATAC-seq (with Tn5 transposase) or DNase-seq (with DNase I) has been widely used for studying regulatory DNA elements and transcriptional regulation. Enzymatic DNA cleavage exhibits intrinsic sequence biases that confound data analysis. We developed a simplex encoding-based mathematical model for accurate estimation of such sequence biases. Here we present a computational method for correcting intrinsic biases from both bulk and single-cell sequencing data. We show that bias correction can improve transcription factor binding prediction from DNase footprints and can generate more accurate cell clustering from single-cell ATAC-seq data. This work demonstrates that innovative quantitative modeling can enhance conventional bioinformatics analysis and extract biologically meaningful information from complicated high-throughput data.

March 10, 2023, 4:06 PM – March 10, 2023, 4:18 PM

Publication: Hu, S.S. et al. Intrinsic bias estimation for improved analysis of bulk and single-cell chromatin accessibility profiles using SELMA. Nature Communications 13, 5533 (2022).

Presenters

Chongzhi Zang

University of Virginia

Authors

Chongzhi Zang

University of Virginia
Shengen Shawn Hu

University of Virginia