HiC-ACT: Improved Detection of Chromatin Interactions from Hi-C Data via Aggregated Cauchy Test
Abstract
Genome-wide chromatin conformation capture technologies such as Hi-C are commonly employed to study chromatin spatial organization. To identify statistically significant long-range chromatin interactions from Hi-C data, most existing methods, such as Fit-Hi-C/FitHiC2 and HiCCUPS, assume that all chromatin interactions are statistically independent. This independence assumption is reasonable at low resolution (e.g., 40Kb bins) but invalid at high resolution (e.g., 5 or 10Kb bins), where the spatial dependency among neighboring chromatin interactions is non-negligible. Our previous hidden Markov random field-based methods accommodate spatial dependency but are computationally intensive. Approaches that model spatial dependence in a computationally efficient and scalable manner are therefore urgently needed. Here, we develop HiC-ACT, an aggregated Cauchy test (ACT) based approach, to improve the detection of chromatin interactions by post-processing results from methods that assume independence. To benchmark the performance of HiC-ACT, we re-analyzed deeply sequenced Hi-C data from a human lymphoblastoid cell line (GM12878) and a mouse embryonic stem cell line (mESC). Our results demonstrate the advantages of HiC-ACT in improving sensitivity with controlled type-I error. By leveraging information from neighboring chromatin interactions, HiC-ACT enhances the power to detect interactions that have a lower signal-to-noise ratio yet similar (if not stronger) epigenetic signatures suggestive of regulatory roles. We further demonstrate that HiC-ACT peaks show higher overlap with known enhancers than Fit-Hi-C/FitHiC2 peaks in both GM12878 and mESC. HiC-ACT, effectively a summary statistic-based approach, is computationally efficient (~6 minutes and ~2GB of memory to process 25,000 pairwise interactions).
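As a rough illustration of the idea behind an aggregated Cauchy test, the sketch below combines a bin pair's p-value with those of its neighbors using the generic Cauchy combination rule: each p-value is transformed to a Cauchy variate, the variates are averaged with weights, and the average is mapped back to a p-value. This is a minimal sketch, not the HiC-ACT implementation. The function name `cauchy_combine` and the default of equal weights are assumptions made for illustration. The abstract does not specify how neighboring interactions are selected or weighted.

```python
import math


def cauchy_combine(pvalues, weights=None):
    """Combine p-values with the Cauchy combination rule (illustrative sketch).

    pvalues: p-values of one bin pair and its neighbors (hypothetical input).
    weights: non-negative weights; equal weights are assumed if omitted.
    """
    if not pvalues:
        raise ValueError("at least one p-value is required")
    if weights is None:
        weights = [1.0] * len(pvalues)  # assumption: equal weighting
    if len(weights) != len(pvalues):
        raise ValueError("weights and pvalues must have the same length")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")

    eps = 1e-15  # keep p-values away from 0 and 1 so tan() stays finite
    stat = 0.0
    for p, w in zip(pvalues, weights):
        p = min(max(p, eps), 1.0 - eps)
        # A Uniform(0, 1) p-value maps to a standard Cauchy variate.
        stat += (w / total) * math.tan((0.5 - p) * math.pi)

    # Upper tail probability of the standard Cauchy distribution at `stat`.
    return 0.5 - math.atan(stat) / math.pi


# Hypothetical p-values for a centre bin pair and two of its neighbors.
combined = cauchy_combine([0.01, 0.04, 0.20])
print(f"combined p-value: {combined:.4g}")
```

Because the combined statistic is a weighted average of Cauchy variates, its tail remains approximately Cauchy even when the input p-values are correlated, which is why a rule of this kind suits post-processing results from methods that assume independence.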