Javascript must be enabled to continue!
Interpretable Convolution Methods for Learning Genomic Sequence Motifs
View through CrossRef
AbstractThe first-layer filters employed in convolutional neural networks tend to learn, or extract, spatial features from the data. Within their application to genomic sequence data, these learned features are often visualized and interpreted by converting them to sequence logos; an information-based representation of the consensus nucleotide motif. The process to obtain such motifs, however, is done through post-training procedures which often discard the filter weights themselves and instead rely upon finding those sequences maximally correlated with the given filter. Moreover, the filters collectively learn motifs with high redundancy, often simply shifted representations of the same sequence. We propose a schema to learn sequence motifs directly through weight constraints and transformations such that the individual weights comprising the filter are directly interpretable as either position weight matrices (PWMs) or information gain matrices (IGMs). We additionally leverage regularization to encourage learning highly-representative motifs with low inter-filter redundancy. Through learning PWMs and IGMs directly we present preliminary results showcasing how our method is capable of incorporating previously-annotated database motifs along with learning motifs de novo and then outline a pipeline for how these tools may be used jointly in a data application.
Title: Interpretable Convolution Methods for Learning Genomic Sequence Motifs
Description:
AbstractThe first-layer filters employed in convolutional neural networks tend to learn, or extract, spatial features from the data.
Within their application to genomic sequence data, these learned features are often visualized and interpreted by converting them to sequence logos; an information-based representation of the consensus nucleotide motif.
The process to obtain such motifs, however, is done through post-training procedures which often discard the filter weights themselves and instead rely upon finding those sequences maximally correlated with the given filter.
Moreover, the filters collectively learn motifs with high redundancy, often simply shifted representations of the same sequence.
We propose a schema to learn sequence motifs directly through weight constraints and transformations such that the individual weights comprising the filter are directly interpretable as either position weight matrices (PWMs) or information gain matrices (IGMs).
We additionally leverage regularization to encourage learning highly-representative motifs with low inter-filter redundancy.
Through learning PWMs and IGMs directly we present preliminary results showcasing how our method is capable of incorporating previously-annotated database motifs along with learning motifs de novo and then outline a pipeline for how these tools may be used jointly in a data application.
Related Results
Predicting regional somatic mutation rates using DNA motifs
Predicting regional somatic mutation rates using DNA motifs
Abstract
How the locus-specificity of epigenetic modifications is regulated remains an unanswered question. A contributing mechanism is that epig...
Initial Experience with Pediatrics Online Learning for Nonclinical Medical Students During the COVID-19 Pandemic
Initial Experience with Pediatrics Online Learning for Nonclinical Medical Students During the COVID-19 Pandemic
Abstract
Background: To minimize the risk of infection during the COVID-19 pandemic, the learning mode of universities in China has been adjusted, and the online learning o...
Accuracy and computational efficiency of genomic selection with high-density SNP and whole-genome sequence data.
Accuracy and computational efficiency of genomic selection with high-density SNP and whole-genome sequence data.
Abstract
The prediction of complex or quantitative traits from single nucleotide polymorphism (SNP) genotypes has transformed livestock and plant breeding, and is also pl...
Role of RNA Motifs in RNA Interaction with Membrane Lipid Rafts: Implications for Therapeutic Applications of Exosomal RNAs
Role of RNA Motifs in RNA Interaction with Membrane Lipid Rafts: Implications for Therapeutic Applications of Exosomal RNAs
RNA motifs may promote interactions with exosomes (EXO-motifs) and lipid rafts (RAFT-motifs) that are enriched in exosomal membranes. These interactions can promote selective RNA l...
The quantum convolution product
The quantum convolution product
Abstract
In classical statistical mechanics, physical states (probability measures) are embedded in the Banach algebra of complex Borel measures on phase space, wher...
Abstract 1695: Genomic clustering tendency of transcription factors reflects phase-separated transcriptional condensates at cancer super-enhancers
Abstract 1695: Genomic clustering tendency of transcription factors reflects phase-separated transcriptional condensates at cancer super-enhancers
Abstract
Many transcription factors (TFs) have been shown to bind to super-enhancers, forming transcriptional condensates to activate transcription in various cellul...
Relationship Between Weight Correlation of the Convolution Kernels and the Optimal Architecture of CNN
Relationship Between Weight Correlation of the Convolution Kernels and the Optimal Architecture of CNN
Currently, deep learning has been one of the most popular research topics, and it has already been successfully applied in many fields such as image recognition, recommendation sys...
Enhancing Pangeo-Fish with HEALPix Convolution: Impact Evaluation and Benefits
Enhancing Pangeo-Fish with HEALPix Convolution: Impact Evaluation and Benefits
The Pangeo-Fish project processes biologging data to analyze fish movement and migration patterns.  While SciPy’s convolution methods are robust, they are not op...

