Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

IBD: An Interpretable Backdoor-Detection Method via Multivariate Interactions

View through CrossRef
Recent work has shown that deep neural networks are vulnerable to backdoor attacks. In comparison with the success of backdoor-attack methods, existing backdoor-defense methods face a lack of theoretical foundations and interpretable solutions. Most defense methods are based on experience with the characteristics of previous attacks, but fail to defend against new attacks. In this paper, we propose IBD, an interpretable backdoor-detection method via multivariate interactions. Using information theory techniques, IBD reveals how the backdoor works from the perspective of multivariate interactions of features. Based on the interpretable theorem, IBD enables defenders to detect backdoor models and poisoned examples without introducing additional information about the specific attack method. Experiments on widely used datasets and models show that IBD achieves a 78% increase in average in detection accuracy and an order-of-magnitude reduction in time cost compared with existing backdoor-detection methods.
Title: IBD: An Interpretable Backdoor-Detection Method via Multivariate Interactions
Description:
Recent work has shown that deep neural networks are vulnerable to backdoor attacks.
In comparison with the success of backdoor-attack methods, existing backdoor-defense methods face a lack of theoretical foundations and interpretable solutions.
Most defense methods are based on experience with the characteristics of previous attacks, but fail to defend against new attacks.
In this paper, we propose IBD, an interpretable backdoor-detection method via multivariate interactions.
Using information theory techniques, IBD reveals how the backdoor works from the perspective of multivariate interactions of features.
Based on the interpretable theorem, IBD enables defenders to detect backdoor models and poisoned examples without introducing additional information about the specific attack method.
Experiments on widely used datasets and models show that IBD achieves a 78% increase in average in detection accuracy and an order-of-magnitude reduction in time cost compared with existing backdoor-detection methods.

Related Results

Backdoor DNFs
Backdoor DNFs
We introduce backdoor DNFs, as a tool to measure the theoretical hardness of CNF formulas. Like backdoor sets and backdoor trees, backdoor DNFs are defined relative to a tractable ...
An Inflammatory Bowel Diseases Integrated Resources Portal (IBDIRP)
An Inflammatory Bowel Diseases Integrated Resources Portal (IBDIRP)
Abstract IBD, including ulcerative colitis and Crohn’s disease, is a chronic and debilitating gastrointestinal disorder that affects millions of people worldwide. Re...
CSP beyond tractable constraint languages
CSP beyond tractable constraint languages
AbstractThe constraint satisfaction problem (CSP) is among the most studied computational problems. While NP-hard, many tractable subproblems have been identified (Bulatov 2017, Zh...
P125 Ankylosing spondylitis can influence the outcome of inflammatory bowel disease
P125 Ankylosing spondylitis can influence the outcome of inflammatory bowel disease
Abstract Background Both inflammatory bowel disease (IBD) and ankylosing spondylitis (AS) are inflammatory diseases but there wa...
Sub-Band Backdoor Attack in Remote Sensing Imagery
Sub-Band Backdoor Attack in Remote Sensing Imagery
Remote sensing datasets usually have a wide range of spatial and spectral resolutions. They provide unique advantages in surveillance systems, and many government organizations use...
Towards Robust Dual-Trigger Physical Backdoor Attacks against Multi-Object Tracking
Towards Robust Dual-Trigger Physical Backdoor Attacks against Multi-Object Tracking
In recent years, backdoor attacks have posed a significant threat to the security of deep models. Attackers can induce erroneous behavior in victim models through carefully designe...
Serum neutrophil gelatinase associated lipocalin (NGAL) as a marker of activity in inflammatory bowel disease
Serum neutrophil gelatinase associated lipocalin (NGAL) as a marker of activity in inflammatory bowel disease
Abstract Inflammatory bowel disease (IBD) is a disease of activity and remission. Lipocalin 2 (LCN2), the coding gene for NGAL is one of the most over-expressed genes in th...

Back to Top