Omni-IML: Towards Unified Image Manipulation Localization
Image manipulation can lead to misinterpretation of visual content, posing significant risks to information security. Image Manipulation Localization (IML) has thus received increasing attention. However, existing IML methods rely heavily on task-specific designs: they perform well only on one target image type and are close to random guessing on others, and even joint training on multiple image types causes significant performance degradation. This hinders deployment in real applications: maintenance costs rise notably, and misclassifying the image type leads to serious error accumulation. To this end, we propose Omni-IML, the first generalist model to unify diverse IML tasks. Specifically, Omni-IML achieves generalism by adopting a Modal Gate Encoder and a Dynamic Weight Decoder, which adaptively determine the optimal encoding modality and the optimal decoder filters for each sample. We additionally propose an Anomaly Enhancement module that enhances the features of tampered regions with box supervision and helps the generalist model extract common features across different IML tasks. We validate our approach on IML tasks across three major scenarios: natural images, document images, and face images. Without bells and whistles, Omni-IML achieves state-of-the-art performance on all three tasks with a single unified model, providing valuable strategies and insights for real-world application and future research in generalist image forensics. Our code will be publicly available.
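The per-sample adaptivity described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not specify the architecture, so the modality names (`visual`, `visual+frequency`), the hard arg-max gate, the softmax filter mixing, and all function names below are assumptions chosen only to show the general idea of selecting an encoding and blending decoder filters per sample.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def modal_gate(features_by_modality, gate_scores):
    """Hypothetical hard gate: keep the modality with the highest score."""
    best = max(gate_scores, key=gate_scores.get)
    return best, features_by_modality[best]

def dynamic_filter(filter_bank, mixing_logits):
    """Hypothetical dynamic weights: blend candidate filters per sample."""
    weights = softmax(mixing_logits)
    length = len(filter_bank[0])
    return [sum(w * f[i] for w, f in zip(weights, filter_bank))
            for i in range(length)]

def decode(features, filt):
    """Toy decoder: dot product squashed to a (0, 1) tamper score."""
    z = sum(x * w for x, w in zip(features, filt))
    return 1.0 / (1.0 + math.exp(-z))

# Made-up numbers for a single sample (assumed, not from the paper).
sample = {"visual": [0.2, 0.9, -0.1], "visual+frequency": [0.5, 0.4, 0.3]}
scores = {"visual": 0.1, "visual+frequency": 1.3}
name, feats = modal_gate(sample, scores)
filt = dynamic_filter([[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]], [0.0, 0.0])
score = decode(feats, filt)
print(name, score)
```

In a real model both the gate scores and the mixing logits would be predicted from the input image by learned layers; here they are fixed by hand to keep the sketch self-contained.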

