ML-RID: Mutual Learning Enhanced Visible-Infrared Person Re-identification
Abstract
In computer vision, Person Re-Identification (ReID) is essential for surveillance: it matches images of individuals across different cameras and lighting conditions. Despite progress in visible-light scenarios using CNNs, existing methods struggle in low-light conditions, which motivates Visible-Infrared (VI) ReID. The challenge lies in bridging the semantic gap between the visible and infrared modalities, which arises from their distinct imaging properties. Existing methods are limited because they focus on complex feature extractors without fully utilizing intermediate representations to reduce cross-modality discrepancies. Infrared images lack color information, and differences in reflectivity between modalities hinder the extraction of robust features for identification. To alleviate these problems, we introduce ML-RID, which leverages Mutual Learning (ML) to enhance performance on the VI-ReID task. Specifically, we propose an Adaptive Feature Fusion Module (AFFM) that dynamically fuses visible and infrared features, compensating for semantic deficiencies and enhancing feature representation. Then, a Feature Projection Module (FPM) aligns predictions across modalities, while the Jensen-Shannon divergence is used to enforce prediction consistency and reliability, encouraging each branch to learn from the other modality’s strengths. A penalty term is proposed to maintain the diversity of the auxiliary modality, which aids knowledge transfer by acting as a “teacher”. Experiments on two datasets show that ML-RID outperforms current models, and visualizations verify its effectiveness, marking a step forward in cross-modality ReID.
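The abstract names the Jensen-Shannon divergence as the measure of prediction consistency between modalities but gives no formula or implementation. The following is a minimal sketch of that quantity only, not the authors' loss: the logits, the variable names, and the use of a plain softmax over identity classes are all assumptions made for illustration.

```python
import math


def softmax(logits):
    """Turn raw scores into a probability distribution (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q):
    """KL(p || q) in nats; terms with p_i == 0 contribute nothing."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0.0)


def js_divergence(p, q):
    """Jensen-Shannon divergence: symmetric, bounded above by ln(2)."""
    mix = [(pi + qi) / 2.0 for pi, qi in zip(p, q)]
    return 0.5 * kl_divergence(p, mix) + 0.5 * kl_divergence(q, mix)


# Hypothetical identity logits for the same person from each branch.
visible_logits = [2.0, 0.5, -1.0]
infrared_logits = [1.5, 0.7, -0.5]

p_visible = softmax(visible_logits)
p_infrared = softmax(infrared_logits)

# Small when the two branches agree, larger when they disagree.
consistency_loss = js_divergence(p_visible, p_infrared)
print(consistency_loss)
```

Unlike the plain KL divergence, this quantity is symmetric in its two arguments, so neither modality is treated as the fixed reference, which fits the mutual-learning setting the abstract describes. How ML-RID weights this term and combines it with the penalty term is not stated in the abstract.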


