Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Staged Feature Mapping Optimization Learning for Visible-Infrared Person Re-identification

View through CrossRef
Abstract Visible-infrared person re-identification (VI-ReID) is a significant and intricate endeavor in specific person retrieval, requiring the fusion of distinct features observed in visible and infrared modalities. To address the limitations of current methods, which predominantly use simple Convolutional Neural Network (CNN) structures as the backbone, leading to spatial information loss during training and complicating cross-modal feature alignment, we propose a novel approach using Swin-TransformerV2 as the backbone and staged feature mapping optimization learning for VI-ReID. Firstly, we introduce a new Ratio Center Difference Loss (RCD) to address the scattering of positive samples from different modalities in feature space, and we devise a Cross-modal Intra-class Denoising Loss (CID) which dynamically calculates the average distance between positive and negative samples to strengthen the differences between classes and adjust the feature space in different stages. Additionally, to accommodate the latest backbone models during the training phase, we design a Staged Modality-shared Loss Scheduler (SMS). Finally, our method introduces Channel Hybrid Filling Module (CHF), which enriches datasets and mitigates low-level modal discrepancies. After conducting numerous experiments on the SYSU-MM01 and RegDB datasets, it has been proven that our proposed method surpasses the current forefront methods in visible-infrared person re-identification.
Springer Science and Business Media LLC
Title: Staged Feature Mapping Optimization Learning for Visible-Infrared Person Re-identification
Description:
Abstract Visible-infrared person re-identification (VI-ReID) is a significant and intricate endeavor in specific person retrieval, requiring the fusion of distinct features observed in visible and infrared modalities.
To address the limitations of current methods, which predominantly use simple Convolutional Neural Network (CNN) structures as the backbone, leading to spatial information loss during training and complicating cross-modal feature alignment, we propose a novel approach using Swin-TransformerV2 as the backbone and staged feature mapping optimization learning for VI-ReID.
Firstly, we introduce a new Ratio Center Difference Loss (RCD) to address the scattering of positive samples from different modalities in feature space, and we devise a Cross-modal Intra-class Denoising Loss (CID) which dynamically calculates the average distance between positive and negative samples to strengthen the differences between classes and adjust the feature space in different stages.
Additionally, to accommodate the latest backbone models during the training phase, we design a Staged Modality-shared Loss Scheduler (SMS).
Finally, our method introduces Channel Hybrid Filling Module (CHF), which enriches datasets and mitigates low-level modal discrepancies.
After conducting numerous experiments on the SYSU-MM01 and RegDB datasets, it has been proven that our proposed method surpasses the current forefront methods in visible-infrared person re-identification.

Related Results

ML-RID: Mutual Learning Enhanced Visible-Infrared Person Re-identification
ML-RID: Mutual Learning Enhanced Visible-Infrared Person Re-identification
Abstract In computer vision, Person Re-Identification (ReID) is essential for surveillance, matching images of individuals across different cameras and lighting conditions....
Meta-HGNet: Meta-Heterogeneous Generalized Network for Visible Infrared Person Re-identification
Meta-HGNet: Meta-Heterogeneous Generalized Network for Visible Infrared Person Re-identification
Abstract Visible infrared person re-identification plays an important role in heterogeneous modality image person identity matching, however, significant modality differenc...
Solution-processed quantum dot infrared lasers
Solution-processed quantum dot infrared lasers
(English) Colloidal semiconductors quantum dots (CQDs) have emerged as a promising solutionprocessed gain material that can be engineered via low-cost and scalable chemical techniq...
Comparison of LA and PVC mapping using OCTARAY and OPTRELL catheters
Comparison of LA and PVC mapping using OCTARAY and OPTRELL catheters
AbstractBackgroundMultielectrode mapping catheters, such as the OCTARAY and OPTRELL, are essential in creating myocardial electroanatomical mapping in arrhythmias. The OCTARAY is a...
Contour- and Texture-based analysis for victim identification in forensic odontology
Contour- and Texture-based analysis for victim identification in forensic odontology
PurposeForensic dentistry is the application of dentistry in legal proceedings that arise from any facts relating to teeth. The ultimate goal of forensic odontology is to identify ...
Staged Versus Simultaneous Surgery for Adult Spinal Deformity
Staged Versus Simultaneous Surgery for Adult Spinal Deformity
Study Design. Systematic review and meta-analysis. Objective. To assess the safety and efficacy of staged versus same-day ...
Initial Experience with Pediatrics Online Learning for Nonclinical Medical Students During the COVID-19 Pandemic 
Initial Experience with Pediatrics Online Learning for Nonclinical Medical Students During the COVID-19 Pandemic 
Abstract Background: To minimize the risk of infection during the COVID-19 pandemic, the learning mode of universities in China has been adjusted, and the online learning o...
Optimising tool wear and workpiece condition monitoring via cyber-physical systems for smart manufacturing
Optimising tool wear and workpiece condition monitoring via cyber-physical systems for smart manufacturing
Smart manufacturing has been developed since the introduction of Industry 4.0. It consists of resource sharing and networking, predictive engineering, and material and data analyti...

Back to Top