Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

An Unsupervised Method Based on Unpaired Multimodality Data for Heterogeneous Face Recognition

View through CrossRef
Abstract Current deep learning methods for heterogeneous face recognition (HFR) rely on pairwise multimodal image data for training, but such data are difficult to collect. In this paper, we propose an unsupervised deep learning method based on unpaired multimodal image data. This method employs a variational autoencoder (VAE) and a discriminator from a generative adversarial network (GAN) to disentangle the given heterogeneous image data into domain-independent semantic features and domain-dependent style features. Specifically, the VAE utilizes its latent space to disentangle features and encode explicitly domain-independent semantic features that are used to match face images from different modalities. The discriminator is used to discriminate the domains of images generated by the VAE, which can improve the domain recognition ability of the VAE. Moreover, multiple-scale feature aggregation is incorporated into the encoder part of the VAE to make the domain-independent semantic features contain multiple-scale construction information. Experimental results obtained on three widely used face datasets are presented to demonstrate the effectiveness of the proposed method. Our code will be available on GitHub.
Springer Science and Business Media LLC
Title: An Unsupervised Method Based on Unpaired Multimodality Data for Heterogeneous Face Recognition
Description:
Abstract Current deep learning methods for heterogeneous face recognition (HFR) rely on pairwise multimodal image data for training, but such data are difficult to collect.
In this paper, we propose an unsupervised deep learning method based on unpaired multimodal image data.
This method employs a variational autoencoder (VAE) and a discriminator from a generative adversarial network (GAN) to disentangle the given heterogeneous image data into domain-independent semantic features and domain-dependent style features.
Specifically, the VAE utilizes its latent space to disentangle features and encode explicitly domain-independent semantic features that are used to match face images from different modalities.
The discriminator is used to discriminate the domains of images generated by the VAE, which can improve the domain recognition ability of the VAE.
Moreover, multiple-scale feature aggregation is incorporated into the encoder part of the VAE to make the domain-independent semantic features contain multiple-scale construction information.
Experimental results obtained on three widely used face datasets are presented to demonstrate the effectiveness of the proposed method.
Our code will be available on GitHub.

Related Results

Binocular Displacement of Unpaired Region
Binocular Displacement of Unpaired Region
Binocular displacement of binocularly unpaired parts of the stimulus was examined by means of the Poggendorff figure. The Poggendorff figure can be used to investigate displacement...
Video Indexing through Human Faces by Combined Deep Learning Neural Networks
Video Indexing through Human Faces by Combined Deep Learning Neural Networks
This research aims to suggest an algorithm that uses the human face as a cue for detecting faces and recognition from input video. Face recognition has become popular because it ha...
3D Face Factorisation for Face Recognition Using Pattern Recognition Algorithms
3D Face Factorisation for Face Recognition Using Pattern Recognition Algorithms
Abstract The face is the preferable biometrics for person recognition or identification applications because person identifying by face is a human connate habit. In ...
Depth-aware salient object segmentation
Depth-aware salient object segmentation
Object segmentation is an important task which is widely employed in many computer vision applications such as object detection, tracking, recognition, and ret...
Face recognition methods analysis
Face recognition methods analysis
Face Recognition is one of the most important issues in Image processing tasks. It is important because it uses for various purposes in real world such as Criminal detection or for...
DLUT: Decoupled Learning-Based Unsupervised Tracker
DLUT: Decoupled Learning-Based Unsupervised Tracker
Unsupervised learning has shown immense potential in object tracking, where accurate classification and regression are crucial for unsupervised trackers. However, the classificatio...
Identifying Links Between Latent Memory and Speech Recognition Factors
Identifying Links Between Latent Memory and Speech Recognition Factors
Objectives: The link between memory ability and speech recognition accuracy is often examined by correlating summary measures of performance across various tasks, but i...
A face recognition algorithm based on the combine of image feature compensation and improved PSO
A face recognition algorithm based on the combine of image feature compensation and improved PSO
AbstractFace recognition systems have been widely applied in various scenarios in people's daily lives. The recognition rate and speed of face recognition systems have always been ...

Back to Top