Javascript must be enabled to continue!

An Unsupervised Method Based on Unpaired Multimodality Data for Heterogeneous Face Recognition

Abstract Current deep learning methods for heterogeneous face recognition (HFR) rely on pairwise multimodal image data for training, but such data are difficult to collect. In this paper, we propose an unsupervised deep learning method based on unpaired multimodal image data. This method employs a variational autoencoder (VAE) and a discriminator from a generative adversarial network (GAN) to disentangle the given heterogeneous image data into domain-independent semantic features and domain-dependent style features. Specifically, the VAE utilizes its latent space to disentangle features and encode explicitly domain-independent semantic features that are used to match face images from different modalities. The discriminator is used to discriminate the domains of images generated by the VAE, which can improve the domain recognition ability of the VAE. Moreover, multiple-scale feature aggregation is incorporated into the encoder part of the VAE to make the domain-independent semantic features contain multiple-scale construction information. Experimental results obtained on three widely used face datasets are presented to demonstrate the effectiveness of the proposed method. Our code will be available on GitHub.

Springer Science and Business Media LLC

Tang Mingxin chen wei

2022

Title: An Unsupervised Method Based on Unpaired Multimodality Data for Heterogeneous Face Recognition

Description:

Abstract Current deep learning methods for heterogeneous face recognition (HFR) rely on pairwise multimodal image data for training, but such data are difficult to collect.

In this paper, we propose an unsupervised deep learning method based on unpaired multimodal image data.

This method employs a variational autoencoder (VAE) and a discriminator from a generative adversarial network (GAN) to disentangle the given heterogeneous image data into domain-independent semantic features and domain-dependent style features.

Specifically, the VAE utilizes its latent space to disentangle features and encode explicitly domain-independent semantic features that are used to match face images from different modalities.

The discriminator is used to discriminate the domains of images generated by the VAE, which can improve the domain recognition ability of the VAE.

Moreover, multiple-scale feature aggregation is incorporated into the encoder part of the VAE to make the domain-independent semantic features contain multiple-scale construction information.

Experimental results obtained on three widely used face datasets are presented to demonstrate the effectiveness of the proposed method.

Our code will be available on GitHub.

Back

Binocular displacement of binocularly unpaired parts of the stimulus was examined by means of the Poggendorff figure. The Poggendorff figure can be used to investigate displacement...

Video Indexing through Human Faces by Combined Deep Learning Neural Networks

This research aims to suggest an algorithm that uses the human face as a cue for detecting faces and recognition from input video. Face recognition has become popular because it ha...

3D Face Factorisation for Face Recognition Using Pattern Recognition Algorithms

Abstract The face is the preferable biometrics for person recognition or identification applications because person identifying by face is a human connate habit. In ...

De gevel – een intermediair element tussen buiten en binnen

This study is based on the fact that all people have a basic need for protection from other people (and animals) as well as from the elements (the exterior climate). People need a ...

Depth-aware salient object segmentation

Object segmentation is an important task which is widely employed in many computer vision applications such as object detection, tracking, recognition, and ret...

Face recognition methods analysis

Face Recognition is one of the most important issues in Image processing tasks. It is important because it uses for various purposes in real world such as Criminal detection or for...

DLUT: Decoupled Learning-Based Unsupervised Tracker

Unsupervised learning has shown immense potential in object tracking, where accurate classification and regression are crucial for unsupervised trackers. However, the classificatio...

Identifying Links Between Latent Memory and Speech Recognition Factors

Objectives: The link between memory ability and speech recognition accuracy is often examined by correlating summary measures of performance across various tasks, but i...

Email:
Password:

Email:

An Unsupervised Method Based on Unpaired Multimodality Data for Heterogeneous Face Recognition

Related Results