Adversarial Learning Based Semantic Correlation Representation for Cross-Modal Retrieval

With the rapid development of the Internet and the wide use of smart devices, massive amounts of multimedia data are generated, collected, stored and shared online. This trend has made cross-modal retrieval a hot topic in recent years. Many existing works focus on correlation learning to generate a common subspace for cross-modal correlation measurement, while others use adversarial learning to reduce the heterogeneity of multi-modal data. However, very few works combine correlation learning and adversarial learning to bridge the inter-modal semantic gap and diminish cross-modal heterogeneity. This paper proposes a novel cross-modal retrieval method, named ALSCOR, an end-to-end framework that integrates cross-modal representation learning, correlation learning and adversarial learning. A CCA model, accompanied by two representation models, VisNet and TxtNet, is proposed to capture non-linear correlation. In addition, an intra-modal classifier and a modality classifier are used to learn intra-modal discrimination and minimize inter-modal heterogeneity. Comprehensive experiments are conducted on three benchmark datasets. The results demonstrate that the proposed ALSCOR outperforms state-of-the-art methods.
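For readers unfamiliar with CCA, the following is a minimal sketch of classical linear CCA on two paired feature sets, which is the kind of correlation measurement in a common subspace that the abstract refers to. It is not the ALSCOR implementation: the function name `canonical_correlations`, the regularization term `reg`, and the synthetic data are all assumptions made for illustration, and the learned VisNet and TxtNet representations are replaced here by random features that share one latent factor.

```python
import numpy as np


def canonical_correlations(X, Y, reg=1e-6):
    """Canonical correlations between paired views X (n, dx) and Y (n, dy).

    Illustrative helper (hypothetical name); `reg` is a small ridge term
    added to the covariance matrices for numerical stability.
    """
    n = X.shape[0]
    Xc = X - X.mean(axis=0)  # center each view
    Yc = Y - Y.mean(axis=0)
    Sxx = Xc.T @ Xc / (n - 1) + reg * np.eye(X.shape[1])
    Syy = Yc.T @ Yc / (n - 1) + reg * np.eye(Y.shape[1])
    Sxy = Xc.T @ Yc / (n - 1)

    def inv_sqrt(S):
        # Inverse matrix square root of a symmetric positive-definite matrix.
        w, V = np.linalg.eigh(S)
        return V @ np.diag(w ** -0.5) @ V.T

    # Singular values of the whitened cross-covariance are the
    # canonical correlations, returned in descending order.
    T = inv_sqrt(Sxx) @ Sxy @ inv_sqrt(Syy)
    return np.linalg.svd(T, compute_uv=False)


# Synthetic stand-ins for image and text features sharing one latent factor.
rng = np.random.default_rng(0)
z = rng.normal(size=(500, 1))
img_feats = z @ np.ones((1, 4)) + 0.1 * rng.normal(size=(500, 4))
txt_feats = z @ np.ones((1, 3)) + 0.1 * rng.normal(size=(500, 3))

corrs = canonical_correlations(img_feats, txt_feats)
print(corrs)
```

In this toy setting the first canonical correlation is large because both views depend on the same latent variable, while the remaining ones reflect only noise. Deep variants replace the raw features with network outputs and train the networks to increase these correlations.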

MDPI AG

Lei Zhu, Jiayu Song, Xiangxiang Wei, Long Jun

2020
