
Adversarial Learning Based Semantic Correlation Representation for Cross-Modal Retrieval

With the rapid development of the Internet and the wide use of smart devices, massive amounts of multimedia data are generated, collected, stored, and shared online. This trend has made cross-modal retrieval a hot research topic in recent years. Many existing works focus on correlation learning to generate a common subspace for cross-modal correlation measurement, while others use adversarial learning techniques to reduce the heterogeneity of multi-modal data. However, very few works combine correlation learning and adversarial learning to bridge the inter-modal semantic gap and diminish cross-modal heterogeneity. This paper proposes a novel cross-modal retrieval method, named ALSCOR, an end-to-end framework that integrates cross-modal representation learning, correlation learning, and adversarial learning. A CCA model, accompanied by two representation models, VisNet and TxtNet, is proposed to capture non-linear correlation. In addition, an intra-modal classifier and a modality classifier are used to learn intra-modal discrimination and minimize inter-modal heterogeneity. Comprehensive experiments are conducted on three benchmark datasets. The results demonstrate that the proposed ALSCOR outperforms state-of-the-art methods.
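The abstract names CCA as the correlation component but does not give the objective, so the following is only a minimal sketch of what a CCA-style correlation score measures, written with NumPy on synthetic data. It uses plain linear CCA, not the authors' model: VisNet, TxtNet, and the classifiers are not modeled, and the function name `total_canonical_correlation`, the regularizer `reg`, and the toy feature arrays are all assumptions made for illustration.

```python
import numpy as np


def total_canonical_correlation(x, y, reg=1e-4):
    """Sum of canonical correlations between two views (rows are paired samples).

    Hypothetical helper for illustration; not taken from the paper.
    """
    n = x.shape[0]
    # Center each view so covariances are computed around the mean.
    xc = x - x.mean(axis=0)
    yc = y - y.mean(axis=0)
    # Regularized within-view covariances and the cross-view covariance.
    sxx = xc.T @ xc / (n - 1) + reg * np.eye(x.shape[1])
    syy = yc.T @ yc / (n - 1) + reg * np.eye(y.shape[1])
    sxy = xc.T @ yc / (n - 1)

    def inv_sqrt(m):
        # Inverse matrix square root of a symmetric positive-definite matrix.
        w, v = np.linalg.eigh(m)
        return v @ np.diag(w ** -0.5) @ v.T

    # Singular values of the whitened cross-covariance are the canonical correlations.
    t = inv_sqrt(sxx) @ sxy @ inv_sqrt(syy)
    return np.linalg.svd(t, compute_uv=False).sum()


# Synthetic stand-ins for image and text features (assumed, not real data).
rng = np.random.default_rng(0)
shared = rng.normal(size=(500, 2))  # latent semantics common to both modalities
img_feat = shared @ rng.normal(size=(2, 4)) + 0.1 * rng.normal(size=(500, 4))
txt_feat = shared @ rng.normal(size=(2, 3)) + 0.1 * rng.normal(size=(500, 3))
noise = rng.normal(size=(500, 3))  # "text" features unrelated to the images

corr_paired = total_canonical_correlation(img_feat, txt_feat)
corr_random = total_canonical_correlation(img_feat, noise)
print(corr_paired, corr_random)
```

In this toy setting the paired features should score noticeably higher than the unrelated ones, which is the property a correlation-learning objective would try to maximize over learned representations.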

MDPI AG

Lei Zhu, Jiayu Song, Xiangxiang Wei, Long Jun

2020



Abstract Inconsistency of distribution and representation across different data modalities makes measuring cross-modal similarities a very difficult problem. Learning a com...

A Semantic Orthogonal Mapping Method Through Deep-Learning for Semantic Computing

In order to realize an artificial intelligent system, a basic mechanism should be provided for expressing and processing the semantic. We have presented semantic computing models i...

The nature of automatic semantic retrieval in individuals with mild cognitive impairment

The number of people diagnosed with Alzheimer’s disease (AD), a progressive and terminal kind of dementia, continues to rise with an estimated 14 million Americans affected by 2050...

Improving Diversity and Quality of Adversarial Examples in Adversarial Transformation Network

Abstract This paper proposes a method to mitigate two major issues of Adversarial Transformation Networks (ATN) including the low diversity and the low quality of adversari...

Innovative Entity Semantic Retrieval and Reasoning Algorithm with Knowledge Graph Integration (KGI-ERR)

The rapid growth of data in various domains has made entity semantic retrieval and reasoning increasingly crucial. Traditional retrieval methods often fail to capture the complex r...

The Effect of Transcranial Random Noise Stimulation (tRNS) over Bilateral Parietal Cortex in Visual Cross-Modal Conflict

Background: Visual cross-modal conflict impairs performance in auditory working memory tasks, and the bilateral inferior parietal cortex (IPC) plays a pivotal role in inhibiting th...

On determination of the part-of-speech affiliation of modal words

The subject of this research is the part-of-speech affiliation of modal words – a special group of lexemes that are irreplaceable, syncategorematic, and lack formal unity...

Adversarial Training and Robustness in Machine Learning Frameworks

In the realm of machine learning, ensuring robustness against adversarial attacks is increasingly crucial. Adversarial training has emerged as a prominent strategy to fortify model...


Related Results