Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Adversarial Learning Based Semantic Correlation Representation for Cross-Modal Retrieval

View through CrossRef
With the rapid development of Internet and the widely usage of smart devices, massive multimedia data are generated, collected, stored and shared on the Internet. This trend makes cross-modal retrieval problem become a hot issue in this years. Many existing works pay attentions on correlation learning to generate a common subspace for cross-modal correlation measurement, and others uses adversarial learning technique to abate the heterogeneity of multi-modal data. However, very few works combine correlation learning and adversarial learning to bridge the inter-modal semantic gap and diminish cross-modal heterogeneity. This paper propose a novel cross-modal retrieval method, named ALSCOR, which is an end-to-end framework to integrate cross-modal representation learning, correlation learning and adversarial. CCA model, accompanied by two representation model, VisNet and TxtNet is proposed to capture non-linear correlation. Beside, intra-modal classifier and modality classifier are used to learn intra-modal discrimination and minimize the inter-modal heterogeneity. Comprehensive experiments are conducted on three benchmark datasets. The results demonstrate that the proposed ALSCOR has better performance than the state-of-the-arts.
Title: Adversarial Learning Based Semantic Correlation Representation for Cross-Modal Retrieval
Description:
With the rapid development of Internet and the widely usage of smart devices, massive multimedia data are generated, collected, stored and shared on the Internet.
This trend makes cross-modal retrieval problem become a hot issue in this years.
Many existing works pay attentions on correlation learning to generate a common subspace for cross-modal correlation measurement, and others uses adversarial learning technique to abate the heterogeneity of multi-modal data.
However, very few works combine correlation learning and adversarial learning to bridge the inter-modal semantic gap and diminish cross-modal heterogeneity.
This paper propose a novel cross-modal retrieval method, named ALSCOR, which is an end-to-end framework to integrate cross-modal representation learning, correlation learning and adversarial.
CCA model, accompanied by two representation model, VisNet and TxtNet is proposed to capture non-linear correlation.
Beside, intra-modal classifier and modality classifier are used to learn intra-modal discrimination and minimize the inter-modal heterogeneity.
Comprehensive experiments are conducted on three benchmark datasets.
The results demonstrate that the proposed ALSCOR has better performance than the state-of-the-arts.

Related Results

Cross-modal Retrieval based on Shared Proxies
Cross-modal Retrieval based on Shared Proxies
Abstract Inconsistency of distribution and representation across different data modalities makes measuring cross-modal similarities a very difficult problem. Learning a com...
A Semantic Orthogonal Mapping Method Through Deep-Learning for Semantic Computing
A Semantic Orthogonal Mapping Method Through Deep-Learning for Semantic Computing
In order to realize an artificial intelligent system, a basic mechanism should be provided for expressing and processing the semantic. We have presented semantic computing models i...
Improving Diversity and Quality of Adversarial Examples in Adversarial Transformation Network
Improving Diversity and Quality of Adversarial Examples in Adversarial Transformation Network
Abstract This paper proposes a method to mitigate two major issues of Adversarial Transformation Networks (ATN) including the low diversity and the low quality of adversari...
The nature of automatic semantic retrieval in individuals with mild cognitive impairment
The nature of automatic semantic retrieval in individuals with mild cognitive impairment
The number of people diagnosed with Alzheimer’s disease (AD), a progressive and terminal kind of dementia, continues to rise with an estimated 14 million Americans affected by 2050...
Innovative Entity Semantic Retrieval and Reasoning Algorithm with Knowledge Graph Integration (KGI-ERR)
Innovative Entity Semantic Retrieval and Reasoning Algorithm with Knowledge Graph Integration (KGI-ERR)
The rapid growth of data in various domains has made entity semantic retrieval and reasoning increasingly crucial. Traditional retrieval methods often fail to capture the complex r...
The Effect of Transcranial Random Noise Stimulation (tRNS) over Bilateral Parietal Cortex in Visual Cross-Modal Conflict
The Effect of Transcranial Random Noise Stimulation (tRNS) over Bilateral Parietal Cortex in Visual Cross-Modal Conflict
Background: Visual cross-modal conflict impairs performance in auditory working memory tasks, and the bilateral inferior parietal cortex (IPC) plays a pivotal role in inhibiting th...
On determination of the part-of-speech affiliation of modal words
On determination of the part-of-speech affiliation of modal words
The subject of this research is the part-of-speech affiliation of modal words – a special group of lexemes that are irreplaceable, syncategorematic, and lack formal unity...
Adversarial Training and Robustness in Machine Learning Frameworks
Adversarial Training and Robustness in Machine Learning Frameworks
In the realm of machine learning, ensuring robustness against adversarial attacks is increasingly crucial. Adversarial training has emerged as a prominent strategy to fortify model...

Back to Top