Adversarial Learning Based Semantic Correlation Representation for Cross-Modal Retrieval
With the rapid development of the Internet and the wide use of smart devices, massive amounts of multimedia data are generated, collected, stored and shared online. This trend has made cross-modal retrieval a hot topic in recent years. Many existing works focus on correlation learning to generate a common subspace for cross-modal correlation measurement, while others use adversarial learning to reduce the heterogeneity of multi-modal data. However, very few works combine correlation learning and adversarial learning to bridge the inter-modal semantic gap and diminish cross-modal heterogeneity. This paper proposes a novel cross-modal retrieval method, named ALSCOR, an end-to-end framework that integrates cross-modal representation learning, correlation learning and adversarial learning. A CCA model, accompanied by two representation models, VisNet and TxtNet, is proposed to capture non-linear correlation. In addition, an intra-modal classifier and a modality classifier are used to learn intra-modal discrimination and minimize inter-modal heterogeneity. Comprehensive experiments are conducted on three benchmark datasets. The results demonstrate that the proposed ALSCOR outperforms state-of-the-art methods.
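The abstract names CCA as the component that measures correlation between the two modalities. As a rough illustration only, the sketch below computes plain linear canonical correlations between two feature matrices with NumPy. It assumes the image and text features have already been extracted; it does not reproduce VisNet, TxtNet, the classifiers, or the adversarial training of ALSCOR, and the function name `canonical_correlations` and the `reg` parameter are hypothetical choices made for this example.

```python
import numpy as np


def canonical_correlations(X, Y, reg=1e-6):
    """Linear CCA: canonical correlations between views X (n x p) and Y (n x q).

    Illustrative sketch; `reg` is a small ridge term (an assumption of this
    example) that keeps the covariance matrices invertible.
    """
    n = X.shape[0]
    # Center each view so covariances are computed around the mean.
    Xc = X - X.mean(axis=0)
    Yc = Y - Y.mean(axis=0)

    # Within-view and cross-view covariance estimates.
    Sxx = Xc.T @ Xc / (n - 1) + reg * np.eye(X.shape[1])
    Syy = Yc.T @ Yc / (n - 1) + reg * np.eye(Y.shape[1])
    Sxy = Xc.T @ Yc / (n - 1)

    def inv_sqrt(S):
        # Inverse matrix square root of a symmetric positive-definite matrix.
        w, V = np.linalg.eigh(S)
        return V @ np.diag(w ** -0.5) @ V.T

    # Singular values of the whitened cross-covariance are the
    # canonical correlations, returned in descending order.
    T = inv_sqrt(Sxx) @ Sxy @ inv_sqrt(Syy)
    return np.linalg.svd(T, compute_uv=False)
```

For two views that share a latent signal, the leading value returned is close to 1, while views with no shared structure give values near 0. A deep variant would replace `X` and `Y` with the outputs of learned networks, which is outside the scope of this sketch.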
Related Results
Cross-modal Retrieval based on Shared Proxies
Abstract
Inconsistency of distribution and representation across different data modalities makes measuring cross-modal similarities a very difficult problem. Learning a com...
A Semantic Orthogonal Mapping Method Through Deep-Learning for Semantic Computing
In order to realize an artificial intelligent system, a basic mechanism should be provided for expressing and processing the semantic. We have presented semantic computing models i...
The nature of automatic semantic retrieval in individuals with mild cognitive impairment
The number of people diagnosed with Alzheimer’s disease (AD), a progressive and terminal kind of dementia, continues to rise with an estimated 14 million Americans affected by 2050...
The Effect of Transcranial Random Noise Stimulation (tRNS) over Bilateral Parietal Cortex in Visual Cross-Modal Conflict
Background: Visual cross-modal conflict impairs performance in auditory working memory tasks, and the bilateral inferior parietal cortex (IPC) plays a pivotal role in inhibiting th...
Approaches to Different Learning Styles in Undergraduate Medical Students of Al-Tibri Medical College Karachi
Objectives: The purpose of this study was to evaluate the different styles of learning preferred by undergraduate medical students from 1st to 5th year of Al-Tibri Medical College ...
Modal Sosial Masyarakat Dusun Melayang dalam Pemanfaatan Buah Tengkawang di Hutan Adat Pikul
Abstract: Social capital is the ability of a community to work together to achieve a common goal within a group. Hutan Adat Pikul has tengkawang potential that is very...
Exploiting Wikipedia Semantics for Computing Word Associations
Semantic association computation is the process of automatically quantifying the strength of a semantic connection between two textual units based on various lexi...
Initial Experience with Pediatrics Online Learning for Nonclinical Medical Students During the COVID-19 Pandemic
Abstract
Background: To minimize the risk of infection during the COVID-19 pandemic, the learning mode of universities in China has been adjusted, and the online learning o...

