Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Cross-modal Retrieval based on Shared Proxies

View through CrossRef
Abstract Inconsistency of distribution and representation across different data modalities makes measuring cross-modal similarities a very difficult problem. Learning a common space that is semantically discriminative and modality invariant is the main challenge in cross-modal retrieval. Existing solutions usually employ pairwise or triplet data relationships to learn the common space, which can only capture the data similarity locally but would be unable to effectively characterize the global geometry of the common embedding space, and thus would limit the performance of cross-modal retrieval. In this paper, we introduce a shared proxy solution to cross-modal retrieval. We propose to incorporate the principles of shared proxy with neighbourhood component analysis to learn a common space for different modalities in which the distance between a sample’s representation and its corresponding proxy is minimized while the distances between a sample’s representation and the proxies not belonging to the sample are maximized. We propose the Cross-mOdal proXy learnIng (COXI) framework which integrates a cross-modal shared proxy loss, a discriminative loss and a modality invariant loss for supervised cross-modal retrieval. Extensive experiments on benchmark datasets clearly shows that COXI outperforms state of the art cross-modal retrieval techniques. Code is available on https://github.com/LigangZheng/COXI.
Title: Cross-modal Retrieval based on Shared Proxies
Description:
Abstract Inconsistency of distribution and representation across different data modalities makes measuring cross-modal similarities a very difficult problem.
Learning a common space that is semantically discriminative and modality invariant is the main challenge in cross-modal retrieval.
Existing solutions usually employ pairwise or triplet data relationships to learn the common space, which can only capture the data similarity locally but would be unable to effectively characterize the global geometry of the common embedding space, and thus would limit the performance of cross-modal retrieval.
In this paper, we introduce a shared proxy solution to cross-modal retrieval.
We propose to incorporate the principles of shared proxy with neighbourhood component analysis to learn a common space for different modalities in which the distance between a sample’s representation and its corresponding proxy is minimized while the distances between a sample’s representation and the proxies not belonging to the sample are maximized.
We propose the Cross-mOdal proXy learnIng (COXI) framework which integrates a cross-modal shared proxy loss, a discriminative loss and a modality invariant loss for supervised cross-modal retrieval.
Extensive experiments on benchmark datasets clearly shows that COXI outperforms state of the art cross-modal retrieval techniques.
Code is available on https://github.
com/LigangZheng/COXI.

Related Results

ANALISIS MODAL KERJA PADA KOPERASI SERBA USAHA DI KOTA METRO
ANALISIS MODAL KERJA PADA KOPERASI SERBA USAHA DI KOTA METRO
Modal kerja merupakan suatu kekayaan yang digunakan untuk membelanjai perusahaan sehari-hari. Modal kerja biasanya berbentuk uang kas, piutang, persediaan barang yang kesemuanya it...
Kontribusi Modal Sosial dalam Mengefektifkan Modal Lingkungan (Kasus Komunitas Kampung Nelayan Untia Makassar)
Kontribusi Modal Sosial dalam Mengefektifkan Modal Lingkungan (Kasus Komunitas Kampung Nelayan Untia Makassar)
AbstractThe Untia fishing village community was formed from the relocation of the residents of Laelae Island in 1998. The community that was built from the results of relocation ha...
Sum things are not what they seem: Problems with the interpretation and analysis of radiocarbon-date proxies
Sum things are not what they seem: Problems with the interpretation and analysis of radiocarbon-date proxies
Radiocarbon-date proxies are widely used in studies exploring long-term variation in human and environmental phenomena. Examined phenomena include, for example, variation in past h...
Unconventional Method of Subsea Umbilical Retrieval Using Anchor Handling Vessel
Unconventional Method of Subsea Umbilical Retrieval Using Anchor Handling Vessel
Abstract A deepwater field in West Africa was decommissioned and subsea facilities retrieval operation was carried out as part of the Abandonment and Decommissioning...
Adversarial Learning Based Semantic Correlation Representation for Cross-Modal Retrieval
Adversarial Learning Based Semantic Correlation Representation for Cross-Modal Retrieval
With the rapid development of Internet and the widely usage of smart devices, massive multimedia data are generated, collected, stored and shared on the Internet. This trend makes ...
EXPLORING THE USE OF MODAL AUXILIARY VERBS IN CORPUS OF CONTEMPORARY OF AMERISCAN ENGLISH (COCA)
EXPLORING THE USE OF MODAL AUXILIARY VERBS IN CORPUS OF CONTEMPORARY OF AMERISCAN ENGLISH (COCA)
Abstract. This paper deals with the frequent use of modal auxiliary verbs in Corpus of Contemporary of American English (COCA). The modal auxiliary modal verbs mentioned as the dat...
Arus Modal dan Integrasi Pasar Modal Internasional
Arus Modal dan Integrasi Pasar Modal Internasional
Telaah literatur ini bertujuan untuk memberikan pemahaman tentang Pasar Modal Internasional, arus modal dan intergari pasar modal internasional. Semakin terbukanya perekonomian dun...

Back to Top