Javascript must be enabled to continue!

Cross-modal Retrieval based on Shared Proxies

Abstract Inconsistency of distribution and representation across different data modalities makes measuring cross-modal similarities a very difficult problem. Learning a common space that is semantically discriminative and modality invariant is the main challenge in cross-modal retrieval. Existing solutions usually employ pairwise or triplet data relationships to learn the common space, which can only capture the data similarity locally but would be unable to effectively characterize the global geometry of the common embedding space, and thus would limit the performance of cross-modal retrieval. In this paper, we introduce a shared proxy solution to cross-modal retrieval. We propose to incorporate the principles of shared proxy with neighbourhood component analysis to learn a common space for different modalities in which the distance between a sample’s representation and its corresponding proxy is minimized while the distances between a sample’s representation and the proxies not belonging to the sample are maximized. We propose the Cross-mOdal proXy learnIng (COXI) framework which integrates a cross-modal shared proxy loss, a discriminative loss and a modality invariant loss for supervised cross-modal retrieval. Extensive experiments on benchmark datasets clearly shows that COXI outperforms state of the art cross-modal retrieval techniques. Code is available on https://github.com/LigangZheng/COXI.

Research Square Platform LLC

Yuxin Wei Ligang Zheng Guoping Qiu Guocan Cai

2023

Title: Cross-modal Retrieval based on Shared Proxies

Description:

Abstract Inconsistency of distribution and representation across different data modalities makes measuring cross-modal similarities a very difficult problem.

Learning a common space that is semantically discriminative and modality invariant is the main challenge in cross-modal retrieval.

Existing solutions usually employ pairwise or triplet data relationships to learn the common space, which can only capture the data similarity locally but would be unable to effectively characterize the global geometry of the common embedding space, and thus would limit the performance of cross-modal retrieval.

In this paper, we introduce a shared proxy solution to cross-modal retrieval.

We propose to incorporate the principles of shared proxy with neighbourhood component analysis to learn a common space for different modalities in which the distance between a sample’s representation and its corresponding proxy is minimized while the distances between a sample’s representation and the proxies not belonging to the sample are maximized.

We propose the Cross-mOdal proXy learnIng (COXI) framework which integrates a cross-modal shared proxy loss, a discriminative loss and a modality invariant loss for supervised cross-modal retrieval.

Extensive experiments on benchmark datasets clearly shows that COXI outperforms state of the art cross-modal retrieval techniques.

Code is available on https://github.

com/LigangZheng/COXI.

Back

Modal kerja merupakan suatu kekayaan yang digunakan untuk membelanjai perusahaan sehari-hari. Modal kerja biasanya berbentuk uang kas, piutang, persediaan barang yang kesemuanya it...

Kontribusi Modal Sosial dalam Mengefektifkan Modal Lingkungan (Kasus Komunitas Kampung Nelayan Untia Makassar)

AbstractThe Untia fishing village community was formed from the relocation of the residents of Laelae Island in 1998. The community that was built from the results of relocation ha...

Sum things are not what they seem: Problems with the interpretation and analysis of radiocarbon-date proxies

Radiocarbon-date proxies are widely used in studies exploring long-term variation in human and environmental phenomena. Examined phenomena include, for example, variation in past h...

Unconventional Method of Subsea Umbilical Retrieval Using Anchor Handling Vessel

Abstract A deepwater field in West Africa was decommissioned and subsea facilities retrieval operation was carried out as part of the Abandonment and Decommissioning...

Adversarial Learning Based Semantic Correlation Representation for Cross-Modal Retrieval

With the rapid development of Internet and the widely usage of smart devices, massive multimedia data are generated, collected, stored and shared on the Internet. This trend makes ...

The influence of timing of oocytes retrieval and embryo transfer on the IVF-ET outcomes in patients having bilateral salpingectomy due to bilateral hydrosalpinx

ObjectiveThe objective of the study was to investigate whether the sequence of oocyte retrieval and salpingectomy for hydrosalpinx affects pregnancy outcomes of in vitro fertilizat...

EXPLORING THE USE OF MODAL AUXILIARY VERBS IN CORPUS OF CONTEMPORARY OF AMERISCAN ENGLISH (COCA)

Abstract. This paper deals with the frequent use of modal auxiliary verbs in Corpus of Contemporary of American English (COCA). The modal auxiliary modal verbs mentioned as the dat...

Arus Modal dan Integrasi Pasar Modal Internasional

Telaah literatur ini bertujuan untuk memberikan pemahaman tentang Pasar Modal Internasional, arus modal dan intergari pasar modal internasional. Semakin terbukanya perekonomian dun...

Email:
Password:

Email:

Cross-modal Retrieval based on Shared Proxies

Related Results