Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Emergent communication of multimodal deep generative models based on Metropolis-Hastings naming game

View through CrossRef
Deep generative models (DGM) are increasingly employed in emergent communication systems. However, their application in multimodal data contexts is limited. This study proposes a novel model that combines multimodal DGM with the Metropolis-Hastings (MH) naming game, enabling two agents to focus jointly on a shared subject and develop common vocabularies. The model proves that it can handle multimodal data, even in cases of missing modalities. Integrating the MH naming game with multimodal variational autoencoders (VAE) allows agents to form perceptual categories and exchange signs within multimodal contexts. Moreover, fine-tuning the weight ratio to favor a modality that the model could learn and categorize more readily improved communication. Our evaluation of three multimodal approaches - mixture-of-experts (MoE), product-of-experts (PoE), and mixture-of-product-of-experts (MoPoE)–suggests an impact on the creation of latent spaces, the internal representations of agents. Our results from experiments with the MNIST + SVHN and Multimodal165 datasets indicate that combining the Gaussian mixture model (GMM), PoE multimodal VAE, and MH naming game substantially improved information sharing, knowledge formation, and data reconstruction.
Title: Emergent communication of multimodal deep generative models based on Metropolis-Hastings naming game
Description:
Deep generative models (DGM) are increasingly employed in emergent communication systems.
However, their application in multimodal data contexts is limited.
This study proposes a novel model that combines multimodal DGM with the Metropolis-Hastings (MH) naming game, enabling two agents to focus jointly on a shared subject and develop common vocabularies.
The model proves that it can handle multimodal data, even in cases of missing modalities.
Integrating the MH naming game with multimodal variational autoencoders (VAE) allows agents to form perceptual categories and exchange signs within multimodal contexts.
Moreover, fine-tuning the weight ratio to favor a modality that the model could learn and categorize more readily improved communication.
Our evaluation of three multimodal approaches - mixture-of-experts (MoE), product-of-experts (PoE), and mixture-of-product-of-experts (MoPoE)–suggests an impact on the creation of latent spaces, the internal representations of agents.
Our results from experiments with the MNIST + SVHN and Multimodal165 datasets indicate that combining the Gaussian mixture model (GMM), PoE multimodal VAE, and MH naming game substantially improved information sharing, knowledge formation, and data reconstruction.

Related Results

Schule und Spiel – mehr als reine Wissensvermittlung
Schule und Spiel – mehr als reine Wissensvermittlung
Die öffentliche Schule Quest to learn in New York City ist eine Modell-Schule, die in ihren Lehrmethoden auf spielbasiertes Lernen, Game Design und den Game Design Prozess setzt. I...
Metropolis-Hastings algorithm in joint-attention naming game: experimental semiotics study
Metropolis-Hastings algorithm in joint-attention naming game: experimental semiotics study
We explore the emergence of symbols during interactions between individuals through an experimental semiotic study. Previous studies have investigated how humans organize symbol sy...
Game Theory in Business Ethics: Bad Ideology or Bad Press?
Game Theory in Business Ethics: Bad Ideology or Bad Press?
Solomon’s article and Binmore’s response exemplify a standard exchange between the game theorist and those critical of applying game theory to ethics. The critic of game theory lis...
Abstract TP142: Cerebellar Transcranial Direct Stimulation to Augment Aphasia Therapy
Abstract TP142: Cerebellar Transcranial Direct Stimulation to Augment Aphasia Therapy
Introduction: Previous studies indicate that anodal transcranial Direct Current Stimulation (A-tDCS) to left hemisphere or cathodal tDCS (C-tDCS) to right hemisphere mi...
AFR-BERT: Attention-based mechanism feature relevance fusion multimodal sentiment analysis model
AFR-BERT: Attention-based mechanism feature relevance fusion multimodal sentiment analysis model
Multimodal sentiment analysis is an essential task in natural language processing which refers to the fact that machines can analyze and recognize emotions through logical reasonin...
“Un estudio multimodal y dinámico de los conocimientos numéricos de estudiantes de primer grado”
“Un estudio multimodal y dinámico de los conocimientos numéricos de estudiantes de primer grado”
En esta tesis profundizamos el estudio de la cognición y comunicación numérica de niños y niñas de primeros grados de la escuela primaria en la zona andina rionegrina. Desde un enf...
Konsep Perilaku Keputusan Pembelian Game Online
Konsep Perilaku Keputusan Pembelian Game Online
E-Sport sports have been recognized as sports since 2020. The rise of technological developments makes game applications more very diverse and competitive. Data shows that the numb...

Back to Top