Emergent communication of multimodal deep generative models based on Metropolis-Hastings naming game

Deep generative models (DGMs) are increasingly employed in emergent communication systems. However, their application to multimodal data remains limited. This study proposes a novel model that combines a multimodal DGM with the Metropolis-Hastings (MH) naming game, enabling two agents to attend jointly to a shared subject and develop a common vocabulary. The model can handle multimodal data, even when modalities are missing. Integrating the MH naming game with multimodal variational autoencoders (VAEs) allows agents to form perceptual categories and exchange signs within multimodal contexts. Moreover, tuning the weight ratio to favor a modality that the model could learn and categorize more readily improved communication. Our evaluation of three multimodal approaches, mixture-of-experts (MoE), product-of-experts (PoE), and mixture-of-product-of-experts (MoPoE), suggests that the choice of approach affects the creation of latent spaces, the internal representations of agents. Our results from experiments with the MNIST + SVHN and Multimodal165 datasets indicate that combining the Gaussian mixture model (GMM), PoE multimodal VAE, and MH naming game substantially improved information sharing, knowledge formation, and data reconstruction.
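
The abstract names two mechanisms without spelling them out: PoE fusion of modality-specific encoders and the MH naming game. The Python sketch below illustrates both under stated assumptions and is not the authors' implementation. It assumes that each modality encoder outputs a diagonal Gaussian, that PoE fusion is the usual precision-weighted product of Gaussians, and that the listener accepts a proposed sign with probability min(1, P_listener(proposed) / P_listener(current)). The function names `poe_fuse`, `mh_acceptance`, and `mh_naming_step` are hypothetical.

```python
import numpy as np


def poe_fuse(means, variances):
    """Fuse diagonal-Gaussian experts with a product of experts.

    Assumption: each row of `means`/`variances` is one modality's Gaussian.
    The product of Gaussians has precision equal to the sum of the expert
    precisions and a precision-weighted mean.
    """
    means = np.asarray(means, dtype=float)
    precisions = 1.0 / np.asarray(variances, dtype=float)
    fused_var = 1.0 / precisions.sum(axis=0)
    fused_mean = fused_var * (precisions * means).sum(axis=0)
    return fused_mean, fused_var


def mh_acceptance(listener_probs, proposed_sign, current_sign):
    """Assumed MH acceptance probability, judged by the listener's own beliefs."""
    ratio = listener_probs[proposed_sign] / listener_probs[current_sign]
    return min(1.0, ratio)


def mh_naming_step(speaker_probs, listener_probs, current_sign, rng):
    """One exchange: the speaker proposes a sign, the listener accepts or rejects."""
    # The speaker samples a sign from its own distribution over signs.
    proposed = int(rng.choice(len(speaker_probs), p=speaker_probs))
    # The listener keeps its current sign unless the proposal is accepted.
    if rng.random() < mh_acceptance(listener_probs, proposed, current_sign):
        return proposed
    return current_sign


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Two hypothetical modality experts over a 2-dimensional latent space.
    mean, var = poe_fuse([[0.0, 1.0], [2.0, 1.0]], [[1.0, 0.5], [1.0, 0.5]])
    print("fused mean:", mean, "fused variance:", var)
    # Hypothetical sign distributions for one shared object.
    speaker = np.array([0.1, 0.7, 0.2])
    listener = np.array([0.3, 0.6, 0.1])
    print("listener sign after one step:", mh_naming_step(speaker, listener, 0, rng))
```

In this sketch the listener never needs access to the speaker's internal state: it sees only the proposed sign and evaluates it against its own distribution, which is why the naming game can be read as a decentralized sampling procedure.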

Frontiers Media SA

Nguyen Le Hoang, Tadahiro Taniguchi, Yoshinobu Hagiwara, Akira Taniguchi

Frontiers in Robotics and AI

2024

The public school Quest to Learn in New York City is a model school whose teaching methods rely on game-based learning, game design, and the game design process. I...

Imagined worldviews in John Lennon’s “Imagine”: a multimodal re-performance / Visões de mundo imaginadas no “Imagine” de John Lennon: uma re-performance multimodal

Abstract: This paper addresses the issue of multimodal re-performance, a concept developed by us, in view of the fact that the famous song “Imagine”, by John Lennon, was published ...

Metropolis-Hastings algorithm in joint-attention naming game: experimental semiotics study

We explore the emergence of symbols during interactions between individuals through an experimental semiotic study. Previous studies have investigated how humans organize symbol sy...

Game Theory in Business Ethics: Bad Ideology or Bad Press?

Solomon’s article and Binmore’s response exemplify a standard exchange between the game theorist and those critical of applying game theory to ethics. The critic of game theory lis...

Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)

BACKGROUND As of July 2020, a Web of Science search of “machine learning (ML)” nested within the search of “pharmacokinetics or pharmacodynamics” yielded over 100...

Abstract TP142: Cerebellar Transcranial Direct Stimulation to Augment Aphasia Therapy

Introduction: Previous studies indicate that anodal transcranial Direct Current Stimulation (A-tDCS) to left hemisphere or cathodal tDCS (C-tDCS) to right hemisphere mi...

AFR-BERT: Attention-based mechanism feature relevance fusion multimodal sentiment analysis model

Multimodal sentiment analysis is an essential task in natural language processing which refers to the fact that machines can analyze and recognize emotions through logical reasonin...

Konsep Perilaku Keputusan Pembelian Game Online

E-sports have been recognized as sports since 2020. The rise of technological development makes game applications increasingly diverse and competitive. Data shows that the numb...

Related Results