Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Consistent Epistemic Planning for Multiagent Deep Reinforcement Learning

View through CrossRef
Abstract Multi-agent cooperation needs to reason about beliefs in the partially observable environment without communication, but the traditional Multi-agent Deep Reinforcement Learning (MADRL) algorithm struggles to handle the uncertainty of agents. Multi-agent Epistemic planning (MEP) tries to let the agent find a best plan to complete the cooperation task, so as to more effectively solve the uncertainty. However, inconsistent planning arises if the MADRL only adds MEP. We propose a MADRL-based policy network architecture called SMM-MEPP: Shared Mental Model - Multi-agent Epistemic Planning Policy. Firstly, Multi-agent Epistemic Planning and MADRL are investigated to build the "Perception-Planning-Action" multi-agent epistemic planning framework. Then, mental model in psychology is introduced and descript as a neural network. Thirdly, parameter sharing mechanism is utilized to achieve the shared mental model and maintain the consistency of epistemic planning. Finally, we apply the SMM-MEPP architecture to three advanced MADRL algorithms (i.e., MAAC, MADDPG and MAPPO) and conduct comparative experiments in multi-agent cooperation tasks. Experiments show that the proposed method can bring consistent planning for multiple agents, and improves convergence speed or training effect in partially observable environment without communication.
Title: Consistent Epistemic Planning for Multiagent Deep Reinforcement Learning
Description:
Abstract Multi-agent cooperation needs to reason about beliefs in the partially observable environment without communication, but the traditional Multi-agent Deep Reinforcement Learning (MADRL) algorithm struggles to handle the uncertainty of agents.
Multi-agent Epistemic planning (MEP) tries to let the agent find a best plan to complete the cooperation task, so as to more effectively solve the uncertainty.
However, inconsistent planning arises if the MADRL only adds MEP.
We propose a MADRL-based policy network architecture called SMM-MEPP: Shared Mental Model - Multi-agent Epistemic Planning Policy.
Firstly, Multi-agent Epistemic Planning and MADRL are investigated to build the "Perception-Planning-Action" multi-agent epistemic planning framework.
Then, mental model in psychology is introduced and descript as a neural network.
Thirdly, parameter sharing mechanism is utilized to achieve the shared mental model and maintain the consistency of epistemic planning.
Finally, we apply the SMM-MEPP architecture to three advanced MADRL algorithms (i.
e.
, MAAC, MADDPG and MAPPO) and conduct comparative experiments in multi-agent cooperation tasks.
Experiments show that the proposed method can bring consistent planning for multiple agents, and improves convergence speed or training effect in partially observable environment without communication.

Related Results

Epistemic Injustice
Epistemic Injustice
The concept of epistemic injustice refers to the injustice that an individual suffers specifically in their capacity as a knower or epistemic agent – that is, as someone who produc...
An epistemic justice account of students’ experiences of feedback
An epistemic justice account of students’ experiences of feedback
I am a storyteller. I believe in the power of stories to share experiences and to elucidate thoughts and ideas and to help us to make sense of complex social practices. This thesis...
College Students’ Epistemic Cognition, Epistemic Emotion, and Engagement: A Mediation Analysis
College Students’ Epistemic Cognition, Epistemic Emotion, and Engagement: A Mediation Analysis
Abstract Background: The college students' engagement has attracted the attention of scholars from various countries because it can impact student’s learning performance, ...
Epistemic Injustice or Epistemic Oppression?
Epistemic Injustice or Epistemic Oppression?
The concepts of epistemic injustice and epistemic oppression both aim to track obstacles to epistemic agencyーi.e., forms of epistemic exclusionーthat are undue and persistent. Indee...
Epistemic Diversity and Deliberation
Epistemic Diversity and Deliberation
We live in uncertain times. In the midst of polarization, the rise of fake news and disinformation and with expert knowledge and scientific argumentation losing credibility in the ...
STRENGTH OF BUTT WELDED BUTT JOINT OF REINFORCEMENT OF CLASS A500C
STRENGTH OF BUTT WELDED BUTT JOINT OF REINFORCEMENT OF CLASS A500C
The paper presents the results of experimental studies of the strength of cross-shaped welded joints of types К1-Кт and К3-Рр [1] of thermomechanically hardened reinforcement of cl...
CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
The pandemic Covid-19 currently demands teachers to be able to use technology in teaching and learning process. But in reality there are still many teachers who have not been able ...
Temas Epistêmicos, não Epistêmicos no Ensino
Temas Epistêmicos, não Epistêmicos no Ensino
Resumo A Epistemologia da Ciência é um campo de estudo que permite analisar o desenvolvimento da ciência em uma postura dialética, que qualifica as questões internas à Ciência, rel...

Back to Top