Javascript must be enabled to continue!
A Semi Centralized Training Decentralized Execution Architecture for Multi-Agent Deep Reinforcement Learning in Traffic Signal Control
View through CrossRef
Multi-agent reinforcement learning (MARL) has emerged as a promising paradigm for adaptive traffic signal control (ATSC) of multiple intersections. Existing approaches typically follow either a fully centralized or a fully decentralized design. Fully centralized approaches suffer from the curse of dimensionality, and reliance on a single learning server, whereas purely decentralized approaches operate under severe partial observability and lack explicit coordination resulting in suboptimal performance. These limitations motivate region-based MARL, where the network is partitioned into smaller, tightly Interdependent intersections that form regions, and training is organized around these regions. This paper introduces a Semi-Centralized Training, Decentralized Execution (SEMI-CTDE) architecture for multi intersection ATSC. Within each region, SEMI-CTDE performs centralized training with regional parameter sharing and employs composite state and reward formulations that jointly encode local and regional information. The architecture is highly transferable across different policy backbones and state–reward instantiations. Building on this architecture, we implement two models with distinct design objectives. A multi-perspective experimental analysis of the two implemented SEMI-CTDE-based models covering ablations of the architecture's core elements including rule based and fully decentralized baselines shows that they achieve consistently superior performance and remain effective across a wide range of traffic densities and distributions.
Title: A Semi Centralized Training Decentralized Execution Architecture for Multi-Agent Deep Reinforcement Learning in Traffic Signal Control
Description:
Multi-agent reinforcement learning (MARL) has emerged as a promising paradigm for adaptive traffic signal control (ATSC) of multiple intersections.
Existing approaches typically follow either a fully centralized or a fully decentralized design.
Fully centralized approaches suffer from the curse of dimensionality, and reliance on a single learning server, whereas purely decentralized approaches operate under severe partial observability and lack explicit coordination resulting in suboptimal performance.
These limitations motivate region-based MARL, where the network is partitioned into smaller, tightly Interdependent intersections that form regions, and training is organized around these regions.
This paper introduces a Semi-Centralized Training, Decentralized Execution (SEMI-CTDE) architecture for multi intersection ATSC.
Within each region, SEMI-CTDE performs centralized training with regional parameter sharing and employs composite state and reward formulations that jointly encode local and regional information.
The architecture is highly transferable across different policy backbones and state–reward instantiations.
Building on this architecture, we implement two models with distinct design objectives.
A multi-perspective experimental analysis of the two implemented SEMI-CTDE-based models covering ablations of the architecture's core elements including rule based and fully decentralized baselines shows that they achieve consistently superior performance and remain effective across a wide range of traffic densities and distributions.
Related Results
TYPES OF AI ALGORİTHMS USED İN TRAFFİC FLOW PREDİCTİON
TYPES OF AI ALGORİTHMS USED İN TRAFFİC FLOW PREDİCTİON
The increasing complexity of urban transportation systems and the growing volume of vehicles have made traffic congestion a persistent challenge in modern cities. Efficient traffic...
A Deep Reinforcement Learning-Based Method for Signal Duration Control at Intersections with Asymmetric Traffic Flows
A Deep Reinforcement Learning-Based Method for Signal Duration Control at Intersections with Asymmetric Traffic Flows
At the intersection with asymmetric traffic flow, a single neural network or other control methods cannot make a choice in time to ensure that the intersection with a large traffic...
Smart Traffic Control Using Computer Vision
Smart Traffic Control Using Computer Vision
A Smart Traffic Control System using Computer Vision utilizes cameras, image processing techniques, and machine learning algorithms to monitor, analyze, and manage traffic flow aut...
Traffic Prediction in 5G Networks Using Machine Learning
Traffic Prediction in 5G Networks Using Machine Learning
The advent of 5G technology promises a paradigm shift in the realm of
telecommunications, offering unprecedented speeds and connectivity. However, the
...
The architecture of differences
The architecture of differences
Following in the footsteps of the protagonists of the Italian architectural debate is a mark of culture and proactivity. The synthesis deriving from the artistic-humanistic factors...
MODELİNG OF TRAFFİC LİGHT CONTROL SYSTEMS
MODELİNG OF TRAFFİC LİGHT CONTROL SYSTEMS
Traffic light control systems are commonly utilized to monitor and manage the flow of autos across multiple road intersections. Since traffic jams are ubiquitous in daily life, A c...
STRENGTH OF BUTT WELDED BUTT JOINT OF REINFORCEMENT OF CLASS A500C
STRENGTH OF BUTT WELDED BUTT JOINT OF REINFORCEMENT OF CLASS A500C
The paper presents the results of experimental studies of the strength of cross-shaped welded joints of types К1-Кт and К3-Рр [1] of thermomechanically hardened reinforcement of cl...
Experimental Study of Multi-Agent Reinforcement Learning Approaches for the Coverage Path Planning Problem
Experimental Study of Multi-Agent Reinforcement Learning Approaches for the Coverage Path Planning Problem
Purpose. The coverage path planning problem for groups of autonomous agents over a given target area is relevant to a wide range of applied systems. An increase in the dimensionali...

