Javascript must be enabled to continue!
Comparative Analysis of Loss Functions in TD3 forAutonomous Parking
View through CrossRef
Autonomous parking is a revolutionary technology that has transformed the automotive industry with the rise of deep reinforcement learning, in particular, the Twin-Delayed Deep Deterministic Policy Gradient Algorithm (TD3). Nonetheless, the robustness of TD3 remains a significant challenge due to bias in Q-value estimates when determining how good an Action, A, taken at a particular state, S. To investigate this gap, this paper analyzes different loss functions in TD3 to better approximate the true Q-value, which is necessary for optimal decision making. Three loss functions are evaluated; Mean Squared Error (MSE), Mean Absolute Error (MAE) and Huber Loss via a simulation experiment for autonomous parking. The results showed that TD3 with Huber Loss has the highest convergence speed with the fastest Actor and Critic loss convergence. The Huber Loss function is found to be more robust and efficient than either loss function such MSE or MAE used in isolation, making it a suitable replacement for existing loss functions in the TD3 algorithm. In the future, TD3 with Huber Loss will be used as the base model to solve overestimation problem in TD3 when the estimated Q-values that represent the expected rewards of taking an action in a particular state, are higher than their true values.
Title: Comparative Analysis of Loss Functions in TD3 forAutonomous Parking
Description:
Autonomous parking is a revolutionary technology that has transformed the automotive industry with the rise of deep reinforcement learning, in particular, the Twin-Delayed Deep Deterministic Policy Gradient Algorithm (TD3).
Nonetheless, the robustness of TD3 remains a significant challenge due to bias in Q-value estimates when determining how good an Action, A, taken at a particular state, S.
To investigate this gap, this paper analyzes different loss functions in TD3 to better approximate the true Q-value, which is necessary for optimal decision making.
Three loss functions are evaluated; Mean Squared Error (MSE), Mean Absolute Error (MAE) and Huber Loss via a simulation experiment for autonomous parking.
The results showed that TD3 with Huber Loss has the highest convergence speed with the fastest Actor and Critic loss convergence.
The Huber Loss function is found to be more robust and efficient than either loss function such MSE or MAE used in isolation, making it a suitable replacement for existing loss functions in the TD3 algorithm.
In the future, TD3 with Huber Loss will be used as the base model to solve overestimation problem in TD3 when the estimated Q-values that represent the expected rewards of taking an action in a particular state, are higher than their true values.
Related Results
Capacity Analysis of Vehicle Parking Area in "Terminal Petikemas Surabaya"
Capacity Analysis of Vehicle Parking Area in "Terminal Petikemas Surabaya"
Parking is one element of the means that cannot be separated from the overall road transportation system. Parking is a problem that is often found in urban transportation systems b...
KARAKTERISTIK DAN BESARAN KEBUTUHAN RUANG PARKIR PENGEMBANGAN TOKO SEMERU DI MAKASSAR
KARAKTERISTIK DAN BESARAN KEBUTUHAN RUANG PARKIR PENGEMBANGAN TOKO SEMERU DI MAKASSAR
Abstract
The problem of parking in Makassar City is something that needs attention, because some commercial areas provide parking spaces that are not suitable for the capacity ...
Heterogeneous Parking Market Subject to Parking Rationing
Heterogeneous Parking Market Subject to Parking Rationing
Different types of drivers and parking spaces delineate a heterogeneous parking market for which the literature has yet to provide a model applicable to the real world. The main ob...
Legal Protection For Consumers Of Parking Services Indonesia In Kabanjahe
Legal Protection For Consumers Of Parking Services Indonesia In Kabanjahe
Conducting this research aims to find out what the legal relationship is between parking service users and parking service managers and what are the civil responsibilities of parki...
Control via Reinforcement Learning : Controle via Aprendizado por Reforço
Control via Reinforcement Learning : Controle via Aprendizado por Reforço
This work presents a comprehensive review and practical application of Reinforcement Learning (RL) algorithms in control engineering. The theoretical groundwork of RL is laid out, ...
Pelaksanaan Pemungutan Retribusi Parkir Di Kota Bajawa
Pelaksanaan Pemungutan Retribusi Parkir Di Kota Bajawa
This research aims to find out how the implementation of parking retribution in Bajawa City and the supporting and inhibiting factors of parking retribution implementation. This re...
Primerjalna književnost na prelomu tisočletja
Primerjalna književnost na prelomu tisočletja
In a comprehensive and at times critical manner, this volume seeks to shed light on the development of events in Western (i.e., European and North American) comparative literature ...
MOTORCYCLE PARKING CAPACITY ANALYSIS AROUND UNIKOM CAMPUS, JALAN DIPATIUKUR, BANDUNG CITY
MOTORCYCLE PARKING CAPACITY ANALYSIS AROUND UNIKOM CAMPUS, JALAN DIPATIUKUR, BANDUNG CITY
The level of customer satisfaction with the quality of parking services is viewed from five aspects, namely tangible, reliability, responsiveness, assurance, and empathy. The surve...

