Javascript must be enabled to continue!
Comparative Analysis of Loss Functions in TD3 forAutonomous Parking
View through CrossRef
Autonomous parking is a revolutionary technology that has transformed the automotive industry with the rise of deep reinforcement learning, in particular, the Twin-Delayed Deep Deterministic Policy Gradient Algorithm (TD3). Nonetheless, the robustness of TD3 remains a significant challenge due to bias in Q-value estimates when determining how good an Action, A, taken at a particular state, S. To investigate this gap, this paper analyzes different loss functions in TD3 to better approximate the true Q-value, which is necessary for optimal decision making. Three loss functions are evaluated; Mean Squared Error (MSE), Mean Absolute Error (MAE) and Huber Loss via a simulation experiment for autonomous parking. The results showed that TD3 with Huber Loss has the highest convergence speed with the fastest Actor and Critic loss convergence. The Huber Loss function is found to be more robust and efficient than either loss function such MSE or MAE used in isolation, making it a suitable replacement for existing loss functions in the TD3 algorithm. In the future, TD3 with Huber Loss will be used as the base model to solve overestimation problem in TD3 when the estimated Q-values that represent the expected rewards of taking an action in a particular state, are higher than their true values.
Title: Comparative Analysis of Loss Functions in TD3 forAutonomous Parking
Description:
Autonomous parking is a revolutionary technology that has transformed the automotive industry with the rise of deep reinforcement learning, in particular, the Twin-Delayed Deep Deterministic Policy Gradient Algorithm (TD3).
Nonetheless, the robustness of TD3 remains a significant challenge due to bias in Q-value estimates when determining how good an Action, A, taken at a particular state, S.
To investigate this gap, this paper analyzes different loss functions in TD3 to better approximate the true Q-value, which is necessary for optimal decision making.
Three loss functions are evaluated; Mean Squared Error (MSE), Mean Absolute Error (MAE) and Huber Loss via a simulation experiment for autonomous parking.
The results showed that TD3 with Huber Loss has the highest convergence speed with the fastest Actor and Critic loss convergence.
The Huber Loss function is found to be more robust and efficient than either loss function such MSE or MAE used in isolation, making it a suitable replacement for existing loss functions in the TD3 algorithm.
In the future, TD3 with Huber Loss will be used as the base model to solve overestimation problem in TD3 when the estimated Q-values that represent the expected rewards of taking an action in a particular state, are higher than their true values.
Related Results
Primerjalna književnost na prelomu tisočletja
Primerjalna književnost na prelomu tisočletja
In a comprehensive and at times critical manner, this volume seeks to shed light on the development of events in Western (i.e., European and North American) comparative literature ...
Pelaksanaan Pemungutan Retribusi Parkir Di Kota Bajawa
Pelaksanaan Pemungutan Retribusi Parkir Di Kota Bajawa
This research aims to find out how the implementation of parking retribution in Bajawa City and the supporting and inhibiting factors of parking retribution implementation. This re...
Design of double intelligent parking garage based on HuiYu technology
Design of double intelligent parking garage based on HuiYu technology
Abstract
In view of the ubiquitous parking problem in the city, a double-layer intelligent parking garage that saves space, is smart and labor-saving. On the basis o...
Application of android-based parking violations reporting system to support green campus program
Application of android-based parking violations reporting system to support green campus program
The use of private vehicles by the academic community of Sebelas Maret University (UNS) is increasing every year. This has resulted in a reduced availability of vehicle parking spa...
Sistem Pembayaran Parkir Non-Tunai Berbasis Mikrokontroler dengan Metode Template Matching
Sistem Pembayaran Parkir Non-Tunai Berbasis Mikrokontroler dengan Metode Template Matching
Parking is an activity to stop or store vehicles in a place that has been provided in public places that have a parking system. The parking system is a system that is applied to a ...
Strategi Penanganan Kemacetan Arus Lalu Lintas Berdasarkan Persepsi Masyarakat
Strategi Penanganan Kemacetan Arus Lalu Lintas Berdasarkan Persepsi Masyarakat
Abstrak: Pasar sangatlah identik dengan pusat keramaian, karena pasar merupakan pusat perdagangan yang terletak di pusat kota yang sering disebut juga dengan kawasan Central Bussin...
POTENSI PENGENDALIAN ON STREET PARKING DI RUAS JALAN DAMAR KOTA PADANG
POTENSI PENGENDALIAN ON STREET PARKING DI RUAS JALAN DAMAR KOTA PADANG
The Damar road section is a connecting lane for the Pasar Raya Padang area which starting to face parking problem. The indicator is the disruption of the flow and the high accumula...
Development and Implementation of a Smart Parking Spot Allocation SystemBased on the User’s Category and Priority using Verilog HDL
Development and Implementation of a Smart Parking Spot Allocation SystemBased on the User’s Category and Priority using Verilog HDL
Finding parking spotsfor automobiles is a major issue in many large and congested cities. Usually, drivers lose time searching for parking spots, especially during pea...

