Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Comparative Analysis of Loss Functions in TD3 forAutonomous Parking

View through CrossRef
Autonomous parking is a revolutionary technology that has transformed the automotive industry with the rise of deep reinforcement learning, in particular, the Twin-Delayed Deep Deterministic Policy Gradient Algorithm (TD3). Nonetheless, the robustness of TD3 remains a significant challenge due to bias in Q-value estimates when determining how good an Action, A, taken at a particular state, S. To investigate this gap, this paper analyzes different loss functions in TD3 to better approximate the true Q-value, which is necessary for optimal decision making. Three loss functions are evaluated; Mean Squared Error (MSE), Mean Absolute Error (MAE) and Huber Loss via a simulation experiment for autonomous parking. The results showed that TD3 with Huber Loss has the highest convergence speed with the fastest Actor and Critic loss convergence. The Huber Loss function is found to be more robust and efficient than either loss function such MSE or MAE used in isolation, making it a suitable replacement for existing loss functions in the TD3 algorithm. In the future, TD3 with Huber Loss will be used as the base model to solve overestimation problem in TD3 when the estimated Q-values that represent the expected rewards of taking an action in a particular state, are higher than their true values.
Title: Comparative Analysis of Loss Functions in TD3 forAutonomous Parking
Description:
Autonomous parking is a revolutionary technology that has transformed the automotive industry with the rise of deep reinforcement learning, in particular, the Twin-Delayed Deep Deterministic Policy Gradient Algorithm (TD3).
Nonetheless, the robustness of TD3 remains a significant challenge due to bias in Q-value estimates when determining how good an Action, A, taken at a particular state, S.
To investigate this gap, this paper analyzes different loss functions in TD3 to better approximate the true Q-value, which is necessary for optimal decision making.
Three loss functions are evaluated; Mean Squared Error (MSE), Mean Absolute Error (MAE) and Huber Loss via a simulation experiment for autonomous parking.
The results showed that TD3 with Huber Loss has the highest convergence speed with the fastest Actor and Critic loss convergence.
The Huber Loss function is found to be more robust and efficient than either loss function such MSE or MAE used in isolation, making it a suitable replacement for existing loss functions in the TD3 algorithm.
In the future, TD3 with Huber Loss will be used as the base model to solve overestimation problem in TD3 when the estimated Q-values that represent the expected rewards of taking an action in a particular state, are higher than their true values.

Related Results

Primerjalna književnost na prelomu tisočletja
Primerjalna književnost na prelomu tisočletja
In a comprehensive and at times critical manner, this volume seeks to shed light on the development of events in Western (i.e., European and North American) comparative literature ...
Pelaksanaan Pemungutan Retribusi Parkir Di Kota Bajawa
Pelaksanaan Pemungutan Retribusi Parkir Di Kota Bajawa
This research aims to find out how the implementation of parking retribution in Bajawa City and the supporting and inhibiting factors of parking retribution implementation. This re...
Design of double intelligent parking garage based on HuiYu technology
Design of double intelligent parking garage based on HuiYu technology
Abstract In view of the ubiquitous parking problem in the city, a double-layer intelligent parking garage that saves space, is smart and labor-saving. On the basis o...
Application of android-based parking violations reporting system to support green campus program
Application of android-based parking violations reporting system to support green campus program
The use of private vehicles by the academic community of Sebelas Maret University (UNS) is increasing every year. This has resulted in a reduced availability of vehicle parking spa...
Sistem Pembayaran Parkir Non-Tunai Berbasis Mikrokontroler dengan Metode Template Matching
Sistem Pembayaran Parkir Non-Tunai Berbasis Mikrokontroler dengan Metode Template Matching
Parking is an activity to stop or store vehicles in a place that has been provided in public places that have a parking system. The parking system is a system that is applied to a ...
Strategi Penanganan Kemacetan Arus Lalu Lintas Berdasarkan Persepsi Masyarakat
Strategi Penanganan Kemacetan Arus Lalu Lintas Berdasarkan Persepsi Masyarakat
Abstrak: Pasar sangatlah identik dengan pusat keramaian, karena pasar merupakan pusat perdagangan yang terletak di pusat kota yang sering disebut juga dengan kawasan Central Bussin...
POTENSI PENGENDALIAN ON STREET PARKING DI RUAS JALAN DAMAR KOTA PADANG
POTENSI PENGENDALIAN ON STREET PARKING DI RUAS JALAN DAMAR KOTA PADANG
The Damar road section is a connecting lane for the Pasar Raya Padang area which starting to face parking problem. The indicator is the disruption of the flow and the high accumula...

Back to Top