Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Comparative Analysis of Loss Functions in TD3 forAutonomous Parking

View through CrossRef
Autonomous parking is a revolutionary technology that has transformed the automotive industry with the rise of deep reinforcement learning, in particular, the Twin-Delayed Deep Deterministic Policy Gradient Algorithm (TD3). Nonetheless, the robustness of TD3 remains a significant challenge due to bias in Q-value estimates when determining how good an Action, A, taken at a particular state, S. To investigate this gap, this paper analyzes different loss functions in TD3 to better approximate the true Q-value, which is necessary for optimal decision making. Three loss functions are evaluated; Mean Squared Error (MSE), Mean Absolute Error (MAE) and Huber Loss via a simulation experiment for autonomous parking. The results showed that TD3 with Huber Loss has the highest convergence speed with the fastest Actor and Critic loss convergence. The Huber Loss function is found to be more robust and efficient than either loss function such MSE or MAE used in isolation, making it a suitable replacement for existing loss functions in the TD3 algorithm. In the future, TD3 with Huber Loss will be used as the base model to solve overestimation problem in TD3 when the estimated Q-values that represent the expected rewards of taking an action in a particular state, are higher than their true values.
Title: Comparative Analysis of Loss Functions in TD3 forAutonomous Parking
Description:
Autonomous parking is a revolutionary technology that has transformed the automotive industry with the rise of deep reinforcement learning, in particular, the Twin-Delayed Deep Deterministic Policy Gradient Algorithm (TD3).
Nonetheless, the robustness of TD3 remains a significant challenge due to bias in Q-value estimates when determining how good an Action, A, taken at a particular state, S.
To investigate this gap, this paper analyzes different loss functions in TD3 to better approximate the true Q-value, which is necessary for optimal decision making.
Three loss functions are evaluated; Mean Squared Error (MSE), Mean Absolute Error (MAE) and Huber Loss via a simulation experiment for autonomous parking.
The results showed that TD3 with Huber Loss has the highest convergence speed with the fastest Actor and Critic loss convergence.
The Huber Loss function is found to be more robust and efficient than either loss function such MSE or MAE used in isolation, making it a suitable replacement for existing loss functions in the TD3 algorithm.
In the future, TD3 with Huber Loss will be used as the base model to solve overestimation problem in TD3 when the estimated Q-values that represent the expected rewards of taking an action in a particular state, are higher than their true values.

Related Results

Primerjalna književnost na prelomu tisočletja
Primerjalna književnost na prelomu tisočletja
In a comprehensive and at times critical manner, this volume seeks to shed light on the development of events in Western (i.e., European and North American) comparative literature ...
Pelaksanaan Pemungutan Retribusi Parkir Di Kota Bajawa
Pelaksanaan Pemungutan Retribusi Parkir Di Kota Bajawa
This research aims to find out how the implementation of parking retribution in Bajawa City and the supporting and inhibiting factors of parking retribution implementation. This re...
Application of android-based parking violations reporting system to support green campus program
Application of android-based parking violations reporting system to support green campus program
The use of private vehicles by the academic community of Sebelas Maret University (UNS) is increasing every year. This has resulted in a reduced availability of vehicle parking spa...
Sistem Pembayaran Parkir Non-Tunai Berbasis Mikrokontroler dengan Metode Template Matching
Sistem Pembayaran Parkir Non-Tunai Berbasis Mikrokontroler dengan Metode Template Matching
Parking is an activity to stop or store vehicles in a place that has been provided in public places that have a parking system. The parking system is a system that is applied to a ...
Strategi Penanganan Kemacetan Arus Lalu Lintas Berdasarkan Persepsi Masyarakat
Strategi Penanganan Kemacetan Arus Lalu Lintas Berdasarkan Persepsi Masyarakat
Abstrak: Pasar sangatlah identik dengan pusat keramaian, karena pasar merupakan pusat perdagangan yang terletak di pusat kota yang sering disebut juga dengan kawasan Central Bussin...
POTENSI PENGENDALIAN ON STREET PARKING DI RUAS JALAN DAMAR KOTA PADANG
POTENSI PENGENDALIAN ON STREET PARKING DI RUAS JALAN DAMAR KOTA PADANG
The Damar road section is a connecting lane for the Pasar Raya Padang area which starting to face parking problem. The indicator is the disruption of the flow and the high accumula...
Integration of YOLOv5 Algorithm and OpenCV in Innovative Smart Parking Management Approach
Integration of YOLOv5 Algorithm and OpenCV in Innovative Smart Parking Management Approach
The problem of automatic parking lot identification and vehicle detection in open areas is becoming increasingly important due to the increase in the number of vehicles in Indonesi...
ANALYSIS OF PARKING MANAGEMENT POLICY IN GARUT REGENCY
ANALYSIS OF PARKING MANAGEMENT POLICY IN GARUT REGENCY
Parking is an inseparable component or aspect of the need for a transportation system because every trip with a private vehicle generally always starts and ends in the parking lot....

Back to Top