Javascript must be enabled to continue!

Comparative Analysis of Loss Functions in TD3 forAutonomous Parking

Autonomous parking is a revolutionary technology that has transformed the automotive industry with the rise of deep reinforcement learning, in particular, the Twin-Delayed Deep Deterministic Policy Gradient Algorithm (TD3). Nonetheless, the robustness of TD3 remains a significant challenge due to bias in Q-value estimates when determining how good an Action, A, taken at a particular state, S. To investigate this gap, this paper analyzes different loss functions in TD3 to better approximate the true Q-value, which is necessary for optimal decision making. Three loss functions are evaluated; Mean Squared Error (MSE), Mean Absolute Error (MAE) and Huber Loss via a simulation experiment for autonomous parking. The results showed that TD3 with Huber Loss has the highest convergence speed with the fastest Actor and Critic loss convergence. The Huber Loss function is found to be more robust and efficient than either loss function such MSE or MAE used in isolation, making it a suitable replacement for existing loss functions in the TD3 algorithm. In the future, TD3 with Huber Loss will be used as the base model to solve overestimation problem in TD3 when the estimated Q-values that represent the expected rewards of taking an action in a particular state, are higher than their true values.

Penerbit UTHM

Ka Heng Chan Aida Mustapha Mohammed Ahmed Jubair

Journal of Soft Computing and Data Mining

2024

Title: Comparative Analysis of Loss Functions in TD3 forAutonomous Parking

Description:

Nonetheless, the robustness of TD3 remains a significant challenge due to bias in Q-value estimates when determining how good an Action, A, taken at a particular state, S.

To investigate this gap, this paper analyzes different loss functions in TD3 to better approximate the true Q-value, which is necessary for optimal decision making.

Three loss functions are evaluated; Mean Squared Error (MSE), Mean Absolute Error (MAE) and Huber Loss via a simulation experiment for autonomous parking.

The results showed that TD3 with Huber Loss has the highest convergence speed with the fastest Actor and Critic loss convergence.

The Huber Loss function is found to be more robust and efficient than either loss function such MSE or MAE used in isolation, making it a suitable replacement for existing loss functions in the TD3 algorithm.

In the future, TD3 with Huber Loss will be used as the base model to solve overestimation problem in TD3 when the estimated Q-values that represent the expected rewards of taking an action in a particular state, are higher than their true values.

Back

In a comprehensive and at times critical manner, this volume seeks to shed light on the development of events in Western (i.e., European and North American) comparative literature ...

Pelaksanaan Pemungutan Retribusi Parkir Di Kota Bajawa

This research aims to find out how the implementation of parking retribution in Bajawa City and the supporting and inhibiting factors of parking retribution implementation. This re...

Application of android-based parking violations reporting system to support green campus program

The use of private vehicles by the academic community of Sebelas Maret University (UNS) is increasing every year. This has resulted in a reduced availability of vehicle parking spa...

Sistem Pembayaran Parkir Non-Tunai Berbasis Mikrokontroler dengan Metode Template Matching

Parking is an activity to stop or store vehicles in a place that has been provided in public places that have a parking system. The parking system is a system that is applied to a ...

Strategi Penanganan Kemacetan Arus Lalu Lintas Berdasarkan Persepsi Masyarakat

Abstrak: Pasar sangatlah identik dengan pusat keramaian, karena pasar merupakan pusat perdagangan yang terletak di pusat kota yang sering disebut juga dengan kawasan Central Bussin...

POTENSI PENGENDALIAN ON STREET PARKING DI RUAS JALAN DAMAR KOTA PADANG

The Damar road section is a connecting lane for the Pasar Raya Padang area which starting to face parking problem. The indicator is the disruption of the flow and the high accumula...

Integration of YOLOv5 Algorithm and OpenCV in Innovative Smart Parking Management Approach

The problem of automatic parking lot identification and vehicle detection in open areas is becoming increasingly important due to the increase in the number of vehicles in Indonesi...

ANALYSIS OF PARKING MANAGEMENT POLICY IN GARUT REGENCY

Parking is an inseparable component or aspect of the need for a transportation system because every trip with a private vehicle generally always starts and ends in the parking lot....

Email:
Password:

Email:

Comparative Analysis of Loss Functions in TD3 forAutonomous Parking

Related Results