Javascript must be enabled to continue!

Comparative Analysis of Loss Functions in TD3 forAutonomous Parking

Autonomous parking is a revolutionary technology that has transformed the automotive industry with the rise of deep reinforcement learning, in particular, the Twin-Delayed Deep Deterministic Policy Gradient Algorithm (TD3). Nonetheless, the robustness of TD3 remains a significant challenge due to bias in Q-value estimates when determining how good an Action, A, taken at a particular state, S. To investigate this gap, this paper analyzes different loss functions in TD3 to better approximate the true Q-value, which is necessary for optimal decision making. Three loss functions are evaluated; Mean Squared Error (MSE), Mean Absolute Error (MAE) and Huber Loss via a simulation experiment for autonomous parking. The results showed that TD3 with Huber Loss has the highest convergence speed with the fastest Actor and Critic loss convergence. The Huber Loss function is found to be more robust and efficient than either loss function such MSE or MAE used in isolation, making it a suitable replacement for existing loss functions in the TD3 algorithm. In the future, TD3 with Huber Loss will be used as the base model to solve overestimation problem in TD3 when the estimated Q-values that represent the expected rewards of taking an action in a particular state, are higher than their true values.

Penerbit UTHM

Ka Heng Chan Aida Mustapha Mohammed Ahmed Jubair

Journal of Soft Computing and Data Mining

2024

Title: Comparative Analysis of Loss Functions in TD3 forAutonomous Parking

Description:

Nonetheless, the robustness of TD3 remains a significant challenge due to bias in Q-value estimates when determining how good an Action, A, taken at a particular state, S.

To investigate this gap, this paper analyzes different loss functions in TD3 to better approximate the true Q-value, which is necessary for optimal decision making.

Three loss functions are evaluated; Mean Squared Error (MSE), Mean Absolute Error (MAE) and Huber Loss via a simulation experiment for autonomous parking.

The results showed that TD3 with Huber Loss has the highest convergence speed with the fastest Actor and Critic loss convergence.

The Huber Loss function is found to be more robust and efficient than either loss function such MSE or MAE used in isolation, making it a suitable replacement for existing loss functions in the TD3 algorithm.

In the future, TD3 with Huber Loss will be used as the base model to solve overestimation problem in TD3 when the estimated Q-values that represent the expected rewards of taking an action in a particular state, are higher than their true values.

Back

Parking is one element of the means that cannot be separated from the overall road transportation system. Parking is a problem that is often found in urban transportation systems b...

KARAKTERISTIK DAN BESARAN KEBUTUHAN RUANG PARKIR PENGEMBANGAN TOKO SEMERU DI MAKASSAR

Abstract The problem of parking in Makassar City is something that needs attention, because some commercial areas provide parking spaces that are not suitable for the capacity ...

Heterogeneous Parking Market Subject to Parking Rationing

Different types of drivers and parking spaces delineate a heterogeneous parking market for which the literature has yet to provide a model applicable to the real world. The main ob...

Legal Protection For Consumers Of Parking Services Indonesia In Kabanjahe

Conducting this research aims to find out what the legal relationship is between parking service users and parking service managers and what are the civil responsibilities of parki...

Control via Reinforcement Learning : Controle via Aprendizado por Reforço

This work presents a comprehensive review and practical application of Reinforcement Learning (RL) algorithms in control engineering. The theoretical groundwork of RL is laid out, ...

Pelaksanaan Pemungutan Retribusi Parkir Di Kota Bajawa

This research aims to find out how the implementation of parking retribution in Bajawa City and the supporting and inhibiting factors of parking retribution implementation. This re...

Primerjalna književnost na prelomu tisočletja

In a comprehensive and at times critical manner, this volume seeks to shed light on the development of events in Western (i.e., European and North American) comparative literature ...

MOTORCYCLE PARKING CAPACITY ANALYSIS AROUND UNIKOM CAMPUS, JALAN DIPATIUKUR, BANDUNG CITY

The level of customer satisfaction with the quality of parking services is viewed from five aspects, namely tangible, reliability, responsiveness, assurance, and empathy. The surve...

Email:
Password:

Email:

Comparative Analysis of Loss Functions in TD3 forAutonomous Parking

Related Results