Javascript must be enabled to continue!
Comparative Analysis of Gradient-Based Optimizers in Feedforward Neural Networks for Titanic Survival Prediction
View through CrossRef
The Titanic survival prediction problem has served as a benchmark for testing machine learning algorithms, particularly for binary classification tasks involving tabular data. While numerous models have been applied to this dataset, Feedforward Neural Networks (FNNs), also referred to as Multilayer Perceptrons (MLPs), offer unique advantages due to their ability to approximate complex functions. This study investigates the performance of FNNs for survival prediction using the Titanic dataset, focusing on the impact of gradient-based optimisation algorithms. Eight optimisers—Batch Gradient Descent (BGD), Stochastic Gradient Descent (SGD), Mini-Batch Gradient Descent, Nesterov Accelerated Gradient (NAG), Heavy Ball Method, Adam, RMSprop, and Nadam—were systematically compared across three FNN architectures: small ([64, 32, 16]), medium ([128, 64, 32]), and large ([256, 128, 64]). To enhance stability and generalisation, the models employed binary cross-entropy loss, dropout, L2 regularisation, batch normalisation, and Leaky ReLU activation. A dynamic learning rate scheduler was implemented to optimise training by adjusting the learning rate during each epoch. Models were trained using an 80-20 train-test split over 50 epochs, with performance assessed using metrics such as accuracy, precision, recall, F1 score, and cross-entropy loss. Results showed that Adam achieved the highest accuracy of 82.6% with an F1 score of 0.77 on the medium architecture, demonstrating the best balance between performance and training time. RMSprop and Nadam also delivered competitive results, particularly in terms of precision and generalisation. Smaller architectures were faster to train but showed reduced accuracy, while larger architectures marginally improved performance at the cost of longer training times. The inclusion of a learning rate scheduler further enhanced convergence and reduced overfitting, improving generalisation to unseen data. This study provides a comparative analysis of gradient-based optimisers for FNNs applied to tabular datasets, offering insights into the optimal configurations for balancing accuracy, generalisation, and computational efficiency. These findings contribute to the growing body of knowledge on leveraging neural networks for structured data tasks.
Title: Comparative Analysis of Gradient-Based Optimizers in Feedforward Neural Networks for Titanic Survival Prediction
Description:
The Titanic survival prediction problem has served as a benchmark for testing machine learning algorithms, particularly for binary classification tasks involving tabular data.
While numerous models have been applied to this dataset, Feedforward Neural Networks (FNNs), also referred to as Multilayer Perceptrons (MLPs), offer unique advantages due to their ability to approximate complex functions.
This study investigates the performance of FNNs for survival prediction using the Titanic dataset, focusing on the impact of gradient-based optimisation algorithms.
Eight optimisers—Batch Gradient Descent (BGD), Stochastic Gradient Descent (SGD), Mini-Batch Gradient Descent, Nesterov Accelerated Gradient (NAG), Heavy Ball Method, Adam, RMSprop, and Nadam—were systematically compared across three FNN architectures: small ([64, 32, 16]), medium ([128, 64, 32]), and large ([256, 128, 64]).
To enhance stability and generalisation, the models employed binary cross-entropy loss, dropout, L2 regularisation, batch normalisation, and Leaky ReLU activation.
A dynamic learning rate scheduler was implemented to optimise training by adjusting the learning rate during each epoch.
Models were trained using an 80-20 train-test split over 50 epochs, with performance assessed using metrics such as accuracy, precision, recall, F1 score, and cross-entropy loss.
Results showed that Adam achieved the highest accuracy of 82.
6% with an F1 score of 0.
77 on the medium architecture, demonstrating the best balance between performance and training time.
RMSprop and Nadam also delivered competitive results, particularly in terms of precision and generalisation.
Smaller architectures were faster to train but showed reduced accuracy, while larger architectures marginally improved performance at the cost of longer training times.
The inclusion of a learning rate scheduler further enhanced convergence and reduced overfitting, improving generalisation to unseen data.
This study provides a comparative analysis of gradient-based optimisers for FNNs applied to tabular datasets, offering insights into the optimal configurations for balancing accuracy, generalisation, and computational efficiency.
These findings contribute to the growing body of knowledge on leveraging neural networks for structured data tasks.
Related Results
Primerjalna književnost na prelomu tisočletja
Primerjalna književnost na prelomu tisočletja
In a comprehensive and at times critical manner, this volume seeks to shed light on the development of events in Western (i.e., European and North American) comparative literature ...
Artificial neural network for the recognition of human emotions under a backpropagation algorithm
Artificial neural network for the recognition of human emotions under a backpropagation algorithm
The era of the technological revolution increasingly encourages the development of technologies that facilitate in one way or another people's daily activities, thus generating a g...
Sytuacja prawna wraku RMS Titanic
Sytuacja prawna wraku RMS Titanic
The legal protection of the RMS Titanic
Almost immediately after the Titanic sank on 15 April 1912, proposals were advanced to salvage the ship from her resting place. The wreck w...
Small Cell Lung Cancer and Tarlatamab: A Meta-Analysis of Clinical Trials
Small Cell Lung Cancer and Tarlatamab: A Meta-Analysis of Clinical Trials
Abstract
Introduction
Tarlatamab is a Delta-like ligand 3 (DLL3) -directed bispecific T-cell engager recently approved for use in patients with advanced small cell lung cancer (SCL...
On the role of network dynamics for information processing in artificial and biological neural networks
On the role of network dynamics for information processing in artificial and biological neural networks
Understanding how interactions in complex systems give rise to various collective behaviours has been of interest for researchers across a wide range of fields. However, despite ma...
Fuzzy Chaotic Neural Networks
Fuzzy Chaotic Neural Networks
An understanding of the human brain’s local function has improved in recent years. But the cognition of human brain’s working process as a whole is still obscure. Both fuzzy logic ...
Mnemonic but not contextual feedback signals defy dedifferentiation in the aging early visual cortex
Mnemonic but not contextual feedback signals defy dedifferentiation in the aging early visual cortex
Perception is an intricate interplay between feedforward visual input and internally generated feedback signals that comprise concurrent contextual and time-distant mnemonic (episo...
Subtitling Translation Strategy and Semiotic Approach on Titanic Film
Subtitling Translation Strategy and Semiotic Approach on Titanic Film
This research is entitled “Subtitling Translation Strategy and Semiotic Approach on Titanic Film”. This research aims to identify and analyze the subtitling translation strategies ...

