Improving F-beta Score in Classifying Shark Data into Shark Behaviors
The F-beta score is one metric used to measure classification performance in machine learning. The objective of this thesis is to improve the average F-beta score obtained when classifying shark data into four shark behaviors: Resting, Swimming, Feeding, and Non-Directed Motion (NDM). Synthetic Minority Oversampling Technique (SMOTE) and Adaptive Synthetic Sampling (ADASYN) are used to balance the data, from which pre-processed Fast Fourier Transform (FFT), Walsh-Hadamard Transform (WHT), and Autocorrelation (AC) features are extracted and then classified using a Convolutional Neural Network (CNN) and K-Nearest Neighbors (K-NN). All combinations of the two balancing techniques, the three feature types, and the two machine learning algorithms are applied and compared to examine the improvement in the average F-beta score. Other signal processing techniques are also applied to reduce the noise level of the recorded raw shark data and enhance its Signal-to-Noise Ratio (SNR). The average F-beta scores showed that K-NN performed best with FFT-only features, while CNN performed best with combined WHT-FFT features. In the K-NN case, FFT performed better alone than when combined with any other feature type. WHT, on the other hand, performed better when combined with any other feature type than when used alone. In the CNN case, WHT and FFT performed better together than they did separately. In other words, combining FFT and WHT features in CNN considerably improved the average F-beta score, while combining them in K-NN merely averaged their individual scores. AC did not work well in CNN, whether alone or combined with other feature types, as it resulted in poor average F-beta scores. In K-NN, combining AC with other feature types did not improve the average F-beta score compared with when it was used alone.
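As an illustration of the metric discussed above, the following minimal sketch computes a per-class F-beta score from precision and recall and then averages it over the four behavior classes. It assumes that "average" means an unweighted (macro) average across classes, which the abstract does not state. The function names and the toy labels are hypothetical and are not taken from the thesis.

```python
def f_beta(precision, recall, beta=1.0):
    """F-beta score: weighted harmonic mean of precision and recall.

    beta > 1 weights recall more heavily; beta < 1 weights precision more.
    """
    if precision == 0.0 and recall == 0.0:
        return 0.0
    b2 = beta * beta
    return (1.0 + b2) * precision * recall / (b2 * precision + recall)


def average_f_beta(y_true, y_pred, labels, beta=1.0):
    """Unweighted mean of per-class F-beta scores (assumed macro average)."""
    scores = []
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(1 for t, p in pairs if t == label and p == label)
        fp = sum(1 for t, p in pairs if t != label and p == label)
        fn = sum(1 for t, p in pairs if t == label and p != label)
        precision = tp / (tp + fp) if (tp + fp) else 0.0
        recall = tp / (tp + fn) if (tp + fn) else 0.0
        scores.append(f_beta(precision, recall, beta))
    return sum(scores) / len(scores)


# Toy example with hypothetical labels (not data from the thesis).
LABELS = ["Resting", "Swimming", "Feeding", "NDM"]
y_true = ["Resting", "Resting", "Swimming", "Feeding"]
y_pred = ["Resting", "Swimming", "Swimming", "Feeding"]
macro_f1 = average_f_beta(y_true, y_pred, LABELS, beta=1.0)
```

A class with no true or predicted samples (NDM in the toy data) contributes a score of zero here. This is one common convention, and it pulls the macro average down when a rare behavior is never predicted.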
The average F-beta scores also showed that reducing the data imbalance during the pre-processing phase is more effective than mitigating its misleading effect on classification during the machine learning phase. Prior balancing was performed using SMOTE and ADASYN, while later mitigation was performed using weight-sensitive learning. SMOTE, and ADASYN even more so, reduced the difference between precision and recall scores and produced higher F-beta scores. Besides the two balancing techniques, the three feature types, and the two machine learning algorithms, other pre-processing techniques applied to the raw data contributed to the improvement of the average F-beta score. These techniques included framing, detrending, normalization, Ensemble Average (EA) based low-pass filtering, filter delay compensation, overlap windowing, and k-fold cross validation. For example, the average F-beta scores showed that applying EA-based low-pass filters (LPF) to the data, prior to machine learning and classification, improves the SNR and consequently improves the average F-beta scores significantly. As an end result, for the shark data used in this thesis, CNN was found to be a better choice than K-NN, and it performed best with WHT-FFT features and ADASYN as the balancing technique.
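The balancing step described above can be illustrated with a minimal sketch of the core SMOTE idea: each synthetic minority sample is an interpolation between a minority sample and one of its nearest minority neighbors. This is not the thesis implementation. The function name, the parameters `n_new`, `k`, and `seed`, and the toy points are all assumptions made for illustration. ADASYN builds on the same interpolation but generates more samples around minority points that have more majority-class neighbors, which this sketch does not do.

```python
import random


def smote_like_oversample(minority, n_new, k=3, seed=0):
    """Generate n_new synthetic points by interpolating between minority
    samples and their k nearest minority neighbors (basic SMOTE idea).

    minority: list of equal-length feature vectors (needs at least 2).
    """
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # Rank the other minority samples by squared Euclidean distance.
        others = [p for p in minority if p is not base]
        others.sort(key=lambda p: sum((a - b) ** 2 for a, b in zip(base, p)))
        neighbor = rng.choice(others[:k])
        # Pick a random point on the segment between base and neighbor.
        gap = rng.random()
        synthetic.append([a + gap * (b - a) for a, b in zip(base, neighbor)])
    return synthetic


# Toy minority class (hypothetical 2-D feature vectors).
minority_points = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
new_points = smote_like_oversample(minority_points, n_new=5)
```

Because every synthetic point lies on a segment between two existing minority samples, the new data stays inside the region already covered by the minority class rather than duplicating samples exactly.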

