Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

An Adaptive Rank Aggregation-Based Ensemble Multi-Filter Feature Selection Method in Software Defect Prediction

View through CrossRef
Feature selection is known to be an applicable solution to address the problem of high dimensionality in software defect prediction (SDP). However, choosing an appropriate filter feature selection (FFS) method that will generate and guarantee optimal features in SDP is an open research issue, known as the filter rank selection problem. As a solution, the combination of multiple filter methods can alleviate the filter rank selection problem. In this study, a novel adaptive rank aggregation-based ensemble multi-filter feature selection (AREMFFS) method is proposed to resolve high dimensionality and filter rank selection problems in SDP. Specifically, the proposed AREMFFS method is based on assessing and combining the strengths of individual FFS methods by aggregating multiple rank lists in the generation and subsequent selection of top-ranked features to be used in the SDP process. The efficacy of the proposed AREMFFS method is evaluated with decision tree (DT) and naïve Bayes (NB) models on defect datasets from different repositories with diverse defect granularities. Findings from the experimental results indicated the superiority of AREMFFS over other baseline FFS methods that were evaluated, existing rank aggregation based multi-filter FS methods, and variants of AREMFFS as developed in this study. That is, the proposed AREMFFS method not only had a superior effect on prediction performances of SDP models but also outperformed baseline FS methods and existing rank aggregation based multi-filter FS methods. Therefore, this study recommends the combination of multiple FFS methods to utilize the strength of respective FFS methods and take advantage of filter–filter relationships in selecting optimal features for SDP processes.
Title: An Adaptive Rank Aggregation-Based Ensemble Multi-Filter Feature Selection Method in Software Defect Prediction
Description:
Feature selection is known to be an applicable solution to address the problem of high dimensionality in software defect prediction (SDP).
However, choosing an appropriate filter feature selection (FFS) method that will generate and guarantee optimal features in SDP is an open research issue, known as the filter rank selection problem.
As a solution, the combination of multiple filter methods can alleviate the filter rank selection problem.
In this study, a novel adaptive rank aggregation-based ensemble multi-filter feature selection (AREMFFS) method is proposed to resolve high dimensionality and filter rank selection problems in SDP.
Specifically, the proposed AREMFFS method is based on assessing and combining the strengths of individual FFS methods by aggregating multiple rank lists in the generation and subsequent selection of top-ranked features to be used in the SDP process.
The efficacy of the proposed AREMFFS method is evaluated with decision tree (DT) and naïve Bayes (NB) models on defect datasets from different repositories with diverse defect granularities.
Findings from the experimental results indicated the superiority of AREMFFS over other baseline FFS methods that were evaluated, existing rank aggregation based multi-filter FS methods, and variants of AREMFFS as developed in this study.
That is, the proposed AREMFFS method not only had a superior effect on prediction performances of SDP models but also outperformed baseline FS methods and existing rank aggregation based multi-filter FS methods.
Therefore, this study recommends the combination of multiple FFS methods to utilize the strength of respective FFS methods and take advantage of filter–filter relationships in selecting optimal features for SDP processes.

Related Results

Cigarettes with defective filters marketed for 40 years: what Philip Morris never told smokers: Table 1
Cigarettes with defective filters marketed for 40 years: what Philip Morris never told smokers: Table 1
Background: More than 90% of the cigarettes sold worldwide have a filter. Nearly all filters consist of a rod of numerous ( > 12 000) plastic-like cellulose acetate fibres. Duri...
Natural genetic variation and an alternative physiological state modify polyglutamine aggregation and toxicity in C. elegans
Natural genetic variation and an alternative physiological state modify polyglutamine aggregation and toxicity in C. elegans
Many human diseases are caused by mutations that induce misfolding and aggregation of the affected proteins, and are thought to result from failures in proteostasis. Pathways invol...
Ensemble Machine Learning Model for Software Defect Prediction
Ensemble Machine Learning Model for Software Defect Prediction
Software defect prediction is a significant activity in every software firm. It helps in producing quality software by reliable defect prediction, defect elimination, and predictio...
CFD Simulation and Optimization of a Cake Filtration System
CFD Simulation and Optimization of a Cake Filtration System
Abstract This study presents a simulation of filter cake formation during the filtration of rice hull ash and liquid mixture using ANSYS Fluent software. Filter cake...
Feature selection using a multi-strategy improved parrot optimization algorithm in software defect prediction
Feature selection using a multi-strategy improved parrot optimization algorithm in software defect prediction
Software defect detection is a critical research topic in the field of software engineering, aiming to identify potential defects during the development process to improve software...
The Efficiency of Aggregation Methods in Ensemble Filter Feature Selection Models
The Efficiency of Aggregation Methods in Ensemble Filter Feature Selection Models
Ensemble feature selection is recommended as it proves to produce a more stable subset of features and a better classification accuracy when compared to the individual feature selec...
Visual software defect prediction method based on improved recurrent criss-cross residual network
Visual software defect prediction method based on improved recurrent criss-cross residual network
Purpose This study aims to solve the problems of large training sample size, low data sample quality, low efficiency of the currently used classical model, high computational compl...
Selection Gradients
Selection Gradients
Natural selection and sexual selection are important evolutionary processes that can shape the phenotypic distributions of natural populations and, consequently, a primary goal of ...

Back to Top