MinE-RFE: determine the optimal subset from RFE by minimizing the subset-accuracy–defined energy
Abstract
Recursive feature elimination (RFE), one of the most popular feature selection algorithms, has been applied extensively in bioinformatics. During training, a group of candidate subsets is generated by iteratively eliminating the least important features from the original feature set. However, how to determine the optimal subset among these candidates remains ambiguous. In most current studies, either overall accuracy or subset size (SS) is used to select the most predictive features. Which criterion to use, whether to use both, and how each affects prediction performance are still open questions. In this study, we proposed MinE-RFE, a novel RFE-based feature selection approach that takes the effect of both factors into account. The subset decision problem was mapped into subset-accuracy space, where it became an energy-minimization problem. We also provided a mathematical description of the relationship between overall accuracy and SS using Gaussian Mixture Models together with spline fitting. In addition, we comprehensively reviewed a variety of state-of-the-art applications of RFE in bioinformatics. We compared their approaches to deciding the final subset from all the candidate subsets with MinE-RFE on diverse bioinformatics data sets. We also compared MinE-RFE with several widely used feature selection algorithms. The comparative results demonstrate that the proposed approach exhibits the best performance among all the approaches. To facilitate the use of MinE-RFE, we further established a user-friendly web server implementing the proposed approach, which is accessible at http://qgking.wicp.net/MinE/. We expect this web server to be a useful tool for the research community.
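To make the idea in the abstract concrete, the following is a minimal Python sketch of the two steps it describes: generating nested candidate subsets by recursive elimination, then choosing one subset by trading accuracy against subset size. It is an illustration under stated assumptions, not the MinE-RFE implementation. The abstract does not give the energy function, so the `energy` used here (error plus a weighted relative subset size) is a hypothetical stand-in, and the names `rfe_candidates`, `min_energy_subset`, `size_weight`, and the toy importance and accuracy functions are all invented for this example.

```python
from typing import Callable, List, Sequence, Tuple

Candidate = Tuple[Tuple[int, ...], float]  # (feature indices, accuracy)


def rfe_candidates(
    n_features: int,
    importance_fn: Callable[[Sequence[int]], Sequence[float]],
    accuracy_fn: Callable[[Sequence[int]], float],
) -> List[Candidate]:
    """Build nested candidate subsets by dropping the least important
    feature each round, recording the accuracy of every subset."""
    remaining = list(range(n_features))
    candidates: List[Candidate] = []
    while remaining:
        candidates.append((tuple(remaining), accuracy_fn(remaining)))
        if len(remaining) == 1:
            break
        # Importance is re-estimated on the current subset every round.
        scores = importance_fn(remaining)
        worst = min(range(len(remaining)), key=lambda i: scores[i])
        del remaining[worst]
    return candidates


def min_energy_subset(
    candidates: Sequence[Candidate], n_features: int, size_weight: float = 0.1
) -> Candidate:
    """Pick the candidate with the lowest HYPOTHETICAL energy:
    (1 - accuracy) + size_weight * (subset size / total features).
    This is NOT the energy defined in the paper."""
    def energy(candidate: Candidate) -> float:
        subset, accuracy = candidate
        return (1.0 - accuracy) + size_weight * len(subset) / n_features
    return min(candidates, key=energy)


# Toy stand-ins for a trained model (assumptions for illustration only).
TOY_WEIGHTS = [0.9, 0.1, 0.7, 0.05, 0.3]  # fixed per-feature importance


def toy_importance(subset: Sequence[int]) -> List[float]:
    return [TOY_WEIGHTS[j] for j in subset]


def toy_accuracy(subset: Sequence[int]) -> float:
    # Features 0 and 2 are informative; every other feature adds noise.
    gain = 0.25 * (0 in subset) + 0.20 * (2 in subset)
    noise = 0.02 * sum(1 for j in subset if j not in (0, 2))
    return 0.5 + gain - noise


candidates = rfe_candidates(5, toy_importance, toy_accuracy)
best_subset, best_accuracy = min_energy_subset(candidates, n_features=5)
print(best_subset, round(best_accuracy, 2))
```

In this toy setting, selecting on accuracy alone and selecting on the combined energy happen to agree; with a flatter accuracy curve, the size term would push the choice toward smaller subsets. In practice `importance_fn` and `accuracy_fn` would wrap a real classifier and a cross-validation routine.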

