
MinE-RFE: determine the optimal subset from RFE by minimizing the subset-accuracy–defined energy

Abstract Recursive feature elimination (RFE), one of the most popular feature selection algorithms, has been extensively applied in bioinformatics. During training, a group of candidate subsets is generated by iteratively eliminating the least important features from the original feature set. However, how to determine the optimal subset among them remains ambiguous. Most current studies use either overall accuracy or subset size (SS) to select the most predictive features. Whether to use one or both, and how they affect prediction performance, are still open questions. In this study, we proposed MinE-RFE, a novel RFE-based feature selection approach that takes the effect of both factors into account. The subset decision problem was mapped into subset-accuracy space, where it became an energy-minimization problem. We also provided a mathematical description of the relationship between overall accuracy and SS using Gaussian mixture models together with spline fitting. In addition, we comprehensively reviewed a variety of state-of-the-art applications of RFE in bioinformatics. We compared their approaches to choosing the final subset from all the candidate subsets with MinE-RFE on diverse bioinformatics data sets. We also compared MinE-RFE with some widely used feature selection algorithms. The comparative results demonstrate that the proposed approach exhibits the best performance among all the approaches. To facilitate the use of MinE-RFE, we further established a user-friendly web server implementing the proposed approach, which is accessible at http://qgking.wicp.net/MinE/. We expect this web server to be a useful tool for the research community.
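The idea of choosing among RFE candidate subsets by minimizing an energy over subset size and accuracy can be illustrated with a minimal sketch. The abstract does not state the energy function, so the weighted sum below, the `weight` parameter, the function name `select_subset`, and the sample numbers are all assumptions made for illustration. This is not the MinE-RFE definition, and it does not reproduce the Gaussian mixture or spline fitting steps.

```python
from typing import List, Tuple

Candidate = Tuple[int, float]  # (subset size, overall accuracy)


def select_subset(
    candidates: List[Candidate], n_features: int, weight: float = 0.5
) -> Candidate:
    """Return the candidate with the lowest illustrative energy.

    ASSUMPTION: energy = weight * (size / n_features)
                       + (1 - weight) * (1 - accuracy).
    This toy trade-off is hypothetical and is not taken from the paper.
    """
    if not candidates:
        raise ValueError("candidates must not be empty")
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must lie in [0, 1]")

    def energy(candidate: Candidate) -> float:
        size, accuracy = candidate
        # Smaller subsets and higher accuracy both lower the energy.
        return weight * (size / n_features) + (1.0 - weight) * (1.0 - accuracy)

    return min(candidates, key=energy)


# Hypothetical (subset size, accuracy) pairs, as an RFE run might produce.
candidates = [(100, 0.90), (50, 0.91), (20, 0.89), (10, 0.86), (5, 0.65)]
best = select_subset(candidates, n_features=100)
print(best)
```

With `weight=0.0` the sketch reduces to picking the most accurate subset, and with `weight=1.0` to picking the smallest one, which are the two single-criterion strategies the abstract contrasts.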

Related Results

The role of autacoids and the autonomic nervous system in cardiovascular responses to radio‐frequency energy heating
Summary 1 Among the potential effects of exposure to high levels of radio‐frequency energy (RFE) (which includes microwaves), an increase in body temperature is the primary consequ...
Breast Carcinoma within Fibroadenoma: A Systematic Review
Abstract Introduction Fibroadenoma is the most common benign breast lesion; however, it carries a potential risk of malignant transformation. This systematic review provides an ove...
Frequency of Common Chromosomal Abnormalities in Patients with Idiopathic Acquired Aplastic Anemia
Objective: To determine the frequency of common chromosomal aberrations in local population idiopathic determine the frequency of common chromosomal aberrations in local population...
MSVM-RFE: extensions of SVM-RFE for multiclass gene selection on DNA microarray data
Abstract Motivation: Given the thousands of genes and the small number of samples, gene selection has emerged as an important research problem in microarray data analysis. Support V...
Dynamic Weighted Particle Swarm Optimization - Support Vector Machine Optimization in Recursive Feature Elimination Feature Selection
Feature Selection is a crucial step in data preprocessing to enhance machine learning efficiency, reduce computational complexity, and improve classification accuracy. The main cha...
Introducing Optimal Energy Hub Approach in Smart Green Ports based on Machine Learning Methodology
Abstract The integration of renewable energy systems in port facilities is essential for achieving sustainable and environmentally friendly operations. This paper presents...
