Javascript must be enabled to continue!
Variable selection procedures under collinearity (multicollinearity)
View through CrossRef
Variable selection is an important area of statistical modeling, which is still an active area of research. In this study, we investigated the performance of four variable selection procedures: backward elimination, Bayesian, least absolute shrinkage selection operator (lasso), and ridge selection operator under collinearity. Specifically, we focused on the variable selection accuracy, prediction accuracy, and their ability to mitigate the impact of collinearity on variable selection and predictive accuracy. What we found is that sample size, and the strength of the regression coefficient impacted the selection accuracy specifically in the standard linear regression simulation, where the simulation results showed that as the sample size decreased, the selection accuracy decreased specifically for methods such as backward elimination, Bayesian, and the lasso, while that of the ridge selection operator the variable selection accuracy increased. In the high dimensional simulation, the lasso outperformed the other variable selection methods, indicating its unique features to select sparse models even when the number of parameters exceeds the sample size. However, the estimates were extremely biased for all methods investigated in the high dimensional regression simulation. Though the computational times were not reported, the ridge selection operator was the slowest, especially for large sample sizes or number of predictors. This indicates its unfeasibility in the practical world where sample sizes or number of predictors are large since it is too computationally costly. In terms of prediction, overall, Bayesian g-prior variable selection and the lasso-fitted models performed better than those from the other variable selection methods. In the last section we discuss the limitations and provide recommendations on using these methods.
Title: Variable selection procedures under collinearity (multicollinearity)
Description:
Variable selection is an important area of statistical modeling, which is still an active area of research.
In this study, we investigated the performance of four variable selection procedures: backward elimination, Bayesian, least absolute shrinkage selection operator (lasso), and ridge selection operator under collinearity.
Specifically, we focused on the variable selection accuracy, prediction accuracy, and their ability to mitigate the impact of collinearity on variable selection and predictive accuracy.
What we found is that sample size, and the strength of the regression coefficient impacted the selection accuracy specifically in the standard linear regression simulation, where the simulation results showed that as the sample size decreased, the selection accuracy decreased specifically for methods such as backward elimination, Bayesian, and the lasso, while that of the ridge selection operator the variable selection accuracy increased.
In the high dimensional simulation, the lasso outperformed the other variable selection methods, indicating its unique features to select sparse models even when the number of parameters exceeds the sample size.
However, the estimates were extremely biased for all methods investigated in the high dimensional regression simulation.
Though the computational times were not reported, the ridge selection operator was the slowest, especially for large sample sizes or number of predictors.
This indicates its unfeasibility in the practical world where sample sizes or number of predictors are large since it is too computationally costly.
In terms of prediction, overall, Bayesian g-prior variable selection and the lasso-fitted models performed better than those from the other variable selection methods.
In the last section we discuss the limitations and provide recommendations on using these methods.
Related Results
<em> Hox </em>Temporal Collinearity: Misleading Fallacy or Essential Developmental Mechanism?
<em> Hox </em>Temporal Collinearity: Misleading Fallacy or Essential Developmental Mechanism?
Kondo and collaborators recently reported the absence of Hox temporal collinearity in Xenopus tropicalis. They found none in the initiation of accumulation of Hox transcr...
Biclustering Models Under Collinearity in Simulated Biological Experiments
Biclustering Models Under Collinearity in Simulated Biological Experiments
Biclustering models allow simultaneous detection of group observations that are related to variables in a data matrix. Such methods have been applied in biological data for classif...
Bayesian regression modeling and inference of energy efficiency data: the effect of collinearity and sensitivity analysis
Bayesian regression modeling and inference of energy efficiency data: the effect of collinearity and sensitivity analysis
The majority of research predicted heating demand using linear regression models, but they did not give current building features enough context. Model problems such as Multicollin...
Comparative Analysis of Ridge, Bridge and Lasso Regression Models In the Presence of Multicollinearity
Comparative Analysis of Ridge, Bridge and Lasso Regression Models In the Presence of Multicollinearity
Aims: This research work investigated the best regression technique in handling multicollinearity using the Ridge, Least Absolute Shrinkage and Selection Operator (LASSO) and Bridg...
Use of The Principal Components in Case of The Problem multicollinearity (An Applied Study on the investment data in Sudan)
Use of The Principal Components in Case of The Problem multicollinearity (An Applied Study on the investment data in Sudan)
This study aims to apply the model Principal component Analysis to overcome the problem of multicollinearity which appears in many estimated models. The core of this method is repl...
Satisfaction in Counseling Alumni and Students
Satisfaction in Counseling Alumni and Students
The purpose of this study was to investigate satisfaction in counseling majors. The four independent variables investigated were gender, program status, employment status, and age....
The Effect of Product Quality and Service Quality on Customer Satisfaction at SLV Room Boutique
The Effect of Product Quality and Service Quality on Customer Satisfaction at SLV Room Boutique
The purpose of the study was to determine the effect of product quality and service quality on customer satisfaction at the SLV Room Boutique. The population in this study were con...

