Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Variable selection procedures under collinearity (multicollinearity)

View through CrossRef
Variable selection is an important area of statistical modeling, which is still an active area of research. In this study, we investigated the performance of four variable selection procedures: backward elimination, Bayesian, least absolute shrinkage selection operator (lasso), and ridge selection operator under collinearity. Specifically, we focused on the variable selection accuracy, prediction accuracy, and their ability to mitigate the impact of collinearity on variable selection and predictive accuracy. What we found is that sample size, and the strength of the regression coefficient impacted the selection accuracy specifically in the standard linear regression simulation, where the simulation results showed that as the sample size decreased, the selection accuracy decreased specifically for methods such as backward elimination, Bayesian, and the lasso, while that of the ridge selection operator the variable selection accuracy increased. In the high dimensional simulation, the lasso outperformed the other variable selection methods, indicating its unique features to select sparse models even when the number of parameters exceeds the sample size. However, the estimates were extremely biased for all methods investigated in the high dimensional regression simulation. Though the computational times were not reported, the ridge selection operator was the slowest, especially for large sample sizes or number of predictors. This indicates its unfeasibility in the practical world where sample sizes or number of predictors are large since it is too computationally costly. In terms of prediction, overall, Bayesian g-prior variable selection and the lasso-fitted models performed better than those from the other variable selection methods. In the last section we discuss the limitations and provide recommendations on using these methods.
University of Missouri Libraries
Title: Variable selection procedures under collinearity (multicollinearity)
Description:
Variable selection is an important area of statistical modeling, which is still an active area of research.
In this study, we investigated the performance of four variable selection procedures: backward elimination, Bayesian, least absolute shrinkage selection operator (lasso), and ridge selection operator under collinearity.
Specifically, we focused on the variable selection accuracy, prediction accuracy, and their ability to mitigate the impact of collinearity on variable selection and predictive accuracy.
What we found is that sample size, and the strength of the regression coefficient impacted the selection accuracy specifically in the standard linear regression simulation, where the simulation results showed that as the sample size decreased, the selection accuracy decreased specifically for methods such as backward elimination, Bayesian, and the lasso, while that of the ridge selection operator the variable selection accuracy increased.
In the high dimensional simulation, the lasso outperformed the other variable selection methods, indicating its unique features to select sparse models even when the number of parameters exceeds the sample size.
However, the estimates were extremely biased for all methods investigated in the high dimensional regression simulation.
Though the computational times were not reported, the ridge selection operator was the slowest, especially for large sample sizes or number of predictors.
This indicates its unfeasibility in the practical world where sample sizes or number of predictors are large since it is too computationally costly.
In terms of prediction, overall, Bayesian g-prior variable selection and the lasso-fitted models performed better than those from the other variable selection methods.
In the last section we discuss the limitations and provide recommendations on using these methods.

Related Results

<em> Hox </em>Temporal Collinearity: Misleading Fallacy or Essential Developmental Mechanism?
<em> Hox </em>Temporal Collinearity: Misleading Fallacy or Essential Developmental Mechanism?
Kondo and collaborators recently reported the absence of Hox temporal collinearity in Xenopus tropicalis. They found none in the initiation of accumulation of&nbsp; Hox transcr...
Biclustering Models Under Collinearity in Simulated Biological Experiments
Biclustering Models Under Collinearity in Simulated Biological Experiments
Biclustering models allow simultaneous detection of group observations that are related to variables in a data matrix. Such methods have been applied in biological data for classif...
Quantifying uncertainties of ecological forecasting under climate change
Quantifying uncertainties of ecological forecasting under climate change
Ecological forecasting is critical in understanding of ecological responses to climate change and is increasingly used in climate mitigation plans. The forecasts from correlative m...
Selection Gradients
Selection Gradients
Natural selection and sexual selection are important evolutionary processes that can shape the phenotypic distributions of natural populations and, consequently, a primary goal of ...
Poems
Poems
poems selection poems selection poems selection poems selection poems selection poems selection poems selection poems selection poems selection poems selection poems selection poem...
Pengaruh Kepemimpinan Kepala Sekolah, Lingkungan Kerja, dan Sarana Pembelajaran terhadap Kinerja Guru Melalui Motivasi Kerja
Pengaruh Kepemimpinan Kepala Sekolah, Lingkungan Kerja, dan Sarana Pembelajaran terhadap Kinerja Guru Melalui Motivasi Kerja
Penelitian ini mengkaji pengaruh kepemimpinan kepala sekolah, lingkungan sekolah, dan sarana pembelajaran terhadap kinerja guru SMAS Reformasi Plus, dengan motivasi guru sebagai va...
Bayesian regression modeling and inference of energy efficiency data: the effect of collinearity and sensitivity analysis
Bayesian regression modeling and inference of energy efficiency data: the effect of collinearity and sensitivity analysis
The majority of research predicted heating demand using linear regression models, but they did not give current building features enough context. Model problems such as Multicollin...

Back to Top