Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Application of Principal Component Analysis in Dealing with Multicollinearity in Modelling Clinical Data

View through CrossRef
Introduction: One of the stringent assumptions about covariates in the Cox hazard and Logistic regression modelling is that they should be independent. Incorporating correlated covariates as such into the model might distort the precision of the estimates due to multicollinearity. One way to deal with multicollinearity is by using Principal Component Analysis (PCA) technique. Aim: To demonstrate the application of PCA in dealing with correlated covariates while modelling time to event and casecontrol study data. Materials and Methods: This study was conducted at Jawaharlal Institute of Postgraduate Medical Education and Research, Puducherry, India, from February 2021 to January 2022. Two datasets were used for the demonstration i.e., data relates to a time to event outcome and a case-control study with binary outcome in which lipids were the correlated covariates. Three sets of Cox regression models were used to demonstrate change in hazard ratios with 95% Confidence Intervals (CI) for evaluating the effect of intervention at a different time of lipid measurement. Model I has evaluated treatment/ Body Mass Index (BMI) effect on the outcome by ignoring the effect of lipid parameters. Model II has evaluated treatment/ BMI effect on the outcome by incorporating lipid variables but ignoring multicollinearity. Model III has evaluated treatment/ BMI effect on the outcome by incorporating lipid variables through principal component analysis and thus adjusting for multicollinearity. Similarly, a logistic regression model was performed by using the same three sets of models to evaluate the effect of exposure (BMI). The comparability of lipids between the two groups for both datasets was tested using Hotelling’s T-squared statistic. Results: The lipids measured at 12th, 24th and 36th months between the two groups in the first data set as well as between cases and controls in the second data set were statistically significant. In the first dataset, at baseline, the Hazard Ratio’s (HR’s) were statistically similar irrespective of the models used; while decreasing successively with narrowing 95% CI’s as moving from model I to model III for the lipid measured at 12th, 24th and 36th months. Further, at 24th and 36th months, the HR in model-III found to be significant. In the second data set, the Odds Ratio (OR) were significant for all the three models and it was almost similar for model I and II but in model III it was elevated. Conclusion: The multicollinearity issue should be properly addressed before including correlated covariates in the Cox regression hazard and Logistic regression model. The PCA technique would be a favourable method.
Title: Application of Principal Component Analysis in Dealing with Multicollinearity in Modelling Clinical Data
Description:
Introduction: One of the stringent assumptions about covariates in the Cox hazard and Logistic regression modelling is that they should be independent.
Incorporating correlated covariates as such into the model might distort the precision of the estimates due to multicollinearity.
One way to deal with multicollinearity is by using Principal Component Analysis (PCA) technique.
Aim: To demonstrate the application of PCA in dealing with correlated covariates while modelling time to event and casecontrol study data.
Materials and Methods: This study was conducted at Jawaharlal Institute of Postgraduate Medical Education and Research, Puducherry, India, from February 2021 to January 2022.
Two datasets were used for the demonstration i.
e.
, data relates to a time to event outcome and a case-control study with binary outcome in which lipids were the correlated covariates.
Three sets of Cox regression models were used to demonstrate change in hazard ratios with 95% Confidence Intervals (CI) for evaluating the effect of intervention at a different time of lipid measurement.
Model I has evaluated treatment/ Body Mass Index (BMI) effect on the outcome by ignoring the effect of lipid parameters.
Model II has evaluated treatment/ BMI effect on the outcome by incorporating lipid variables but ignoring multicollinearity.
Model III has evaluated treatment/ BMI effect on the outcome by incorporating lipid variables through principal component analysis and thus adjusting for multicollinearity.
Similarly, a logistic regression model was performed by using the same three sets of models to evaluate the effect of exposure (BMI).
The comparability of lipids between the two groups for both datasets was tested using Hotelling’s T-squared statistic.
Results: The lipids measured at 12th, 24th and 36th months between the two groups in the first data set as well as between cases and controls in the second data set were statistically significant.
In the first dataset, at baseline, the Hazard Ratio’s (HR’s) were statistically similar irrespective of the models used; while decreasing successively with narrowing 95% CI’s as moving from model I to model III for the lipid measured at 12th, 24th and 36th months.
Further, at 24th and 36th months, the HR in model-III found to be significant.
In the second data set, the Odds Ratio (OR) were significant for all the three models and it was almost similar for model I and II but in model III it was elevated.
Conclusion: The multicollinearity issue should be properly addressed before including correlated covariates in the Cox regression hazard and Logistic regression model.
The PCA technique would be a favourable method.

Related Results

Comparative Analysis of Ridge, Bridge and Lasso Regression Models In the Presence of Multicollinearity
Comparative Analysis of Ridge, Bridge and Lasso Regression Models In the Presence of Multicollinearity
Aims: This research work investigated the best regression technique in handling multicollinearity using the Ridge, Least Absolute Shrinkage and Selection Operator (LASSO) and Bridg...
Use of The Principal Components in Case of The Problem multicollinearity (An Applied Study on the investment data in Sudan)
Use of The Principal Components in Case of The Problem multicollinearity (An Applied Study on the investment data in Sudan)
This study aims to apply the model Principal component Analysis to overcome the problem of multicollinearity which appears in many estimated models. The core of this method is repl...
Advanced Financial Modelling and Analysis
Advanced Financial Modelling and Analysis
Abstract: This chapter, "Advanced Financial Modelling and Analysis," provides an in-depth exploration of the principles, techniques, and applications of financial modelling in the ...
A Monte Carlo Experiment on the Asymptotic Efficiency of System Estimators under Multicollinearity
A Monte Carlo Experiment on the Asymptotic Efficiency of System Estimators under Multicollinearity
Classical linear regression model assumes that there is no multicollinearity among the explanatory variables in a regression model. Contrary to this assumption, where multicollinea...
[RETRACTED] Keanu Reeves CBD Gummies v1
[RETRACTED] Keanu Reeves CBD Gummies v1
[RETRACTED]Keanu Reeves CBD Gummies ==❱❱ Huge Discounts:[HURRY UP ] Absolute Keanu Reeves CBD Gummies (Available)Order Online Only!! ❰❰= https://www.facebook.com/Keanu-Reeves-CBD-G...
Complex Collision Tumors: A Systematic Review
Complex Collision Tumors: A Systematic Review
Abstract Introduction: A collision tumor consists of two distinct neoplastic components located within the same organ, separated by stromal tissue, without histological intermixing...
Modular Circuit Synthesis Oriented Modelling Approach for Non-Isolated DC-DC Converters in DCM
Modular Circuit Synthesis Oriented Modelling Approach for Non-Isolated DC-DC Converters in DCM
The continued and envisioned large-scale integration of renewable energy sources as a reaction to rising global temperatures and climate change will need a readily available DC gri...
Statistical evaluation of research performance of young university scholars: A case study
Statistical evaluation of research performance of young university scholars: A case study
Abstract The research performance of a small group of 49 young scholars, such as doctoral students, postdoctoral and junior researchers, working in different technical and scientif...

Back to Top