
A novel method for handling missing data in health care real-world study: Optimal Intact Subset Method

Abstract: Handling missing data is indispensable in health-care real-world data processing. Imputation methods may introduce error and multicollinearity. We therefore explored the Optimal Intact Subset Method (OIS.Method) to avoid these issues. By searching for an optimal way of deleting the columns and rows that contain missing data, the method determines a subset that retains most of the information in the original dataset. In principle, every possible deletion scheme could be enumerated, but the computational cost is too high for large datasets. OIS.Method instead uses an indicator to determine the optimal deletion order, which identifies the optimal deletion scheme while simplifying the computation. To validate the effectiveness of OIS.Method, we compared it with five other missing-data handling methods on simulated real-world classification datasets. Additionally, we validated OIS.Method on two real-world classification tasks. On the simulated datasets, OIS.Method performed best (highest AUC of 1). On the real-world datasets, OIS.Method achieved better classification performance. For example, the AUC for OIS.Method vs. Simple Impute vs. Random Forest vs. Modified Random Forest was 0.8179±0.0005 vs. 0.8116±0.0002 vs. 0.8087±0.0009 vs. 0.8093±0.0014 in task 1, and 0.7028±0.0126 vs. 0.6963±0.0231 vs. 0.6957±0.0247 vs. 0.6699±0.0249 in task 2. OIS.Method requires less computation and is well suited to large real-world datasets.
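The idea of following a fixed deletion order instead of enumerating every deletion scheme can be sketched as below. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not define the indicator, so the sketch assumes columns are dropped in order of descending missing count and scores each candidate subset as (number of intact rows) × (number of kept columns). The function name `optimal_intact_subset` and the use of `None` for a missing value are likewise hypothetical.

```python
from typing import List, Optional, Sequence, Tuple

Row = Sequence[Optional[float]]


def optimal_intact_subset(rows: List[Row]) -> Tuple[List[int], List[int]]:
    """Return (kept_row_indices, kept_col_indices) of a fully observed subset.

    Assumptions (not taken from the paper): columns are dropped in order of
    descending missing count, and retained information is scored as
    intact rows x kept columns.
    """
    if not rows:
        return [], []
    n_cols = len(rows[0])
    missing_per_col = [sum(r[c] is None for r in rows) for c in range(n_cols)]
    # Hypothetical deletion order: most-missing columns first.
    drop_order = sorted(range(n_cols), key=lambda c: -missing_per_col[c])

    best_score, best_rows, best_cols = -1, [], []
    kept_cols = set(range(n_cols))
    # Drop 0, 1, ..., n_cols - 1 columns along the fixed order and keep
    # the candidate with the highest score.
    for k in range(n_cols):
        if k > 0:
            kept_cols.discard(drop_order[k - 1])
        # A row is intact when it has no missing value in the kept columns.
        intact = [i for i, r in enumerate(rows)
                  if all(r[c] is not None for c in kept_cols)]
        score = len(intact) * len(kept_cols)
        if score > best_score:
            best_score, best_rows, best_cols = score, intact, sorted(kept_cols)
    return best_rows, best_cols


data = [
    [1.0, 2.0, None],
    [3.0, 4.0, None],
    [5.0, None, 6.0],
    [7.0, 8.0, None],
]
kept_rows, kept_cols = optimal_intact_subset(data)
print(kept_rows, kept_cols)
```

With a fixed order, the search evaluates one candidate per column rather than one per subset of columns, which is where the saving over full enumeration comes from in this sketch.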

Related Results

Handling Missing Data in COVID-19 Incidence Estimation: Secondary Data Analysis
Abstract: Background: The COVID-19 pandemic has revealed significant challenges in disease forecasting and in developing a public health response, ...
ACKNOWLEDGMENTS
The UP Manila Health Policy Development Hub recognizes the invaluable contribution of the participants in the series of roundtable discussions listed below: RTD: Beyond Hospit...
[RETRACTED] Optimal Max Keto - Does It Really Work? v1
[RETRACTED] Shedding the unwanted weight and controlling the calories of your body is the most challenging and complicated process. As we start aging, we have to deal with lots of...
Long-range superharmonic Josephson current and spin-triplet pairing correlations in a junction with ferromagnetic bilayers
Abstract: The long-range spin-triplet supercurrent transport is an interesting phenomenon in the superconductor/ferromagnet ("Equation missing") heterostructure containing noncolline...
Ehealth Communication
Ehealth, also known as E-health, is a relatively new area of health communication inquiry that examines the development, implementation, and application of a broad range of evolvin...
Evaluating Safe Patient Handling Systems: Is There a Better Way?
The literature presented here shows that injuries suffered by staff and patients due to patient handling are preventable but patient handling injuries to health care worke...
Autonomy on Trial
Abstract: This paper critically examines how US bioethics and health law conceptualize patient autonomy, contrasting the rights-based, individualist...
