Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Abstract 13403: Data Nuggets for Matching Large Clinical Datasets: Application in a 350,000 Record Perinatal Database

View through CrossRef
Introduction: Research based on observational designs often include examination of outcome assessments across sub-populations. The sub-populations can be small, thereby potentially losing representativeness of the target population. The exact, approximate, and propensity score matching are popular methods to address this issue but become inefficient for large data sets. Hypothesis: Data Nuggets will produce matching results that are similar to conventional methods while performing the matching orders of magnitude faster. Methods: Data Nuggets is a novel data reduction technique that preserves the structure of the data by creating a collection of representative data points and contains the information about the centers, scale and weight for each group represented by a nugget. Observational data and simulations were used to show that matching data nuggets instead of individual patients is more efficient due to using data that is orders of magnitude smaller than the original and corrects for bias. We tested a Data Nuggets matching algorithm in a perinatal database with over 350,000 records with varying number of nuggets (between 100 and 800) against a conventional matching and full model fitting. We fit models to examine the association of preeclampsia on gestational duration, after adjusting or matching on age, race and body mass-index, and infant sex. All variables were scaled before fitting the models. Results: Models with a few hundred data nuggets produced results similar to those using conventional matching. Estimates of coefficients predicting gestational age via pre-eclampsia ranged from -0.73 weeks (100 nuggets) to -0.71 weeks (1,200 nuggets) compared to -0.72 (SE = 0.01) in the approximate matching and -0.80 (SE = 0.01) in the full (unmatched) data model (figure). Conclusions: Data Nuggets matching achieved results similar to those produced by the conventional matching method and is preferable for large data sets because of speed and efficiency.
Title: Abstract 13403: Data Nuggets for Matching Large Clinical Datasets: Application in a 350,000 Record Perinatal Database
Description:
Introduction: Research based on observational designs often include examination of outcome assessments across sub-populations.
The sub-populations can be small, thereby potentially losing representativeness of the target population.
The exact, approximate, and propensity score matching are popular methods to address this issue but become inefficient for large data sets.
Hypothesis: Data Nuggets will produce matching results that are similar to conventional methods while performing the matching orders of magnitude faster.
Methods: Data Nuggets is a novel data reduction technique that preserves the structure of the data by creating a collection of representative data points and contains the information about the centers, scale and weight for each group represented by a nugget.
Observational data and simulations were used to show that matching data nuggets instead of individual patients is more efficient due to using data that is orders of magnitude smaller than the original and corrects for bias.
We tested a Data Nuggets matching algorithm in a perinatal database with over 350,000 records with varying number of nuggets (between 100 and 800) against a conventional matching and full model fitting.
We fit models to examine the association of preeclampsia on gestational duration, after adjusting or matching on age, race and body mass-index, and infant sex.
All variables were scaled before fitting the models.
Results: Models with a few hundred data nuggets produced results similar to those using conventional matching.
Estimates of coefficients predicting gestational age via pre-eclampsia ranged from -0.
73 weeks (100 nuggets) to -0.
71 weeks (1,200 nuggets) compared to -0.
72 (SE = 0.
01) in the approximate matching and -0.
80 (SE = 0.
01) in the full (unmatched) data model (figure).
Conclusions: Data Nuggets matching achieved results similar to those produced by the conventional matching method and is preferable for large data sets because of speed and efficiency.

Related Results

Effect of Bacopa monnieri Extract on Storage and Microbial Quality of Vacuum Packaged Chicken Nuggets
Effect of Bacopa monnieri Extract on Storage and Microbial Quality of Vacuum Packaged Chicken Nuggets
The present study was done to assess the antioxidant potential of a herb viz. Bacopa monnieri L. in enhancing the shelf-life as well as adding function to chicken nuggets. Meat pro...
Timing of perinatal death; causes, circumstances, and regional variations among reviewed deaths in Ethiopia
Timing of perinatal death; causes, circumstances, and regional variations among reviewed deaths in Ethiopia
Introduction Ethiopia is one of the countries facing a very high burden of perinatal death in the world. Despite taking several measures to reduce the burden of stillbirth, the pac...
Italian Ornithological Commission (COI) - Report 30
Italian Ornithological Commission (COI) - Report 30
Italian Ornithological Commission (COI) - Report 30. This report refers to records from January 1st 2020 to December 31st 2021, with the addition of a number of records from previo...
Hubungan asfiksia perinatal dengan gangguan fungsi sel rambut luar koklea
Hubungan asfiksia perinatal dengan gangguan fungsi sel rambut luar koklea
Latar belakang: Bayi baru lahir dengan asfiksia perinatal dapat mengalami gangguan fungsi sel rambut luar  pada kokleanya. Tujuan:  Mengetahui   hubungan   asfiksia  perinatal  den...
DEVELOPMENT AND QUALITY EVALUATION OF CHICKEN NUGGETS USING FISH MEAT
DEVELOPMENT AND QUALITY EVALUATION OF CHICKEN NUGGETS USING FISH MEAT
Background: Chicken nuggets are widely consumed convenience meat products but are nutritionally limited in omega-3 long-chain polyunsaturated fatty acids, particularly eicosapentae...
Microbiological and Storage Quality Attributes Analysis of Papaver somniferum (Poppy) Fortified Fish Nuggets
Microbiological and Storage Quality Attributes Analysis of Papaver somniferum (Poppy) Fortified Fish Nuggets
Efficacy of ground poppy seed paste fortification in fish nuggets was analyzed. The use of ground poppy seed in fish nuggets formulation had no effect on moisture and protein conte...

Back to Top