Javascript must be enabled to continue!
Leveraging Clinical Notes and Natural Language Processing for Dementia Detection (Preprint)
View through CrossRef
BACKGROUND
Routinely collected data (e.g. coded hospital data, clinical notes) are widely being used to develop dementia prevalence estimates. This is limited and often returns low sensitivity (around 0.14-0.26) because dementia is not always recorded in the hospital notes in such a way as to allow hospital coders to code for dementia.
OBJECTIVE
We aim to develop various NLP models with clinical notes for more accurate identification of dementia than using structured (coded) data.
METHODS
After filtering patients who were admitted before 2021 in Peninsula Health, the Australia National Care Heathy Ageing data team established three validation cohorts based on 1048 patients: (1) confirmed dementia, (2) confirmed non-dementia, and (3) possible/uncertain dementia. 62 dementia related terms and phrases were identified by medical experts, we extended the list with UMLS and generated 245 dementia related key concepts. For each patient, all the clinical notes were used. We applied a fine-tuned MedCat model on the clinical notes and generated the corresponding UMLS annotations for each patient. For model development, we developed four statistical Machine Learning models: Naive Bayes, Logistic Regression, Support Vector Machine and RandomForest, as well as a Deep Learning pipeline which combines text selection and a fine tuned ClinicalBioBERT. We evaluated the above models within two settings: fine grained (dementia, non-dementia, uncertain) and binary (dementia vs non-dementia) classification. 10 fold cross validation was used to evaluate the classifiers’ performance.
RESULTS
We found that the dementia patients had three times more medical notes than non-dementia patients (241-283 v.s. 66-74). In the dementia group >25% of documents were progress notes. Among the four statistical models, Random Forest performed the best for binary classification and Linear SVM performed the best on fine grained classification. The deep learning pipeline didn’t perform well, the main reason could be that the black box encoding of the ClinicalBioBERT results in heavy information loss.
CONCLUSIONS
The 245 dementia related UMLS concepts that we recognize are useful for dementia detection based on medical notes. Random Forest and Support Vector Machine achieved the best performance for binary and fine grained dementia classification. Deep learning models are not ready yet for the low resource biomedical classification tasks.
Title: Leveraging Clinical Notes and Natural Language Processing for Dementia Detection (Preprint)
Description:
BACKGROUND
Routinely collected data (e.
g.
coded hospital data, clinical notes) are widely being used to develop dementia prevalence estimates.
This is limited and often returns low sensitivity (around 0.
14-0.
26) because dementia is not always recorded in the hospital notes in such a way as to allow hospital coders to code for dementia.
OBJECTIVE
We aim to develop various NLP models with clinical notes for more accurate identification of dementia than using structured (coded) data.
METHODS
After filtering patients who were admitted before 2021 in Peninsula Health, the Australia National Care Heathy Ageing data team established three validation cohorts based on 1048 patients: (1) confirmed dementia, (2) confirmed non-dementia, and (3) possible/uncertain dementia.
62 dementia related terms and phrases were identified by medical experts, we extended the list with UMLS and generated 245 dementia related key concepts.
For each patient, all the clinical notes were used.
We applied a fine-tuned MedCat model on the clinical notes and generated the corresponding UMLS annotations for each patient.
For model development, we developed four statistical Machine Learning models: Naive Bayes, Logistic Regression, Support Vector Machine and RandomForest, as well as a Deep Learning pipeline which combines text selection and a fine tuned ClinicalBioBERT.
We evaluated the above models within two settings: fine grained (dementia, non-dementia, uncertain) and binary (dementia vs non-dementia) classification.
10 fold cross validation was used to evaluate the classifiers’ performance.
RESULTS
We found that the dementia patients had three times more medical notes than non-dementia patients (241-283 v.
s.
66-74).
In the dementia group >25% of documents were progress notes.
Among the four statistical models, Random Forest performed the best for binary classification and Linear SVM performed the best on fine grained classification.
The deep learning pipeline didn’t perform well, the main reason could be that the black box encoding of the ClinicalBioBERT results in heavy information loss.
CONCLUSIONS
The 245 dementia related UMLS concepts that we recognize are useful for dementia detection based on medical notes.
Random Forest and Support Vector Machine achieved the best performance for binary and fine grained dementia classification.
Deep learning models are not ready yet for the low resource biomedical classification tasks.
Related Results
Hubungan Perilaku Pola Makan dengan Kejadian Anak Obesitas
Hubungan Perilaku Pola Makan dengan Kejadian Anak Obesitas
<p><em><span style="font-size: 11.0pt; font-family: 'Times New Roman',serif; mso-fareast-font-family: 'Times New Roman'; mso-ansi-language: EN-US; mso-fareast-langua...
Učinak poučavanja razrednomu jeziku u izobrazbi nastavnika njemačkoga
Učinak poučavanja razrednomu jeziku u izobrazbi nastavnika njemačkoga
The actual use of classroom language is principally limited to the classroom environment. As far as foreign language learning is concerned, the classroom often turns out to be the ...
Understanding the knowledge and attitudes to dementia in Sub‐Saharan Africa: A systematic review
Understanding the knowledge and attitudes to dementia in Sub‐Saharan Africa: A systematic review
AbstractBackgroundA direct result of an ageing population is an increase in the prevalence of chronic non‐communicable diseases such as dementia. Creating dementia awareness and pr...
Prescription Patterns of Anti-dementia and Psychotropic Drugs in People Living With Dementia in China
Prescription Patterns of Anti-dementia and Psychotropic Drugs in People Living With Dementia in China
Abstract
Pharmacotherapy of dementia is a critical intervention for managing symptoms of and slowing progression of dementia. However, evidence on prescribing patter...
Increased life expectancy of heart failure patients in a rural center by a multidisciplinary program
Increased life expectancy of heart failure patients in a rural center by a multidisciplinary program
Abstract
Funding Acknowledgements
Type of funding sources: None.
INTRODUCTION Patients with heart failure (HF)...
Translating ‘dementia friends’ programme to undergraduate medical and nursing practice: a qualitative exploration
Translating ‘dementia friends’ programme to undergraduate medical and nursing practice: a qualitative exploration
Abstract
Introduction
Dementia awareness is a key priority of medical and nursing pre-registration education. The ‘dementia friends’ programme is an...
Dementia Subtypes: A Study From Dementia Clinic In A Referral Neuroscience Hospital, Bangladesh
Dementia Subtypes: A Study From Dementia Clinic In A Referral Neuroscience Hospital, Bangladesh
Background: Dementia is one of the major causes of disability and dependency among older people globally but still it is not explored very well in most parts of the world particula...
ENGAGE-DEM : a model of engagement of people with dementia
ENGAGE-DEM : a model of engagement of people with dementia
Dementia is a neurodegenerative disease that affects cognition, producing a reduction in thinking, problem-solving, and mnemonic abilities, functioning, preventing affected people ...

