Javascript must be enabled to continue!
Intrinsically Interpretable Decision Trees for Healthcare Applications
View through CrossRef
Abstract
The deployment of machine learning for high-stakes decision support may demand algorithms that are intrinsically interpretable so that a model can be interrogated by a user to understand how the model arrived at an output given an input. Decision trees exemplify intrinsically interpretable machine learning but to achieve acceptable performance they rely on ensemble methods which sacrifice their interpretability because there is no single, or ‘canonical’ tree for the user to inspect. Sequential Monte Carlo Decision Trees are a novel method for training populations of decision trees that do not rely on ensembles or subspace sampling and achieve comparable performance to other tree-based methods. With SMC-DTs, each of the population of trees are trained on the entire feature space and each is a candidate for an intrinsically interpretable canonical decision tree. In this paper, we propose a tree-similarity algorithm to assist a user in choosing a final, canonical high-performing tree that is consistent (in structure) with other high-performing candidate trees. This enables the user to reduce the pool of candidates to a small number that can be inspected to select a final tree for deployment. We show that SMC-DTs are competitive with well-established methods Random Forest, Extreme Gradient Boosting Trees and Classification and Regression Trees on four benchmark health-related data sets. We then demonstrate the combination of SMC-DTs with the tree similarity method to derive and present an interpretable decision tree to identify university students likely to experience suicidal ideation using summary and administrative data university welfare services would be able to access for their students (i.e. without detailed clinical information). We conclude that combining SMC-DTs with tree-similarity offers a plausible way of delivering intrinsic interpretability for use in mixed-data type tabular data sets.
Springer Science and Business Media LLC
Title: Intrinsically Interpretable Decision Trees for Healthcare Applications
Description:
Abstract
The deployment of machine learning for high-stakes decision support may demand algorithms that are intrinsically interpretable so that a model can be interrogated by a user to understand how the model arrived at an output given an input.
Decision trees exemplify intrinsically interpretable machine learning but to achieve acceptable performance they rely on ensemble methods which sacrifice their interpretability because there is no single, or ‘canonical’ tree for the user to inspect.
Sequential Monte Carlo Decision Trees are a novel method for training populations of decision trees that do not rely on ensembles or subspace sampling and achieve comparable performance to other tree-based methods.
With SMC-DTs, each of the population of trees are trained on the entire feature space and each is a candidate for an intrinsically interpretable canonical decision tree.
In this paper, we propose a tree-similarity algorithm to assist a user in choosing a final, canonical high-performing tree that is consistent (in structure) with other high-performing candidate trees.
This enables the user to reduce the pool of candidates to a small number that can be inspected to select a final tree for deployment.
We show that SMC-DTs are competitive with well-established methods Random Forest, Extreme Gradient Boosting Trees and Classification and Regression Trees on four benchmark health-related data sets.
We then demonstrate the combination of SMC-DTs with the tree similarity method to derive and present an interpretable decision tree to identify university students likely to experience suicidal ideation using summary and administrative data university welfare services would be able to access for their students (i.
e.
without detailed clinical information).
We conclude that combining SMC-DTs with tree-similarity offers a plausible way of delivering intrinsic interpretability for use in mixed-data type tabular data sets.
Related Results
Perceptions of Telemedicine and Rural Healthcare Access in a Developing Country: A Case Study of Bayelsa State, Nigeria
Perceptions of Telemedicine and Rural Healthcare Access in a Developing Country: A Case Study of Bayelsa State, Nigeria
Abstract
Introduction
Telemedicine is the remote delivery of healthcare services using information and communication technologies and has gained global recognition as a solution to...
Autonomy on Trial
Autonomy on Trial
Photo by CHUTTERSNAP on Unsplash
Abstract
This paper critically examines how US bioethics and health law conceptualize patient autonomy, contrasting the rights-based, individualist...
PERSPECTIVES FOR COMPETITION IN THE HEALTHCARE INDUSTRY
PERSPECTIVES FOR COMPETITION IN THE HEALTHCARE INDUSTRY
A paradox has been established in the modern healthcare industry - consumers can choose between many alternatives but with high uncertainty, while healthcare establishments have nu...
A systematic review on the healthcare system in Jordan: Strengths, weaknesses, and opportunities for improvement
A systematic review on the healthcare system in Jordan: Strengths, weaknesses, and opportunities for improvement
Introduction: This systematic review examines the strengths and weaknesses of Jordan's healthcare system, providing valuable insights for healthcare providers, policymakers, and re...
KNOWLEDGE, ATTITUDES, AND PERCEPTIONS OF HEALTHCARE PROFESSIONALS ON THE USE OF ARTIFICIAL INTELLIGENCE IN HEALTHCARE – A CROSS-SECTIONAL STUDY
KNOWLEDGE, ATTITUDES, AND PERCEPTIONS OF HEALTHCARE PROFESSIONALS ON THE USE OF ARTIFICIAL INTELLIGENCE IN HEALTHCARE – A CROSS-SECTIONAL STUDY
ABSTRACT
Background: Artificial Intelligence (AI) has become an increasingly integral part of healthcare systems, offering the potential to enhance diagnosis, treatment, and patien...
The Hazards of Data Mining in Healthcare
The Hazards of Data Mining in Healthcare
From the mid-1990s, data mining methods have been used to explore and find patterns and relationships in healthcare data. During the 1990s and early 2000's, data mining was a topic...
Decision support tools for the management in a Dry Afromontane Forest in Ethiopia
Decision support tools for the management in a Dry Afromontane Forest in Ethiopia
Ethiopia is one of the tropical countries endowed with diverse forest formations. These forests provide large amounts of wood that can be used for furniture, construction, and dome...
Big Data in Healthcare
Big Data in Healthcare
improve quality in healthcare, it is very much important to store, manage and retrieve as well as use the data & information properly as it has great potential to help leaders ...

