Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Framework for Benefit-Based Multiclass Classification of Diseases

View through CrossRef
Abstract Purpose: Health datasets typically comprise of data that are heavily skewed towards the healthy class, thus resulting in classifiers erring towards this majority class. Due to this imbalance of data, traditional performance metrics, such as accuracy, are not appropriate for evaluating the performance of classifiers with the minority class (disease-affected). In addition, classifiers are trained under the assumption that the costs or benefits associated with different decision outcomes are equal. However, this is usually not the case with health data since there are different benefits/costs associated with the correct/incorrect identification of disease affected/unhealthy persons rather than healthy individuals. In this paper we address these problems by examining benefits/costs both when training and evaluating the performance of classifiers. Furthermore,we focus on multiclass classification where the outcome can be one of three or more options. Methods: We propose modifications to the Naive Bayes and Logistic Regression algorithms to incorporate costs and benefits when training for the multiclass scenario, as well as compare these to a recently proposed algorithm in the field, hierarchical cost-sensitive kernel logistic regression, and also an adapted hierarchical approach with our cost-benefit based logistic regression model. Wedemonstrate the effectiveness of all approaches for fetal health classification, vertebral column classification and hepatitis C/fibrosis/cirrhosis prediction. Results: Our proposed multiclass Logistic Regression algorithm outperformed all other algorithms, improving performance with the more critical classes. Conclusion: Our proposed multiclass Logistic Regression algorithm is robust and suitable for cases where costs and benefits of the various decision outcomes are important.
Research Square Platform LLC
Title: Framework for Benefit-Based Multiclass Classification of Diseases
Description:
Abstract Purpose: Health datasets typically comprise of data that are heavily skewed towards the healthy class, thus resulting in classifiers erring towards this majority class.
Due to this imbalance of data, traditional performance metrics, such as accuracy, are not appropriate for evaluating the performance of classifiers with the minority class (disease-affected).
In addition, classifiers are trained under the assumption that the costs or benefits associated with different decision outcomes are equal.
However, this is usually not the case with health data since there are different benefits/costs associated with the correct/incorrect identification of disease affected/unhealthy persons rather than healthy individuals.
In this paper we address these problems by examining benefits/costs both when training and evaluating the performance of classifiers.
Furthermore,we focus on multiclass classification where the outcome can be one of three or more options.
Methods: We propose modifications to the Naive Bayes and Logistic Regression algorithms to incorporate costs and benefits when training for the multiclass scenario, as well as compare these to a recently proposed algorithm in the field, hierarchical cost-sensitive kernel logistic regression, and also an adapted hierarchical approach with our cost-benefit based logistic regression model.
Wedemonstrate the effectiveness of all approaches for fetal health classification, vertebral column classification and hepatitis C/fibrosis/cirrhosis prediction.
Results: Our proposed multiclass Logistic Regression algorithm outperformed all other algorithms, improving performance with the more critical classes.
Conclusion: Our proposed multiclass Logistic Regression algorithm is robust and suitable for cases where costs and benefits of the various decision outcomes are important.

Related Results

Multiclass Classification of Thyroid gland data employing Linear Discriminant Analysis
Multiclass Classification of Thyroid gland data employing Linear Discriminant Analysis
Abstract With the advent of internet, complex data is tremendously increased. It is essential to analyze the data. Multiclass classification, has been an important problem ...
Multiclass Classification of Thyroid gland data employing Linear Discriminant Analysis
Multiclass Classification of Thyroid gland data employing Linear Discriminant Analysis
Abstract With the advent of internet, complex data is tremendously increased. It is essential to analyze the data. Multiclass classification, has been an important problem ...
Predicting Air Quality Based on Multiclass Machine Learning Techniques
Predicting Air Quality Based on Multiclass Machine Learning Techniques
India is among the most polluted nations with severe environmental implications of pollution increase in several bas cities. For the past few years’ Indian cities have witnessed an...
A Prediction of Network Intrusion Using CNN-LSTM
A Prediction of Network Intrusion Using CNN-LSTM
At present, network attacks have become a worldwide issue as they disturb the functioning and performance of the computer network. Network attacks are a serious problem they may ca...
Analyse von Orphan- Drug-Verfahren in der frühen Nutzenbewertung: RCTs versus best-verfügbare vergleichende Evidenz
Analyse von Orphan- Drug-Verfahren in der frühen Nutzenbewertung: RCTs versus best-verfügbare vergleichende Evidenz
For drugs intended to use in rare diseases (orphan drugs), the requirements of the early benefit assessment according to § 35a SGB V (Social Security Code, book 5) pose a particula...
Detecting Hope in Social Media Discourse Using Machine and Deep Learning Classifiers
Detecting Hope in Social Media Discourse Using Machine and Deep Learning Classifiers
Hope speech refers to messages that convey optimism, support, or expectations of a better future. With the increasing use of social media as a medium for self-expression, analysing...
Framework for Benefit-Based Multiclass Classification
Framework for Benefit-Based Multiclass Classification
Abstract Health datasets typically comprise of data that are heavily skewed towards the healthy class, thus resulting in classifiers being biased towards this majority clas...
Improving Medical Document Classification via Feature Engineering
Improving Medical Document Classification via Feature Engineering
<p dir="ltr">Document classification (DC) is the task of assigning the predefined labels to unseen documents by utilizing the model trained on the available labeled documents...

Back to Top