Javascript must be enabled to continue!
Optimizing Lung Cancer Risk Prediction with Advanced Machine Learning Algorithms and Techniques
View through CrossRef
Lung cancer is among the leading causes of cancer death in the U.S.A. as well as globally and causes more deaths than breast, prostate, and colorectal cancers combined. It thus presents a significant health burden globally, with an estimated new case diagnosed and death toll at 2.2 and 1.8 million annually, respectively. Given the complexity of the etiology of lung cancer, there is a real urgent need for more accurate and reliable prediction models with the capability to integrate diverse risk factors. While current modalities for screening and imaging clinical conditions are effective, they are often costly and invasive. The study's main objective was to develop and evaluate machine learning models, using integrated demographic, environmental, and lifestyle variables for predicting lung cancer risk. The source of dataset for lung cancer risk prediction was retrieved from multiple sources, particularly, Cleveland hospital records as well as public health databases in the U.S; Besides, we also used large-scale epidemiology studies such as the National Lung Screening Trial (NLST) or the Prostate, Lung, Colorectal, and Ovarian (PLCO) Cancer Screening Trial. These sources provided invaluable datasets to which machine learning models were developed, as they contained very valuable information on demographic data, past medical history, lifestyle habits, and clinical symptoms. In this study, the experiment used 3 machine learning algorithms: Logistic Regression, XG-Boost, and Random Forest. Accuracy, precision, recall, as well as F1 score, are used as performance metrics. Overall, the performance of the Logistic Regression model surpassed the Random Forest and XG-Boost models. It had the highest scores in all the metrics, particularly, accuracy, precision, recall, and F1 score. This is indicative that the model Logistic Regression was slightly better at balancing the true positives and false positives and false negatives. The Random Forest model exemplified an intermediate performance, positioning itself second to the Logistic Regression. A significant volume of empirical studies has established that the different machine learning techniques, such as Logistic Regression and Random Forest considerably improve the detection of lung cancer. Although logistic regression, due to its simplicity and interpretability, remains very useful, Random Forest and XG-Boost are much more capable of modeling difficult nonlinear interactions in high-dimensional data. Advanced models like these will provide far more accurate, personalized risk estimates and have the potential to be a powerful contribution to early detection and better clinical decisions regarding lung cancer.
Al-Kindi Center for Research and Development
Title: Optimizing Lung Cancer Risk Prediction with Advanced Machine Learning Algorithms and Techniques
Description:
Lung cancer is among the leading causes of cancer death in the U.
S.
A.
as well as globally and causes more deaths than breast, prostate, and colorectal cancers combined.
It thus presents a significant health burden globally, with an estimated new case diagnosed and death toll at 2.
2 and 1.
8 million annually, respectively.
Given the complexity of the etiology of lung cancer, there is a real urgent need for more accurate and reliable prediction models with the capability to integrate diverse risk factors.
While current modalities for screening and imaging clinical conditions are effective, they are often costly and invasive.
The study's main objective was to develop and evaluate machine learning models, using integrated demographic, environmental, and lifestyle variables for predicting lung cancer risk.
The source of dataset for lung cancer risk prediction was retrieved from multiple sources, particularly, Cleveland hospital records as well as public health databases in the U.
S; Besides, we also used large-scale epidemiology studies such as the National Lung Screening Trial (NLST) or the Prostate, Lung, Colorectal, and Ovarian (PLCO) Cancer Screening Trial.
These sources provided invaluable datasets to which machine learning models were developed, as they contained very valuable information on demographic data, past medical history, lifestyle habits, and clinical symptoms.
In this study, the experiment used 3 machine learning algorithms: Logistic Regression, XG-Boost, and Random Forest.
Accuracy, precision, recall, as well as F1 score, are used as performance metrics.
Overall, the performance of the Logistic Regression model surpassed the Random Forest and XG-Boost models.
It had the highest scores in all the metrics, particularly, accuracy, precision, recall, and F1 score.
This is indicative that the model Logistic Regression was slightly better at balancing the true positives and false positives and false negatives.
The Random Forest model exemplified an intermediate performance, positioning itself second to the Logistic Regression.
A significant volume of empirical studies has established that the different machine learning techniques, such as Logistic Regression and Random Forest considerably improve the detection of lung cancer.
Although logistic regression, due to its simplicity and interpretability, remains very useful, Random Forest and XG-Boost are much more capable of modeling difficult nonlinear interactions in high-dimensional data.
Advanced models like these will provide far more accurate, personalized risk estimates and have the potential to be a powerful contribution to early detection and better clinical decisions regarding lung cancer.
Related Results
Abstract 1345: Evidence for genetic mediation of lung cancer through hay fever.
Abstract 1345: Evidence for genetic mediation of lung cancer through hay fever.
Abstract
Introduction: In the past decade, advances in genetics have led to the discovery of numerous lung cancer susceptibility variants. The majority of these vari...
Are Cervical Ribs Indicators of Childhood Cancer? A Narrative Review
Are Cervical Ribs Indicators of Childhood Cancer? A Narrative Review
Abstract
A cervical rib (CR), also known as a supernumerary or extra rib, is an additional rib that forms above the first rib, resulting from the overgrowth of the transverse proce...
Edoxaban and Cancer-Associated Venous Thromboembolism: A Meta-analysis of Clinical Trials
Edoxaban and Cancer-Associated Venous Thromboembolism: A Meta-analysis of Clinical Trials
Abstract
Introduction
Cancer patients face a venous thromboembolism (VTE) risk that is up to 50 times higher compared to individuals without cancer. In 2010, direct oral anticoagul...
Abstract OI-1: OI-1 Decoding breast cancer predisposition genes
Abstract OI-1: OI-1 Decoding breast cancer predisposition genes
Abstract
Women with one or more first-degree female relatives with a history of breast cancer have a two-fold increased risk of developing breast cancer. This risk i...
Integrating quantum neural networks with machine learning algorithms for optimizing healthcare diagnostics and treatment outcomes
Integrating quantum neural networks with machine learning algorithms for optimizing healthcare diagnostics and treatment outcomes
The rapid advancements in artificial intelligence (AI) and quantum computing have catalyzed an unprecedented shift in the methodologies utilized for healthcare diagnostics and trea...
Prediction using Machine Learning
Prediction using Machine Learning
This chapter begins with a concise introduction to machine learning and the
classification of machine learning systems (supervised learning, unsupervised learning,
and reinforcemen...
Abstract 1657: Genome-wide association study of lung cancer: Variation in TP63 gene confers the risk of lung adenocarcinoma
Abstract 1657: Genome-wide association study of lung cancer: Variation in TP63 gene confers the risk of lung adenocarcinoma
Abstract
Lung cancer is the most common cause of death from cancer worldwide, and its incidence is increasing in East Asian and Western countries. Lung cancer compri...
Microwave Ablation with or Without Chemotherapy in Management of Non-Small Cell Lung Cancer: A Systematic Review
Microwave Ablation with or Without Chemotherapy in Management of Non-Small Cell Lung Cancer: A Systematic Review
Abstract
Introduction
Microwave ablation (MWA) has emerged as a minimally invasive treatment for patients with inoperable non-small cell lung cancer (NSCLC). However, whether it i...

