Javascript must be enabled to continue!
Online Learning with Survival Data
View through CrossRef
Decision-makers frequently utilize adaptive experiments to optimize time-to-event outcomes, such as accelerating healthcare screenings or delaying customer churn. Traditional multi-armed bandit algorithms fail in these settings because they assume outcome delays are non-informative, leading practitioners to rely on dichotomization, a heuristic that collapses continuous timing into binary outcomes at a fixed threshold. We introduce "survival bandits," a principled class of algorithms that integrate the Cox proportional hazards model to utilize the full temporal signal of every event. We analytically contrast the regret of both approaches under a large-n limit with Weibull-distributed survival times satisfying the proportional hazards assumption. Our analytical results prove that dichotomization imposes substantial inefficiency, increasing regret by 41% to 54% even under optimal parameterization. We further demonstrate that survival bandits are uniquely robust to event rate uncertainty, whereas dichotomized approaches suffer significant performance degradation due to fragile threshold dependencies. Even when the proportional hazards assumption is violated, we show that survival bandits effectively identify the best arm under realistic scenarios. Simulations using real-world cervical cancer screening data validate our findings, demonstrating that survival bandits consistently reduce regret relative to the best-performing dichotomized algorithms.
Title: Online Learning with Survival Data
Description:
Decision-makers frequently utilize adaptive experiments to optimize time-to-event outcomes, such as accelerating healthcare screenings or delaying customer churn.
Traditional multi-armed bandit algorithms fail in these settings because they assume outcome delays are non-informative, leading practitioners to rely on dichotomization, a heuristic that collapses continuous timing into binary outcomes at a fixed threshold.
We introduce "survival bandits," a principled class of algorithms that integrate the Cox proportional hazards model to utilize the full temporal signal of every event.
We analytically contrast the regret of both approaches under a large-n limit with Weibull-distributed survival times satisfying the proportional hazards assumption.
Our analytical results prove that dichotomization imposes substantial inefficiency, increasing regret by 41% to 54% even under optimal parameterization.
We further demonstrate that survival bandits are uniquely robust to event rate uncertainty, whereas dichotomized approaches suffer significant performance degradation due to fragile threshold dependencies.
Even when the proportional hazards assumption is violated, we show that survival bandits effectively identify the best arm under realistic scenarios.
Simulations using real-world cervical cancer screening data validate our findings, demonstrating that survival bandits consistently reduce regret relative to the best-performing dichotomized algorithms.
Related Results
CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
The pandemic Covid-19 currently demands teachers to be able to use technology in teaching and learning process. But in reality there are still many teachers who have not been able ...
Initial Experience with Pediatrics Online Learning for Nonclinical Medical Students During the COVID-19 Pandemic
Initial Experience with Pediatrics Online Learning for Nonclinical Medical Students During the COVID-19 Pandemic
Abstract
Background: To minimize the risk of infection during the COVID-19 pandemic, the learning mode of universities in China has been adjusted, and the online learning o...
Small Cell Lung Cancer and Tarlatamab: A Meta-Analysis of Clinical Trials
Small Cell Lung Cancer and Tarlatamab: A Meta-Analysis of Clinical Trials
Abstract
Introduction
Tarlatamab is a Delta-like ligand 3 (DLL3) -directed bispecific T-cell engager recently approved for use in patients with advanced small cell lung cancer (SCL...
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
BACKGROUND
As of July 2020, a Web of Science search of “machine learning (ML)” nested within the search of “pharmacokinetics or pharmacodynamics” yielded over 100...
Abstract P1-20-19: Local therapy and its association on survival outcomes in patients with de novo stage IV breast cancer: Results from a retrospective cohort study
Abstract P1-20-19: Local therapy and its association on survival outcomes in patients with de novo stage IV breast cancer: Results from a retrospective cohort study
Abstract
Background: Stage IV breast cancer was considered to be an incurable disease with a poor survival outcome. Among them, patients with de novo stage IV breast...
Online simulated workplace tasks to enhance business English learning
Online simulated workplace tasks to enhance business English learning
PurposeThis paper aims to provide evidence that online well-designed educational tasks can provide more relevant and richer active learning environment for business English learner...
Speaking Test Apps Development: English Learning Enhancement in Online Learning Complexity
Speaking Test Apps Development: English Learning Enhancement in Online Learning Complexity
Abstract. This research tried to map the strength and weaknesses and evaluate the English teaching and learning processes in Balinese EFL learners during the online class. The use ...
HUMAN RESOURCE DEVELOPMENT INNOVATION IN CITIES
HUMAN RESOURCE DEVELOPMENT INNOVATION IN CITIES
Abstract. Humanity is currently in deep sadness due to the outbreak of Coronavirus Disease (Covid-19) which has hit almost all parts of the world. Indonesia spontaneously implement...

