
Online Learning with Survival Data

Decision-makers frequently utilize adaptive experiments to optimize time-to-event outcomes, such as accelerating healthcare screenings or delaying customer churn. Traditional multi-armed bandit algorithms fail in these settings because they assume outcome delays are non-informative, leading practitioners to rely on dichotomization, a heuristic that collapses continuous timing into binary outcomes at a fixed threshold. We introduce "survival bandits," a principled class of algorithms that integrate the Cox proportional hazards model to utilize the full temporal signal of every event. We analytically contrast the regret of both approaches under a large-n limit with Weibull-distributed survival times satisfying the proportional hazards assumption. Our analytical results prove that dichotomization imposes substantial inefficiency, increasing regret by 41% to 54% even under optimal parameterization. We further demonstrate that survival bandits are uniquely robust to event rate uncertainty, whereas dichotomized approaches suffer significant performance degradation due to fragile threshold dependencies. Even when the proportional hazards assumption is violated, we show that survival bandits effectively identify the best arm under realistic scenarios. Simulations using real-world cervical cancer screening data validate our findings, demonstrating that survival bandits consistently reduce regret relative to the best-performing dichotomized algorithms.
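To make the dichotomization heuristic concrete, the following is a minimal Python sketch under stated assumptions, not the paper's algorithm. It assumes the Weibull parameterization S(t) = exp(-rate * t^shape), under which two arms with a shared shape and different rates satisfy proportional hazards. The function names, arm labels, and parameter values are hypothetical and chosen only for illustration. It does not implement survival bandits or the Cox model.

```python
import math
import random


def weibull_event_prob(rate, shape, threshold):
    """P(T <= threshold) when S(t) = exp(-rate * t**shape) (assumed parameterization)."""
    return 1.0 - math.exp(-rate * threshold ** shape)


def sample_weibull_time(rate, shape, rng):
    """Draw an event time by inverting the survival function."""
    u = 1.0 - rng.random()  # lies in (0, 1], so log(u) is always defined
    return (-math.log(u) / rate) ** (1.0 / shape)


def dichotomize(times, threshold):
    """Collapse each event time into 1 (event by the threshold) or 0 (no event yet)."""
    return [1 if t <= threshold else 0 for t in times]


if __name__ == "__main__":
    rng = random.Random(0)
    shape, threshold = 1.5, 5.0  # illustrative values, not taken from the paper
    # Hypothetical arms: same shape, rates differing by a hazard ratio of 1.5.
    rates = {"control": 0.05, "treatment": 0.075}
    for arm, rate in rates.items():
        times = [sample_weibull_time(rate, shape, rng) for _ in range(1000)]
        observed = sum(dichotomize(times, threshold)) / len(times)
        expected = weibull_event_prob(rate, shape, threshold)
        print(f"{arm}: observed {observed:.3f}, expected {expected:.3f}")
```

After dichotomization, an event at time 0.1 and an event at time 4.9 are recorded identically, which is the timing information the abstract says a survival-based approach retains.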

Related Results

Initial Experience with Pediatrics Online Learning for Nonclinical Medical Students During the COVID-19 Pandemic 
Abstract Background: To minimize the risk of infection during the COVID-19 pandemic, the learning mode of universities in China has been adjusted, and the online learning o...
E-Learning
Today, e-learning is an indispensable part of every teaching setting. In all areas, from school through vocational to university education, and especially in the area of...
NURSING STUDENTS’ LEARNING EXPERIENCES IN AN ONLINE LEARNING COURSE
To improve the quality of online learning in Indonesian higher education, the Faculty of Nursing (FoN), Universitas Pelita Harapan (UPH), supported by the Directorate of Higher ...
Systematics of Literature Reviews: Learning Model of Discovery Learning in Science Learning
The development of the 21st century has affected the world of education. Current education students must be led to learn more creatively and actively. This study aims Furthermore, ...
ONLINE INSTRUCTIONAL STRATEGIES
Online instructional strategies refer to the methods and approaches that guide the organization of learning activities, course content, and student engagement in online courses. St...
IDENTIFYING BARRIERS IN E – LEARNING, A MEDICAL STUDENT’S PERSPECTIVE
Objective: To recognize the barriers in different modes of e-learning from the medical student’s perspective during the COVID-19 pandemic. Study Desi...
Online Learning Self-Efficacy during an Emergent Transition: A Cross-sectional Survey among Undergraduate Students in Saudi Arabia
Introduction: The Coronavirus Disease-2019 (COVID-19) pandemic significantly affected higher education, necessitating a sudden shift to virtual classes in response to COVID-19 rest...
