Javascript must be enabled to continue!

Online Learning with Survival Data

Decision-makers frequently utilize adaptive experiments to optimize time-to-event outcomes, such as accelerating healthcare screenings or delaying customer churn. Traditional multi-armed bandit algorithms fail in these settings because they assume outcome delays are non-informative, leading practitioners to rely on dichotomization, a heuristic that collapses continuous timing into binary outcomes at a fixed threshold. We introduce "survival bandits," a principled class of algorithms that integrate the Cox proportional hazards model to utilize the full temporal signal of every event. We analytically contrast the regret of both approaches under a large-n limit with Weibull-distributed survival times satisfying the proportional hazards assumption. Our analytical results prove that dichotomization imposes substantial inefficiency, increasing regret by 41% to 54% even under optimal parameterization. We further demonstrate that survival bandits are uniquely robust to event rate uncertainty, whereas dichotomized approaches suffer significant performance degradation due to fragile threshold dependencies. Even when the proportional hazards assumption is violated, we show that survival bandits effectively identify the best arm under realistic scenarios. Simulations using real-world cervical cancer screening data validate our findings, demonstrating that survival bandits consistently reduce regret relative to the best-performing dichotomized algorithms.

Elsevier BV

Arielle Anderer Hamsa Bastani John Silberholz

2026

Title: Online Learning with Survival Data

Description:

Decision-makers frequently utilize adaptive experiments to optimize time-to-event outcomes, such as accelerating healthcare screenings or delaying customer churn.

Traditional multi-armed bandit algorithms fail in these settings because they assume outcome delays are non-informative, leading practitioners to rely on dichotomization, a heuristic that collapses continuous timing into binary outcomes at a fixed threshold.

We introduce "survival bandits," a principled class of algorithms that integrate the Cox proportional hazards model to utilize the full temporal signal of every event.

We analytically contrast the regret of both approaches under a large-n limit with Weibull-distributed survival times satisfying the proportional hazards assumption.

Our analytical results prove that dichotomization imposes substantial inefficiency, increasing regret by 41% to 54% even under optimal parameterization.

We further demonstrate that survival bandits are uniquely robust to event rate uncertainty, whereas dichotomized approaches suffer significant performance degradation due to fragile threshold dependencies.

Even when the proportional hazards assumption is violated, we show that survival bandits effectively identify the best arm under realistic scenarios.

Simulations using real-world cervical cancer screening data validate our findings, demonstrating that survival bandits consistently reduce regret relative to the best-performing dichotomized algorithms.

Back

Abstract Background: To minimize the risk of infection during the COVID-19 pandemic, the learning mode of universities in China has been adjusted, and the online learning o...

E-Learning

E-Learning ist heute aus keinem pädagogischen Lehrraum mehr wegzudenken. In allen Bereichen von Schule über die berufliche bis zur universitären Ausbildung und besonders im Bereich...

NURSING STUDENTS’ LEARNING EXPERIENCES IN AN ONLINE LEARNING COURSE

<p>To improve the quality of online learning in Indonesia higher education, Faculty of Nursing (FoN), Universitas Pelita Harapan (UPH) supported by the Directorate of Higher ...

Systematics of Literature Reviews: Learning Model of Discovery Learning in Science Learning

The development of the 21st century has affected the world of education. Current education students must be led to learn more creatively and actively. This study aims Furthermore, ...

ONLINE INSTRUCTIONAL STRATEGIES

Online instructional strategies refer to the methods and approaches that guide the organization of learning activities, course content, and student engagement in online courses. St...

Effect of Learning Management Using Problem-based Learning on Fine Arts Basic Ability of Freshmen in Suzhou Arts and Design Institute, The People’s Republic of China

Background and Aim: Learning Management Using Problem-Based Learning students can have better development of creativity, the ability to apply in real-world situations, aesthetic ap...

IDENTIFYING BARRIERS IN E – LEARNING, A MEDICAL STUDENT’S PERSPECTIVE

Objective: To recognize the barriers in different modes of e learning, from the medical student’s perspective during the period of Covid 19 pandemic. Study Desi...

Online Learning Self-Efficacy during an Emergent Transition: A Cross-sectional Survey among Undergraduate Students in Saudi Arabia

Introduction: The Coronavirus Disease-2019 (COVID-19) pandemic significantly affected higher education, necessitating a sudden shift to virtual classes in response to COVID-19 rest...

Email:
Password:

Email:

Online Learning with Survival Data

Related Results