Javascript must be enabled to continue!

A Learning and Rectification Algorithm for Nonstationary Online Linear Programming

This paper investigates online linear programming (OLP) under resource capacity constraints, aiming to maximize total reward over a finite horizon. To address the limitations of stochastic models in nonstationary settings and the conservatism of adversarial models, we introduce a globally nonstationary and locally stationary arrival model to the OLP context. This framework partitions the horizon into known periods where requests are independent and identically distributed within each period, while the underlying distribution varies across periods, balancing environmental evolution with local stability. Assuming the true distribution is unknown, but an inaccurate prior is available, we propose the Gradient Descent with Learning and Rectification (GDLR) framework. This dual-based method optimizes budget consumption for each request through two complementary components: a learning component that leverages local stationarity to refine estimates by combining prior knowledge with real-time observations, and a rectification component that counters global nonstationarity by adaptively adjusting estimates based on cumulative resource imbalance. Theoretically, we demonstrate that rectification serves as robust optimization against estimation errors in dual and ensures primal feasibility. The regret bound of GDLR decomposes into intrinsic stochasticity, estimation error, and allocation suboptimality, with the latter two significantly reduced via learning and rectification. Empirically, our approach consistently outperforms baselines on synthetic and real-world datasets. To our knowledge, this work presents the first enterprise OLP deployment at Alipay, achieving a 2.2% average revenue gain in online A/B tests.

Elsevier BV

Jiawen Yang Xingyu Lu Caihua Chen Wei Lu Baozhuang Niu Jun Zhou

2026

Title: A Learning and Rectification Algorithm for Nonstationary Online Linear Programming

Description:

This paper investigates online linear programming (OLP) under resource capacity constraints, aiming to maximize total reward over a finite horizon.

To address the limitations of stochastic models in nonstationary settings and the conservatism of adversarial models, we introduce a globally nonstationary and locally stationary arrival model to the OLP context.

This framework partitions the horizon into known periods where requests are independent and identically distributed within each period, while the underlying distribution varies across periods, balancing environmental evolution with local stability.

Assuming the true distribution is unknown, but an inaccurate prior is available, we propose the Gradient Descent with Learning and Rectification (GDLR) framework.

This dual-based method optimizes budget consumption for each request through two complementary components: a learning component that leverages local stationarity to refine estimates by combining prior knowledge with real-time observations, and a rectification component that counters global nonstationarity by adaptively adjusting estimates based on cumulative resource imbalance.

Theoretically, we demonstrate that rectification serves as robust optimization against estimation errors in dual and ensures primal feasibility.

The regret bound of GDLR decomposes into intrinsic stochasticity, estimation error, and allocation suboptimality, with the latter two significantly reduced via learning and rectification.

Empirically, our approach consistently outperforms baselines on synthetic and real-world datasets.

To our knowledge, this work presents the first enterprise OLP deployment at Alipay, achieving a 2.

2% average revenue gain in online A/B tests.

Back

Abstract Background: To minimize the risk of infection during the COVID-19 pandemic, the learning mode of universities in China has been adjusted, and the online learning o...

Sensitivity Analysis of Flood Risk Estimation under Nonstationary Conditions: A Case Study of the Weihe River, China

Flood risk has been increasing in many basins of the world, due to the global water cycle change driving by the global climate warming. To deal with the nonstationary properties of...

14. Rectification

Rectification is an equitable remedy through which the court can rectify, or correct a mistake in a written contract. This chapter examines two principal forms of rectification: co...

Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)

BACKGROUND As of July 2020, a Web of Science search of “machine learning (ML)” nested within the search of “pharmacokinetics or pharmacodynamics” yielded over 100...

Programming model abstractions for optimizing I/O intensive applications

This thesis contributes from the perspective of task-based programming models to the efforts of optimizing I/O intensive applications. Throughout this thesis, we propose programmin...

Covered solution for a grey linear program based on a general formula for the inverse of a grey matrix

Purpose – This paper attempts to establish the general formula for computing the inverse of grey matrix, and the results are applied to solve grey linear programmin...

Analisis Pembelajaran Algoritma Pemrograman

Programming algorithm learning analysis is a study that examines the approaches, methods, and results of the algorithm teaching and learning process in the world of programming. Th...

WEB PROGRAMMING

"Web Programming" is a comprehensive book that provides a detailed overview of various aspects of web programming. The book is co-authored by Dr. Chitra Ravi and Dr. Mohan Kumar S,...

Email:
Password:

Email:

A Learning and Rectification Algorithm for Nonstationary Online Linear Programming

Related Results