Javascript must be enabled to continue!
A Learning and Rectification Algorithm for Nonstationary Online Linear Programming
View through CrossRef
This paper investigates online linear programming (OLP) under resource capacity constraints, aiming to maximize total reward over a finite horizon. To address the limitations of stochastic models in nonstationary settings and the conservatism of adversarial models, we introduce a globally nonstationary and locally stationary arrival model to the OLP context. This framework partitions the horizon into known periods where requests are independent and identically distributed within each period, while the underlying distribution varies across periods, balancing environmental evolution with local stability. Assuming the true distribution is unknown, but an inaccurate prior is available, we propose the Gradient Descent with Learning and Rectification (GDLR) framework. This dual-based method optimizes budget consumption for each request through two complementary components: a learning component that leverages local stationarity to refine estimates by combining prior knowledge with real-time observations, and a rectification component that counters global nonstationarity by adaptively adjusting estimates based on cumulative resource imbalance. Theoretically, we demonstrate that rectification serves as robust optimization against estimation errors in dual and ensures primal feasibility. The regret bound of GDLR decomposes into intrinsic stochasticity, estimation error, and allocation suboptimality, with the latter two significantly reduced via learning and rectification. Empirically, our approach consistently outperforms baselines on synthetic and real-world datasets. To our knowledge, this work presents the first enterprise OLP deployment at Alipay, achieving a 2.2% average revenue gain in online A/B tests.
Title: A Learning and Rectification Algorithm for Nonstationary Online Linear Programming
Description:
This paper investigates online linear programming (OLP) under resource capacity constraints, aiming to maximize total reward over a finite horizon.
To address the limitations of stochastic models in nonstationary settings and the conservatism of adversarial models, we introduce a globally nonstationary and locally stationary arrival model to the OLP context.
This framework partitions the horizon into known periods where requests are independent and identically distributed within each period, while the underlying distribution varies across periods, balancing environmental evolution with local stability.
Assuming the true distribution is unknown, but an inaccurate prior is available, we propose the Gradient Descent with Learning and Rectification (GDLR) framework.
This dual-based method optimizes budget consumption for each request through two complementary components: a learning component that leverages local stationarity to refine estimates by combining prior knowledge with real-time observations, and a rectification component that counters global nonstationarity by adaptively adjusting estimates based on cumulative resource imbalance.
Theoretically, we demonstrate that rectification serves as robust optimization against estimation errors in dual and ensures primal feasibility.
The regret bound of GDLR decomposes into intrinsic stochasticity, estimation error, and allocation suboptimality, with the latter two significantly reduced via learning and rectification.
Empirically, our approach consistently outperforms baselines on synthetic and real-world datasets.
To our knowledge, this work presents the first enterprise OLP deployment at Alipay, achieving a 2.
2% average revenue gain in online A/B tests.
Related Results
Initial Experience with Pediatrics Online Learning for Nonclinical Medical Students During the COVID-19 Pandemic
Initial Experience with Pediatrics Online Learning for Nonclinical Medical Students During the COVID-19 Pandemic
Abstract
Background: To minimize the risk of infection during the COVID-19 pandemic, the learning mode of universities in China has been adjusted, and the online learning o...
Sensitivity Analysis of Flood Risk Estimation under Nonstationary Conditions: A Case Study of the Weihe River, China
Sensitivity Analysis of Flood Risk Estimation under Nonstationary Conditions: A Case Study of the Weihe River, China
Flood risk has been increasing in many basins of the world, due to the global water cycle change driving by the global climate warming. To deal with the nonstationary properties of...
14. Rectification
14. Rectification
Rectification is an equitable remedy through which the court can rectify, or correct a mistake in a written contract. This chapter examines two principal forms of rectification: co...
Programming model abstractions for optimizing I/O intensive applications
Programming model abstractions for optimizing I/O intensive applications
This thesis contributes from the perspective of task-based programming models to the efforts of optimizing I/O intensive applications. Throughout this thesis, we propose programmin...
Covered solution for a grey linear program based on a general formula for the inverse of a grey matrix
Covered solution for a grey linear program based on a general formula for the inverse of a grey matrix
Purpose
– This paper attempts to establish the general formula for computing the inverse of grey matrix, and the results are applied to solve grey linear programmin...
Analisis Pembelajaran Algoritma Pemrograman
Analisis Pembelajaran Algoritma Pemrograman
Programming algorithm learning analysis is a study that examines the approaches, methods, and results of the algorithm teaching and learning process in the world of programming. Th...
WEB PROGRAMMING
WEB PROGRAMMING
"Web Programming" is a comprehensive book that provides a detailed overview of various aspects of web programming. The book is co-authored by Dr. Chitra Ravi and Dr. Mohan Kumar S,...
Interdisciplinary perspective on architectural programming: current status and future directions
Interdisciplinary perspective on architectural programming: current status and future directions
PurposeArchitectural programming, as a critical phase in construction projects, has been widely recognized for its importance and advantages throughout the construction process. Wi...

