Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

A Learning and Rectification Algorithm for Nonstationary Online Linear Programming

View through CrossRef
This paper investigates online linear programming (OLP) under resource capacity constraints, aiming to maximize total reward over a finite horizon. To address the limitations of stochastic models in nonstationary settings and the conservatism of adversarial models, we introduce a globally nonstationary and locally stationary arrival model to the OLP context. This framework partitions the horizon into known periods where requests are independent and identically distributed within each period, while the underlying distribution varies across periods, balancing environmental evolution with local stability. Assuming the true distribution is unknown, but an inaccurate prior is available, we propose the Gradient Descent with Learning and Rectification (GDLR) framework. This dual-based method optimizes budget consumption for each request through two complementary components: a learning component that leverages local stationarity to refine estimates by combining prior knowledge with real-time observations, and a rectification component that counters global nonstationarity by adaptively adjusting estimates based on cumulative resource imbalance. Theoretically, we demonstrate that rectification serves as robust optimization against estimation errors in dual and ensures primal feasibility. The regret bound of GDLR decomposes into intrinsic stochasticity, estimation error, and allocation suboptimality, with the latter two significantly reduced via learning and rectification. Empirically, our approach consistently outperforms baselines on synthetic and real-world datasets. To our knowledge, this work presents the first enterprise OLP deployment at Alipay, achieving a 2.2% average revenue gain in online A/B tests.
Title: A Learning and Rectification Algorithm for Nonstationary Online Linear Programming
Description:
This paper investigates online linear programming (OLP) under resource capacity constraints, aiming to maximize total reward over a finite horizon.
To address the limitations of stochastic models in nonstationary settings and the conservatism of adversarial models, we introduce a globally nonstationary and locally stationary arrival model to the OLP context.
This framework partitions the horizon into known periods where requests are independent and identically distributed within each period, while the underlying distribution varies across periods, balancing environmental evolution with local stability.
Assuming the true distribution is unknown, but an inaccurate prior is available, we propose the Gradient Descent with Learning and Rectification (GDLR) framework.
This dual-based method optimizes budget consumption for each request through two complementary components: a learning component that leverages local stationarity to refine estimates by combining prior knowledge with real-time observations, and a rectification component that counters global nonstationarity by adaptively adjusting estimates based on cumulative resource imbalance.
Theoretically, we demonstrate that rectification serves as robust optimization against estimation errors in dual and ensures primal feasibility.
The regret bound of GDLR decomposes into intrinsic stochasticity, estimation error, and allocation suboptimality, with the latter two significantly reduced via learning and rectification.
Empirically, our approach consistently outperforms baselines on synthetic and real-world datasets.
To our knowledge, this work presents the first enterprise OLP deployment at Alipay, achieving a 2.
2% average revenue gain in online A/B tests.

Related Results

CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
The pandemic Covid-19 currently demands teachers to be able to use technology in teaching and learning process. But in reality there are still many teachers who have not been able ...
Nonstationary Flood Hazard Analysis in Response to Climate Change and Population Growth
Nonstationary Flood Hazard Analysis in Response to Climate Change and Population Growth
The predictions of flood hazard over the design life of a hydrological project are of great importance for hydrological engineering design under the changing environment. The conce...
Initial Experience with Pediatrics Online Learning for Nonclinical Medical Students During the COVID-19 Pandemic 
Initial Experience with Pediatrics Online Learning for Nonclinical Medical Students During the COVID-19 Pandemic 
Abstract Background: To minimize the risk of infection during the COVID-19 pandemic, the learning mode of universities in China has been adjusted, and the online learning o...
14. Rectification
14. Rectification
Rectification is an equitable remedy through which the court can rectify, or correct a mistake in a written contract. This chapter examines two principal forms of rectification: co...
Sensitivity Analysis of Flood Risk Estimation under Nonstationary Conditions: A Case Study of the Weihe River, China
Sensitivity Analysis of Flood Risk Estimation under Nonstationary Conditions: A Case Study of the Weihe River, China
Flood risk has been increasing in many basins of the world, due to the global water cycle change driving by the global climate warming. To deal with the nonstationary properties of...
Active Learning in the Context of the Teaching/Learning of Computer Programming: A Systematic Review
Active Learning in the Context of the Teaching/Learning of Computer Programming: A Systematic Review
Aim/Purpose: This paper presents the results of a systematic literature review that sought to identify the studies that relate the different pedagogical techniques by which active ...
16. Rectification of the Register of Title
16. Rectification of the Register of Title
Titles in the Complete series combine extracts from a wide range of primary materials with clear explanatory text to provide readers with a complete introductory resource. This cha...
16. Rectification of the Register of Title
16. Rectification of the Register of Title
Titles in the Complete series combine extracts from a wide range of primary materials with clear explanatory text to provide readers with a complete introductory resource. This cha...

Back to Top