Javascript must be enabled to continue!

Is This Predictor More Informative than Another? A Decision-Theoretical Comparison

In many real-world applications, a model provider provides probabilistic forecasts to downstream decision-makers who use them to make decisions under diverse payoff objectives. The provider may have access to multiple predictive models, each potentially miscalibrated, and must choose which model to deploy in order to maximize the usefulness of predictions for downstream decisions. A central challenge arises: how can the provider meaningfully compare two predictors when neither is guaranteed to be well-calibrated, and when the relevant decision tasks may differ across users and contexts?  <div> <br> </div> <div> To answer this, our first contribution introduces the notion of the informativeness gap between any two predictors, defined as the maximum normalized payoff advantage one predictor offers over the other across all decision-making tasks. Our framework strictly generalizes several existing notions: it subsumes U-Calibration [Kleinberg et al., 2023] and Calibration Decision Loss [Hu and Wu, 2024], which compare a miscalibrated predictor to its calibrated counterpart, and it recovers Blackwell informativeness [Blackwell, 1951, 1953] as a special case when both predictors are perfectly calibrated. Our second contribution is a dual characterization of the informativeness gap, which gives rise to a natural informativeness measure that can be viewed as a relaxed variant of the earth mover's distance (EMD) between two prediction distributions. We show that this measure satisfies natural desiderata: it is complete and sound, and it can be estimated sample-efficiently in the prediction-only access setting. Along the way, we also obtain novel combinatorial structural results when applying this measure to perfectly calibrated predictors. We complement our theory with experiments on LLM-based forecasters in real-world weather and Bitcoin prediction tasks, showing that the informativeness gap offers a more decision-relevant alternative to traditional metrics like the Brier score/ECE, and provides a principled lens for evaluating how ad hoc calibration post-processing affects downstream decision usefulness. </div>

Elsevier BV

Yiding Feng Liuhan Qian Wei Tang

2026

Title: Is This Predictor More Informative than Another? A Decision-Theoretical Comparison

Description:

In many real-world applications, a model provider provides probabilistic forecasts to downstream decision-makers who use them to make decisions under diverse payoff objectives.

The provider may have access to multiple predictive models, each potentially miscalibrated, and must choose which model to deploy in order to maximize the usefulness of predictions for downstream decisions.

A central challenge arises: how can the provider meaningfully compare two predictors when neither is guaranteed to be well-calibrated, and when the relevant decision tasks may differ across users and contexts?  <div> <br> </div> <div> To answer this, our first contribution introduces the notion of the informativeness gap between any two predictors, defined as the maximum normalized payoff advantage one predictor offers over the other across all decision-making tasks.

Our framework strictly generalizes several existing notions: it subsumes U-Calibration [Kleinberg et al.

, 2023] and Calibration Decision Loss [Hu and Wu, 2024], which compare a miscalibrated predictor to its calibrated counterpart, and it recovers Blackwell informativeness [Blackwell, 1951, 1953] as a special case when both predictors are perfectly calibrated.

Our second contribution is a dual characterization of the informativeness gap, which gives rise to a natural informativeness measure that can be viewed as a relaxed variant of the earth mover's distance (EMD) between two prediction distributions.

We show that this measure satisfies natural desiderata: it is complete and sound, and it can be estimated sample-efficiently in the prediction-only access setting.

Along the way, we also obtain novel combinatorial structural results when applying this measure to perfectly calibrated predictors.

We complement our theory with experiments on LLM-based forecasters in real-world weather and Bitcoin prediction tasks, showing that the informativeness gap offers a more decision-relevant alternative to traditional metrics like the Brier score/ECE, and provides a principled lens for evaluating how ad hoc calibration post-processing affects downstream decision usefulness.

</div>.

Back

Related Results

Autonomy on Trial

Photo by CHUTTERSNAP on Unsplash Abstract This paper critically examines how US bioethics and health law conceptualize patient autonomy, contrasting the rights-based, individualist...

GIS BASED DECISION SUPPORT SYSTEM FOR SEISMIC RISK IN BUCHAREST. CASE STUDY – THE HISTORICAL CENTRE

Because of the increasing volume of information, problem decisions tend to be more difficult to deal with. Achieving an objective and making a suitable decision may become a real c...

Informative Lagrange Multipliers in the Nonlinear Parametric Programming Model

Abstract The shadow price expresses the marginal cost with respect to the variation of constraints, and it is extremely useful in the sensitivity analysis of nonlinear prog...

Regional Flood Frequency Analysis Using an Artificial Neural Network Model

This paper presents the results from a study on the application of an artificial neural network (ANN) model for regional flood frequency analysis (RFFA). The study was conducted us...

metapredict: a fast, accurate, and easy-to-use predictor of consensus disorder and structure

Abstract Intrinsically disordered proteins and protein regions make up a substantial fraction of many proteomes where they play a wide variety of essential roles. A...

Operational decision-making with machine learning and causal inference

Optimizing operational decisions, routine actions within some business or operational process, is a key challenge across a variety of domains and application areas. The increasing ...

A novel linguistic decision making approach based on attribute correlation and EDAS method

AbstractOne of characteristics of large-scale linguistic decision making problems is that decision information with respect to decision making attributes is derived from multi-sour...

CONCEPTUAL DECISION MODEL

Decision making is mostly based on decision concepts and decision models built in decision support systems. Type of decision problem determines application. This paper presents a c...

Email:
Password:

Email: