"Calibeating": Beating Forecasters at Their Own Game

09/11/2022
by   Dean P. Foster, et al.
1

In order to identify expertise, forecasters should not be tested by their calibration score, which can always be made arbitrarily small, but rather by their Brier score. The Brier score is the sum of the calibration score and the refinement score; the latter measures how good the sorting into bins with the same forecast is, and thus attests to "expertise." This raises the question of whether one can gain calibration without losing expertise, which we refer to as "calibeating." We provide an easy way to calibeat any forecast, by a deterministic online procedure. We moreover show that calibeating can be achieved by a stochastic procedure that is itself calibrated, and then extend the results to simultaneously calibeating multiple procedures, and to deterministic procedures that are continuously calibrated.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/13/2022

Forecast Hedging and Calibration

Calibration means that forecasts and average realized frequencies are cl...
research
10/13/2022

Smooth Calibration, Leaky Forecasts, Finite Recall, and Nash Dynamics

We propose to smooth out the calibration score, which measures how good ...
research
12/23/2020

Testing whether a Learning Procedure is Calibrated

A learning procedure takes as input a dataset and performs inference for...
research
04/27/2022

Faster online calibration without randomization: interval forecasts and the power of two choices

We study the problem of making calibrated probabilistic forecasts for a ...
research
02/21/2023

A Unifying Perspective on Multi-Calibration: Unleashing Game Dynamics for Multi-Objective Learning

We provide a unifying framework for the design and analysis of multi-cal...
research
08/19/2022

Improving knockoffs with conditional calibration

The knockoff filter of Barber and Candes (arXiv:1404.5609) is a flexible...
research
05/10/2022

Preliminary assessment of a cost-effective headphone calibration procedure for soundscape evaluations

The introduction of ISO 12913-2:2018 has provided a framework for standa...

Please sign up or login with your details

Forgot password? Click here to reset