Valid sequential inference on probability forecast performance

03/15/2021
by   Alexander Henzi, et al.
0

Probability forecasts for binary events play a central role in many applications. Their quality is commonly assessed with proper scoring rules, which assign forecasts a numerical score such that a correct forecast achieves a minimal expected score. In this paper, we construct e-values for testing the statistical significance of score differences of competing forecasts in sequential settings. E-values have been proposed as an alternative to p-values for hypothesis testing, and they can easily be transformed into conservative p-values by taking the multiplicative inverse. The e-values proposed in this article are valid in finite samples without any assumptions on the data generating processes. They also allow optional stopping, so a forecast user may decide to interrupt evaluation taking into account the available data at any time and still draw statistically valid inference, which is generally not true for classical p-value based tests. In a case study on postprocessing of precipitation forecasts, state-of-the-art forecasts dominance tests and e-values lead to the same conclusions.

READ FULL TEXT
research
09/24/2021

Sequentially valid tests for forecast calibration

Forecasting and forecast evaluation are inherently sequential tasks. Pre...
research
09/30/2021

Comparing Sequential Forecasters

Consider two or more forecasters, each making a sequence of predictions ...
research
11/29/2022

Score-based calibration testing for multivariate forecast distributions

Multivariate distributional forecasts have become widespread in recent y...
research
05/25/2021

Ranking earthquake forecasts using proper scoring rules: Binary events in a low probability environment

Operational earthquake forecasting for risk management and communication...
research
10/28/2019

Testing Forecast Rationality for Measures of Central Tendency

Rational respondents to economic surveys may report as a point forecast ...
research
04/10/2020

Forecasts with Bayesian vector autoregressions under real time conditions

This paper investigates the sensitivity of forecast performance measures...
research
09/06/2021

Using Proxies to Improve Forecast Evaluation

Comparative evaluation of forecasts of statistical functionals relies on...

Please sign up or login with your details

Forgot password? Click here to reset