Rethinking Evaluation Metric for Probability Estimation Models Using Esports Data

09/12/2023
by Euihyeon Choi, et al.

Probability estimation models play an important role in fields such as weather forecasting, recommendation systems, and sports analysis. When several models estimate probabilities, it is difficult to judge which one gives reliable probabilities, because the ground-truth probabilities are not available. Win probability estimation for esports, which calculates the probability of winning given a certain game state, is one actively studied application of probability estimation. However, most previous works evaluated their models using accuracy, a metric that can only measure discrimination performance. In this work, we first investigate the Brier score and the Expected Calibration Error (ECE) as replacements for accuracy as the performance evaluation metric for win probability estimation models in the esports field. Based on this analysis, we propose a novel metric called the Balance score, a simple yet effective metric with respect to six good properties that a probability estimation metric should have. Under general conditions, we also found that the Balance score can be an effective approximation of the true expected calibration error, which ECE approximates only imperfectly through the binning technique. Extensive evaluations using simulation studies and real game snapshot data demonstrate the promising potential of adopting the proposed metric not only for win probability estimation models in esports but also for evaluating general probability estimation models.
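To make the contrast between accuracy, the Brier score, and binned ECE concrete, here is a minimal Python sketch using their standard textbook definitions. It is not the authors' code, and it does not attempt the Balance score, which the abstract does not define. The function names, the equal-width binning with 10 bins, the 0.5 decision threshold, and the toy data are all assumptions made for illustration.

```python
import numpy as np

def accuracy(probs, outcomes, threshold=0.5):
    """Fraction of correct win/loss calls after thresholding (assumed threshold: 0.5)."""
    probs = np.asarray(probs, dtype=float)
    outcomes = np.asarray(outcomes, dtype=float)
    predicted = (probs >= threshold).astype(float)
    return float(np.mean(predicted == outcomes))

def brier_score(probs, outcomes):
    """Mean squared difference between predicted probability and the 0/1 outcome."""
    probs = np.asarray(probs, dtype=float)
    outcomes = np.asarray(outcomes, dtype=float)
    return float(np.mean((probs - outcomes) ** 2))

def binned_ece(probs, outcomes, n_bins=10):
    """Binned ECE: weighted mean gap between average prediction and win rate per bin."""
    probs = np.asarray(probs, dtype=float)
    outcomes = np.asarray(outcomes, dtype=float)
    # Equal-width bins on [0, 1]; a prediction of exactly 1.0 goes into the last bin.
    bin_ids = np.minimum((probs * n_bins).astype(int), n_bins - 1)
    ece = 0.0
    for b in range(n_bins):
        mask = bin_ids == b
        if not mask.any():
            continue  # empty bins contribute nothing
        gap = abs(probs[mask].mean() - outcomes[mask].mean())
        ece += mask.mean() * gap  # mask.mean() is the share of samples in this bin
    return float(ece)

# Hypothetical toy data: 8 game snapshots, 1 = the team went on to win.
outcomes = [1, 1, 1, 0, 0, 0, 0, 1]
calibrated = [0.75] * 4 + [0.25] * 4      # matches the observed win rates
overconfident = [0.99] * 4 + [0.01] * 4   # same win/loss calls, extreme probabilities

for name, probs in [("calibrated", calibrated), ("overconfident", overconfident)]:
    print(name, accuracy(probs, outcomes), brier_score(probs, outcomes),
          binned_ece(probs, outcomes))
```

On this toy data both models make identical win/loss calls, so accuracy cannot tell them apart, while the Brier score and binned ECE both penalize the overconfident one. Note that the binned ECE value depends on the choice of `n_bins`, which is one face of the imperfect approximation the abstract mentions.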


