Evaluating Predictive Uncertainty and Robustness to Distributional Shift Using Real World Data

11/08/2021
by   Kumud Lakara, et al.
10

Most machine learning models operate under the assumption that the training, testing and deployment data is independent and identically distributed (i.i.d.). This assumption doesn't generally hold true in a natural setting. Usually, the deployment data is subject to various types of distributional shifts. The magnitude of a model's performance is proportional to this shift in the distribution of the dataset. Thus it becomes necessary to evaluate a model's uncertainty and robustness to distributional shifts to get a realistic estimate of its expected performance on real-world data. Present methods to evaluate uncertainty and model's robustness are lacking and often fail to paint the full picture. Moreover, most analysis so far has primarily focused on classification tasks. In this paper, we propose more insightful metrics for general regression tasks using the Shifts Weather Prediction Dataset. We also present an evaluation of the baseline methods using these metrics.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/01/2021

Evaluating Predictive Uncertainty under Distributional Shift on Dialogue Dataset

In open-domain dialogues, predictive uncertainties are mainly evaluated ...
research
12/16/2021

Benchmarking Uncertainty Qualification on Biosignal Classification Tasks under Dataset Shift

A biosignal is a signal that can be continuously measured from human bod...
research
10/20/2021

Distributionally Robust Classifiers in Sentiment Analysis

In this paper, we propose sentiment classification models based on BERT ...
research
07/01/2022

Robustness of Epinets against Distributional Shifts

Recent work introduced the epinet as a new approach to uncertainty model...
research
11/30/2016

Reliable Evaluation of Neural Network for Multiclass Classification of Real-world Data

This paper presents a systematic evaluation of Neural Network (NN) for c...
research
07/15/2021

Shifts: A Dataset of Real Distributional Shift Across Multiple Large-Scale Tasks

There has been significant research done on developing methods for impro...
research
06/30/2022

Shifts 2.0: Extending The Dataset of Real Distributional Shifts

Distributional shift, or the mismatch between training and deployment da...

Please sign up or login with your details

Forgot password? Click here to reset