Controlled abstention neural networks for identifying skillful predictions for classification problems

04/16/2021
by Elizabeth A. Barnes, et al.

The Earth system is exceedingly complex and often chaotic in nature, making prediction incredibly challenging: we cannot expect to make perfect predictions all of the time. Instead, we look for specific states of the system that lead to more predictable behavior than others, often termed "forecasts of opportunity." When these opportunities are not present, scientists need prediction systems that are capable of saying "I don't know." We introduce a novel loss function, termed the "NotWrong loss", that allows neural networks to identify forecasts of opportunity for classification problems. The NotWrong loss introduces an abstention class that allows the network to identify the more confident samples and abstain (say "I don't know") on the less confident samples. The abstention loss is designed to abstain on a user-defined fraction of the samples via a proportional-integral-derivative (PID) controller. Unlike many machine learning methods that reject samples post-training, the NotWrong loss is applied during training to preferentially learn from the more confident samples. We show that the NotWrong loss outperforms other existing loss functions for multiple climate use cases. The proposed loss function is straightforward to implement in most network architectures designed for classification, as it requires only the addition of an abstention class to the output layer and a modification of the loss function.
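
As a rough illustration of the two ingredients named in the abstract (an extra abstention class in the output layer, and a PID controller that steers the abstention rate toward a user-defined fraction), the pure-Python sketch below may help. The loss formula, the names `abstention_loss`, `abstention_fraction`, `PIDController`, and `update_alpha`, the controller gains, and the rule for updating the penalty weight `alpha` are all assumptions made for illustration. The abstract does not specify them, so this should not be read as the authors' NotWrong loss.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def abstention_loss(logits, true_class, alpha):
    """Hypothetical per-sample loss with an abstention class.

    Assumption: the LAST logit is the abstention class. Probability placed
    on either the true class or on abstention counts as "not wrong", and
    alpha penalizes probability mass placed on abstention.
    """
    probs = softmax(logits)
    p_abstain = probs[-1]
    p_true = probs[true_class]
    eps = 1e-12  # guards against log(0)
    not_wrong_term = -math.log(p_true + p_abstain + eps)
    abstain_penalty = -alpha * math.log(1.0 - p_abstain + eps)
    return not_wrong_term + abstain_penalty


def abstention_fraction(batch_logits):
    """Fraction of samples whose largest logit is the abstention class."""
    abstained = 0
    for logits in batch_logits:
        predicted = max(range(len(logits)), key=logits.__getitem__)
        if predicted == len(logits) - 1:
            abstained += 1
    return abstained / len(batch_logits)


class PIDController:
    """Textbook discrete PID controller with a unit time step."""

    def __init__(self, kp, ki, kd):
        self.kp, self.ki, self.kd = kp, ki, kd
        self.integral = 0.0
        self.prev_error = 0.0

    def update(self, error):
        self.integral += error
        derivative = error - self.prev_error
        self.prev_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative


def update_alpha(alpha, pid, observed_fraction, target_fraction):
    """Assumed update rule: abstaining too often raises the penalty alpha."""
    error = observed_fraction - target_fraction
    return max(0.0, alpha + pid.update(error))
```

In a training loop one might call `abstention_fraction` on each batch (or epoch) and pass the result to `update_alpha`, so that `alpha` rises when the network abstains more often than the target fraction and falls when it abstains less often. The gains `kp`, `ki`, and `kd` would need tuning for any real problem.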

research
04/16/2021

Controlled abstention neural networks for identifying skillful predictions for regression problems

The earth system is exceedingly complex and often chaotic in nature, mak...
research
06/04/2021

A novel multi-scale loss function for classification problems in machine learning

We introduce two-scale loss functions for use in various gradient descen...
research
11/14/2017

Loss Functions for Multiset Prediction

We study the problem of multiset prediction. The goal of multiset predic...
research
06/07/2021

Error Loss Networks

A novel model called error loss network (ELN) is proposed to build an er...
research
06/29/2019

Deep Gamblers: Learning to Abstain with Portfolio Theory

We deal with the selective classification problem (supervised-learning p...
research
08/13/2023

Effect of Choosing Loss Function when Using T-batching for Representation Learning on Dynamic Networks

Representation learning methods have revolutionized machine learning on ...
research
12/02/2022

Loss shaping enhances exact gradient learning with EventProp in Spiking Neural Networks

In a recent paper Wunderlich and Pehle introduced the EventProp algorith...
