kFolden: k-Fold Ensemble for Out-Of-Distribution Detection

08/29/2021
by   Xiaoya Li, et al.

Out-of-Distribution (OOD) detection is an important problem in natural language processing (NLP). In this work, we propose kFolden, a simple yet effective framework that mimics OOD detection during training without using any external data. For a task with k training labels, kFolden induces k sub-models, each trained on a subset covering k-1 categories, with the remaining category held out and treated as unknown to that sub-model. Because each sub-model is exposed to an unknown label during training, it is encouraged to spread probability evenly across its k-1 seen labels when given an example of the unknown label. This lets the framework handle in-distribution and out-of-distribution examples together in a natural way through OOD simulation. Taking text classification as an archetype, we develop benchmarks for OOD detection using existing text classification datasets. Through comprehensive comparisons and analyses on these benchmarks, we demonstrate that kFolden outperforms current methods on OOD detection while also improving in-domain classification accuracy.
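The leave-one-label-out construction described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `kfolden_splits` and the soft-target dictionary encoding are hypothetical, and the abstract does not specify the loss or data format used to push held-out examples toward a uniform prediction.

```python
def kfolden_splits(examples, labels):
    """Build one training set per held-out label (hypothetical helper).

    For the sub-model that does not see `held_out`:
      - examples of a seen label get a one-hot target over the k-1 seen labels;
      - examples of the held-out label get a uniform target over those labels.
    """
    label_set = sorted(set(labels))
    splits = {}
    for held_out in label_set:
        # The k-1 labels this sub-model is allowed to predict.
        seen = [label for label in label_set if label != held_out]
        uniform = {label: 1.0 / len(seen) for label in seen}
        data = []
        for text, label in zip(examples, labels):
            if label == held_out:
                # Simulated OOD example: spread probability evenly (assumed encoding).
                target = dict(uniform)
            else:
                target = {s: 1.0 if s == label else 0.0 for s in seen}
            data.append((text, target))
        splits[held_out] = data
    return splits
```

Each entry in the returned dictionary would be used to train one of the k sub-models. How the sub-model predictions are combined at test time is not described in the abstract, so it is left out of this sketch.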


