Out-of-Distribution Generalization in Text Classification: Past, Present, and Future

05/23/2023
by   Linyi Yang, et al.
0

Machine learning (ML) systems in natural language processing (NLP) face significant challenges in generalizing to out-of-distribution (OOD) data, where the test distribution differs from the training data distribution. This poses important questions about the robustness of NLP models and their high accuracy, which may be artificially inflated due to their underlying sensitivity to systematic biases. Despite these challenges, there is a lack of comprehensive surveys on the generalization challenge from an OOD perspective in text classification. Therefore, this paper aims to fill this gap by presenting the first comprehensive review of recent progress, methods, and evaluations on this topic. We furth discuss the challenges involved and potential future research directions. By providing quick access to existing work, we hope this survey will encourage future research in this area.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/02/2020

A Survey on Text Classification: From Shallow to Deep Learning

Text classification is the most fundamental and essential task in natura...
research
08/22/2022

Recent Advances in Text-to-SQL: A Survey of What We Have and What We Expect

Text-to-SQL has attracted attention from both the natural language proce...
research
05/05/2023

A Survey on Out-of-Distribution Detection in NLP

Out-of-distribution (OOD) detection is essential for the reliable and sa...
research
11/02/2020

Automatic Detection of Machine Generated Text: A Critical Survey

Text generative models (TGMs) excel in producing text that matches the s...
research
03/18/2023

Requirement Formalisation using Natural Language Processing and Machine Learning: A Systematic Review

Improvement of software development methodologies attracts developers to...
research
05/04/2020

NLP in FinTech Applications: Past, Present and Future

Financial Technology (FinTech) is one of the worldwide rapidly-rising to...
research
12/28/2022

A System-Level View on Out-of-Distribution Data in Robotics

When testing conditions differ from those represented in training data, ...

Please sign up or login with your details

Forgot password? Click here to reset