Few-Shot NLU with Vector Projection Distance and Abstract Triangular CRF

12/09/2021
by Su Zhu, et al.

The data sparsity problem is a key challenge in Natural Language Understanding (NLU), especially for a new target domain. Few-shot NLU, which trains a model on source domains and applies it directly to an arbitrary target domain (even without fine-tuning), is therefore crucial for mitigating data scarcity. In this paper, we propose to improve prototypical networks with a vector projection distance and an abstract triangular Conditional Random Field (CRF) for few-shot NLU. The vector projection distance uses the projections of contextual word embeddings onto label vectors as word-label similarities, which is equivalent to a normalized linear model. The abstract triangular CRF learns domain-agnostic label transitions for the joint intent classification and slot filling tasks. Extensive experiments demonstrate that the proposed methods significantly surpass strong baselines. Specifically, our approach achieves new state-of-the-art results on two few-shot NLU benchmarks (Few-Joint and SNIPS), in Chinese and English respectively, without fine-tuning on target domains.
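
The word-label similarity described in the abstract can be illustrated with a small NumPy sketch. It is not the authors' implementation: the function names, the toy 2-dimensional embeddings, and the choice of label vectors as per-label means of support embeddings (the usual prototypical-network construction) are assumptions made for illustration. The score of a word for a label is taken to be the scalar projection of the word embedding onto the label vector, i.e. a linear score whose weight vector is normalized to unit length.

```python
import numpy as np

def label_prototypes(support_emb, support_labels, num_labels):
    """Assumed construction: one label vector per label, the mean of its support embeddings."""
    return np.stack(
        [support_emb[support_labels == k].mean(axis=0) for k in range(num_labels)]
    )

def vector_projection_scores(word_emb, label_vecs, eps=1e-12):
    """Scalar projection of every word embedding onto every label vector.

    score[i, k] = word_emb[i] . label_vecs[k] / ||label_vecs[k]||
    """
    norms = np.linalg.norm(label_vecs, axis=1, keepdims=True)
    unit_labels = label_vecs / np.maximum(norms, eps)  # normalized "weights"
    return word_emb @ unit_labels.T

# Toy support set: three word embeddings with labels 0, 0, 1 (hypothetical data).
support_emb = np.array([[1.0, 0.0], [3.0, 0.0], [0.0, 2.0]])
support_labels = np.array([0, 0, 1])
protos = label_prototypes(support_emb, support_labels, num_labels=2)

# Two query word embeddings, scored against both labels.
query_emb = np.array([[4.0, 1.0], [0.5, 3.0]])
scores = vector_projection_scores(query_emb, protos)
pred = scores.argmax(axis=1)  # highest projection wins
```

Because each label vector is divided by its own norm, the scores depend only on the direction of the label vector, which is one way to read the "normalized linear model" remark in the abstract.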

research
09/21/2020

Vector Projection Network for Few-shot Slot Tagging in Natural Language Understanding

Few-shot slot tagging becomes appealing for rapid domain transfer and ad...
research
06/10/2020

Few-shot Slot Tagging with Collapsed Dependency Transfer and Label-enhanced Task-adaptive Projection Network

In this paper, we explore the slot tagging with only a few labeled suppo...
research
10/07/2021

Bridge to Target Domain by Prototypical Contrastive Learning and Label Confusion: Re-explore Zero-Shot Learning for Slot Filling

Zero-shot cross-domain slot filling alleviates the data dependence in th...
research
09/11/2021

Prior Omission of Dissimilar Source Domain(s) for Cost-Effective Few-Shot Learning

Few-shot slot tagging is an emerging research topic in the field of Natu...
research
03/13/2021

Multilingual Code-Switching for Zero-Shot Cross-Lingual Intent Prediction and Slot Filling

Predicting user intent and detecting the corresponding slots from text a...
research
04/27/2021

Graphical Modeling for Multi-Source Domain Adaptation

Multi-Source Domain Adaptation (MSDA) focuses on transferring the knowle...
research
12/04/2020

Few-Shot Event Detection with Prototypical Amortized Conditional Random Field

Event Detection, a fundamental task of Information Extraction, tends to ...
