Revisit Out-Of-Vocabulary Problem for Slot Filling: A Unified Contrastive Framework with Multi-level Data Augmentations

02/27/2023
by Daichi Guo, et al.

In real dialogue scenarios, existing slot filling models tend to memorize entity patterns, so their generalization drops significantly when they face Out-of-Vocabulary (OOV) problems. To address this issue, we propose an OOV-robust slot filling model based on multi-level data augmentations that tackles the OOV problem from both the word and slot perspectives. We present a unified contrastive learning framework, which pulls the representations of the original sample and its augmented samples together, to make the model resistant to OOV problems. We evaluate the model on specific slots and carefully design test data with OOV word perturbations to further demonstrate the effectiveness of our method on OOV words. Experiments on two datasets show that our approach outperforms the previous state-of-the-art (SOTA) methods on both OOV slots and OOV words.
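
The idea of pulling an original sample and its augmented samples together can be illustrated with a small sketch. The abstract does not specify the loss, so the InfoNCE-style objective below is an assumption, as are the names `cosine`, `info_nce`, the `temperature` value, and the toy 2-dimensional vectors standing in for encoder outputs. It is not the authors' implementation.

```python
import math

def cosine(u, v):
    # Cosine similarity between two equal-length vectors.
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def info_nce(originals, augmented, temperature=0.1):
    # Assumed InfoNCE-style loss: originals[i] and augmented[i] form the
    # positive pair; every other augmented vector in the batch is a negative.
    losses = []
    for i, anchor in enumerate(originals):
        logits = [cosine(anchor, aug) / temperature for aug in augmented]
        # Log-sum-exp with max subtraction for numerical stability.
        m = max(logits)
        log_denom = m + math.log(sum(math.exp(l - m) for l in logits))
        losses.append(log_denom - logits[i])
    return sum(losses) / len(losses)

# Toy representations (hypothetical stand-ins for encoder outputs).
originals = [[1.0, 0.0], [0.0, 1.0]]
aligned = [[0.9, 0.1], [0.1, 0.9]]   # augmentations close to their originals
swapped = [[0.1, 0.9], [0.9, 0.1]]   # augmentations close to the wrong original

loss_aligned = info_nce(originals, aligned)
loss_swapped = info_nce(originals, swapped)
print(loss_aligned < loss_swapped)
```

In this sketch the loss is lower when each augmented representation sits near its own original, which is the behavior a contrastive objective of this kind rewards during training.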


research
08/24/2022

PSSAT: A Perturbed Semantic Structure Awareness Transferring Method for Perturbation-Robust Slot Filling

Most existing slot filling models tend to memorize inherent patterns of ...
research
08/09/2023

Slot Induction via Pre-trained Language Model Probing and Multi-level Contrastive Learning

Recent advanced methods in Natural Language Understanding for Task-orien...
research
10/26/2021

An Explicit-Joint and Supervised-Contrastive Learning Framework for Few-Shot Intent Classification and Slot Filling

Intent classification (IC) and slot filling (SF) are critical building b...
research
03/24/2022

mcBERT: Momentum Contrastive Learning with BERT for Zero-Shot Slot Filling

Zero-shot slot filling has received considerable attention to cope with ...
research
10/08/2020

Injecting Word Information with Multi-Level Word Adapter for Chinese Spoken Language Understanding

Intent detection and slot filling are two closely related tasks for buil...
research
07/18/2020

Slot Contrastive Networks: A Contrastive Approach for Representing Objects

Unsupervised extraction of objects from low-level visual data is an impo...
research
09/19/2020

Enhancing Dialogue Generation via Multi-Level Contrastive Learning

Most of the existing works for dialogue generation are data-driven model...
