Pre-training Intent-Aware Encoders for Zero- and Few-Shot Intent Classification

05/24/2023
by   Mujeen Sung, et al.
0

Intent classification (IC) plays an important role in task-oriented dialogue systems as it identifies user intents from given utterances. However, models trained on limited annotations for IC often suffer from a lack of generalization to unseen intent classes. We propose a novel pre-training method for text encoders that uses contrastive learning with intent psuedo-labels to produce embeddings that are well-suited for IC tasks. By applying this pre-training strategy, we also introduce the pre-trained intent-aware encoder (PIE). Specifically, we first train a tagger to identify key phrases within utterances that are crucial for interpreting intents. We then use these extracted phrases to create examples for pre-training a text encoder in a contrastive manner. As a result, our PIE model achieves up to 5.4 higher accuracy than the previous state-of-the-art pre-trained sentence encoder for the N-way zero- and one-shot settings on four IC datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/13/2021

Few-Shot Intent Detection via Contrastive Pre-Training and Fine-Tuning

In this work, we focus on a more challenging few-shot intent detection s...
research
06/22/2022

Template-based Approach to Zero-shot Intent Recognition

The recent advances in transfer learning techniques and pre-training of ...
research
05/15/2022

Fine-tuning Pre-trained Language Models for Few-shot Intent Detection: Supervised Pre-training and Isotropization

It is challenging to train a good intent classifier for a task-oriented ...
research
05/09/2023

Traffic Forecasting on New Roads Unseen in the Training Data Using Spatial Contrastive Pre-Training

New roads are being constructed all the time. However, the capabilities ...
research
04/06/2021

Contrastive Syn-to-Real Generalization

Training on synthetic data can be beneficial for label or data-scarce sc...
research
05/25/2023

Extracting Text Representations for Terms and Phrases in Technical Domains

Extracting dense representations for terms and phrases is a task of grea...
research
04/14/2023

WYTIWYR: A User Intent-Aware Framework with Multi-modal Inputs for Visualization Retrieval

Retrieving charts from a large corpus is a fundamental task that can ben...

Please sign up or login with your details

Forgot password? Click here to reset