Active Learning for Domain Classification in a Commercial Spoken Personal Assistant

08/29/2019
by   Xi C. Chen, et al.
0

We describe a method for selecting relevant new training data for the LSTM-based domain selection component of our personal assistant system. Adding more annotated training data for any ML system typically improves accuracy, but only if it provides examples not already adequately covered in the existing data. However, obtaining, selecting, and labeling relevant data is expensive. This work presents a simple technique that automatically identifies new helpful examples suitable for human annotation. Our experimental results show that the proposed method, compared with random-selection and entropy-based methods, leads to higher accuracy improvements given a fixed annotation budget. Although developed and tested in the setting of a commercial intelligent assistant, the technique is of wider applicability.

READ FULL TEXT
research
01/18/2022

Active Learning for Open-set Annotation

Existing active learning studies typically work in the closed-set settin...
research
09/12/2023

Annotating Data for Fine-Tuning a Neural Ranker? Current Active Learning Strategies are not Better than Random Selection

Search methods based on Pretrained Language Models (PLM) have demonstrat...
research
04/28/2020

Active Learning for Coreference Resolution using Discrete Annotation

We improve upon pairwise annotation for active learning in coreference r...
research
07/04/2021

Multiple-criteria Based Active Learning with Fixed-size Determinantal Point Processes

Active learning aims to achieve greater accuracy with less training data...
research
08/06/2020

Cross-Model Image Annotation Platform with Active Learning

We have seen significant leapfrog advancement in machine learning in rec...
research
07/07/2023

Training Ensembles with Inliers and Outliers for Semi-supervised Active Learning

Deep active learning in the presence of outlier examples poses a realist...

Please sign up or login with your details

Forgot password? Click here to reset