Data Augmentation for Improving Tail-traffic Robustness in Skill-routing for Dialogue Systems

06/07/2023
by   Ting-Wei Wu, et al.
0

Large-scale conversational systems typically rely on a skill-routing component to route a user request to an appropriate skill and interpretation to serve the request. In such system, the agent is responsible for serving thousands of skills and interpretations which create a long-tail distribution due to the natural frequency of requests. For example, the samples related to play music might be a thousand times more frequent than those asking for theatre show times. Moreover, inputs used for ML-based skill routing are often a heterogeneous mix of strings, embedding vectors, categorical and scalar features which makes employing augmentation-based long-tail learning approaches challenging. To improve the skill-routing robustness, we propose an augmentation of heterogeneous skill-routing data and training targeted for robust operation in long-tail data regimes. We explore a variety of conditional encoder-decoder generative frameworks to perturb original data fields and create synthetic training data. To demonstrate the effectiveness of the proposed method, we conduct extensive experiments using real-world data from a commercial conversational system. Based on the experiment results, the proposed approach improves more than 80 traffic instances in the skill-routing replication task.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/04/2021

Neural model robustness for skill routing in large-scale conversational AI systems: A design choice exploration

Current state-of-the-art large-scale conversational AI or intelligent di...
research
04/14/2022

Scalable and Robust Self-Learning for Skill Routing in Large-Scale Conversational AI Systems

Skill routing is an important component in large-scale conversational sy...
research
04/26/2021

Handling Long-Tail Queries with Slice-Aware Conversational Systems

We have been witnessing the usefulness of conversational AI systems such...
research
07/28/2022

Learning Personalized Representations using Graph Convolutional Network

Generating representations that precisely reflect customers' behavior is...
research
07/28/2022

Learning Dynamic Manipulation Skills from Haptic-Play

In this paper, we propose a data-driven skill learning approach to solve...
research
06/05/2023

Improving Conversational Recommendation Systems via Counterfactual Data Simulation

Conversational recommender systems (CRSs) aim to provide recommendation ...

Please sign up or login with your details

Forgot password? Click here to reset