Sentence Level Curriculum Learning for Improved Neural Conversational Models

05/15/2023
by   Sean Paulsen, et al.
0

Designing machine intelligence to converse with a human user necessarily requires an understanding of how humans participate in conversation, and thus conversation modeling is an important task in natural language processing. New breakthroughs in architecture and data gathering continue to push the performance of such conversational AI models. However, designs neglect the gradual buildup in sentence structure and complexity experienced by humans as we learn to communicate. During training, our model accepts one or more sentences as input and attempts to predict the next sentence in the conversation one word at a time, so our goal is to separate training into segments, with each segment's corpus comprised of longer sentence pairs than the previous one. This will mimic the desired "buildup" component of human learning. We begin with only "short" length sentence pairs, then only "medium" length pairs, and so on. A majority of our experiments were toward optimizing this technique, ensuring a proper representation of the technique's potential, since many of the details were new questions. Our segment-trained models were then able to achieve lower validation loss at the end of training than models trained with standard text preparation. This segmented training is straightforward to implement and our results provide a general direction for future research to implement and improve it.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/24/2019

Fine-Grained Sentence Functions for Short-Text Conversation

Sentence function is an important linguistic feature referring to a user...
research
04/07/2022

Testing the limits of natural language models for predicting human language judgments

Neural network language models can serve as computational hypotheses abo...
research
07/14/2023

Understanding Multi-Turn Toxic Behaviors in Open-Domain Chatbots

Recent advances in natural language processing and machine learning have...
research
07/07/2021

Anticipating Safety Issues in E2E Conversational AI: Framework and Tooling

Over the last several years, end-to-end neural conversational agents hav...
research
06/19/2015

A Neural Conversational Model

Conversational modeling is an important task in natural language underst...
research
12/02/2019

Fiction Sentence Expansion and Enhancement via Focused Objective and Novelty Curve Sampling

We describe the task of sentence expansion and enhancement, in which a s...
research
05/22/2019

Sentence Length

The distribution of sentence length in ordinary language is not well cap...

Please sign up or login with your details

Forgot password? Click here to reset