Enhance word representation for out-of-vocabulary on Ubuntu dialogue corpus

02/07/2018
by   Jianxiong Dong, et al.
0

Ubuntu dialogue corpus is the largest public available dialogue corpus to make it feasible to build end-to-end deep neural network models directly from the conversation data. One challenge of Ubuntu dialogue corpus is the large number of out-of-vocabulary words. In this paper we proposed a method which combines the general pre-trained word embedding vectors with those generated on the task-specific training set to address this issue. We integrated character embedding into Chen et al's Enhanced LSTM method (ESIM) and used it to evaluate the effectiveness of our proposed method. For the task of next utterance selection, the proposed method has demonstrated a significant performance improvement against original ESIM and the new model has achieved state-of-the-art results on both Ubuntu dialogue corpus and Douban conversation corpus. In addition, we investigated the performance impact of end-of-utterance and end-of-turn token tags.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/03/2018

Building Sequential Inference Models for End-to-End Response Selection

This paper presents an end-to-end response selection model for Track 1 o...
research
02/15/2018

Improving Retrieval Modeling Using Cross Convolution Networks And Multi Frequency Word Embedding

To build a satisfying chatbot that has the ability of managing a goal-or...
research
07/28/2019

CAiRE: An End-to-End Empathetic Chatbot

In this paper, we present an end-to-end empathetic conversation agent CA...
research
08/02/2019

Dialogue Act Classification in Group Chats with DAG-LSTMs

Dialogue act (DA) classification has been studied for the past two decad...
research
09/29/2017

The BURCHAK corpus: a Challenge Data Set for Interactive Learning of Visually Grounded Word Meanings

We motivate and describe a new freely available human-human dialogue dat...
research
08/09/2022

Positively transitioned sentiment dialogue corpus for developing emotion-affective open-domain chatbots

In this paper, we describe a data enhancement method for developing Emil...
research
04/29/2019

A Persona-based Multi-turn Conversation Model in an Adversarial Learning Framework

In this paper, we extend the persona-based sequence-to-sequence (Seq2Seq...

Please sign up or login with your details

Forgot password? Click here to reset