Learning to Start for Sequence to Sequence Architecture

08/19/2016
by   Qingfu Zhu, et al.

The sequence-to-sequence architecture is widely used in response generation and neural machine translation to model the potential relationship between two sentences. It typically consists of two parts: an encoder that reads the source sentence and a decoder that generates the target sentence word by word according to the encoder's output and the last generated word. However, it faces a cold-start problem when generating the first word, as there is no previous word to refer to. Existing work mainly uses a special start symbol </s> to generate the first word. An obvious drawback of this work is that there is no learnable relationship between words and the start symbol. Furthermore, an incorrectly generated first word may lead to error accumulation during decoding. In this paper, we propose a novel approach that learns to generate the first word in the sequence-to-sequence architecture rather than using the start symbol. Experimental results on the task of response generation in short-text conversation show that the proposed approach outperforms the state-of-the-art approach in both automatic and manual evaluations.
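
To make the cold-start setting concrete, the following is a minimal sketch of the conventional baseline the abstract describes: greedy word-by-word decoding that is seeded with the start symbol. It is not the paper's proposed method. The names `toy_step` and `greedy_decode`, the `<eos>` end marker, and the score table are all hypothetical stand-ins for a trained decoder.

```python
from typing import Callable, Dict, List

START = "</s>"   # start symbol as written in the abstract
END = "<eos>"    # hypothetical end-of-sequence marker (assumption, not from the source)

Scores = Dict[str, float]


def toy_step(context: str, prev_word: str) -> Scores:
    """Hypothetical stand-in for a decoder step.

    A real decoder would condition on the encoder output (`context`) and the
    previous word; this toy version only looks up the previous word in a
    hard-coded table so the example stays self-contained.
    """
    table: Dict[str, Scores] = {
        START: {"i": 0.6, "you": 0.4},
        "i": {"am": 0.9, END: 0.1},
        "am": {"fine": 0.7, END: 0.3},
        "fine": {END: 1.0},
    }
    return table.get(prev_word, {END: 1.0})


def greedy_decode(
    context: str,
    step: Callable[[str, str], Scores],
    max_len: int = 10,
) -> List[str]:
    """Generate words one at a time, always picking the highest-scoring word."""
    prev = START  # cold start: the first step has no real previous word
    output: List[str] = []
    for _ in range(max_len):
        scores = step(context, prev)
        word = max(scores, key=scores.get)
        if word == END:
            break
        output.append(word)
        prev = word  # every later step depends on the word just chosen
    return output


if __name__ == "__main__":
    print(greedy_decode("how are you", toy_step))
```

Because each step feeds on the previously chosen word, a poor choice at the first step, which is conditioned only on the start symbol, carries through the rest of the output. This is the error accumulation the abstract refers to.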

research
03/05/2018

Query and Output: Generating Words by Querying Distributed Word Representations for Paraphrase Generation

Most recent approaches use the sequence-to-sequence model for paraphrase...
research
03/05/2018

Word Embedding Attention Network: Generating Words by Querying Distributed Word Representations for Paraphrase Generation

Most recent approaches use the sequence-to-sequence model for paraphrase...
research
07/01/2021

Modeling Target-side Inflection in Placeholder Translation

Placeholder translation systems enable the users to specify how a specif...
research
02/27/2019

Learning to Generate Questions by Learning What not to Generate

Automatic question generation is an important technique that can improve...
research
11/13/2019

Mark my Word: A Sequence-to-Sequence Approach to Definition Modeling

Defining words in a textual context is a useful task both for practical ...
research
04/11/2018

Text2Colors: Guiding Image Colorization through Text-Driven Palette Generation

In this paper, we propose a novel approach to generate multiple color pa...
research
02/28/2020

Metaphoric Paraphrase Generation

This work describes the task of metaphoric paraphrase generation, in whi...
