
Improvement of a dedicated model for open domain persona-aware dialogue generation

08/27/2020
by Qiang Han, et al.

This paper analyzes several speed and performance improvements proposed for the Transformer architecture in recent years, focusing on their application to training a dedicated model. The dedicated model studied here is an open-domain persona-aware dialogue generation model; the dataset consists of multi-turn short dialogues, and the total length of a single input sequence is no more than 105 tokens. Improvements to the Transformer architecture and its attention mechanism that target long-sequence processing are therefore not discussed in this paper. The source code for the experiments has been open-sourced: https://github.com/ghosthamlet/persona
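The 105-token cap on a single input sequence is the key constraint the paper works under: the persona facts plus the multi-turn history must fit into one short sequence. As a rough illustration of that constraint, and not the paper's actual preprocessing, the sketch below packs a persona and dialogue history into a fixed token budget; the special tokens, the toy tokenizer, and the left-truncation policy are all assumptions for illustration.

```python
# A minimal sketch (not the paper's actual code) of packing a persona and
# multi-turn dialogue history into the 105-token input budget the abstract
# describes. The special tokens, toy tokenizer, and left-truncation policy
# are assumptions; the real preprocessing lives in
# https://github.com/ghosthamlet/persona.

MAX_LEN = 105  # total length of a single input sequence, per the abstract


class WhitespaceTokenizer:
    """Toy stand-in for the model's real tokenizer (an assumption)."""

    def __init__(self):
        self.vocab = {}

    def encode(self, text):
        # Assign each new whitespace-separated token the next free id.
        return [self.vocab.setdefault(tok, len(self.vocab)) for tok in text.split()]


def build_input(persona, history, tokenizer):
    """Concatenate persona facts and alternating dialogue turns, then
    truncate from the left so the most recent context survives."""
    parts = ["[PERSONA]"] + list(persona)
    for i, turn in enumerate(history):
        speaker = "[USER]" if i % 2 == 0 else "[BOT]"
        parts.append(f"{speaker} {turn}")
    ids = tokenizer.encode(" ".join(parts))
    return ids[-MAX_LEN:]  # drop the oldest tokens if over budget


if __name__ == "__main__":
    tok = WhitespaceTokenizer()
    ids = build_input(
        persona=["i love hiking .", "i have two dogs ."],
        history=["hi , what do you do for fun ?", "mostly hiking with my dogs ."],
        tokenizer=tok,
    )
    print(len(ids), ids[:10])  # stays within the 105-token cap
```

Truncating from the left is one common choice because the most recent turns usually matter most for generating the next response; the repository linked above contains the authors' actual data pipeline.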

