Focus-Constrained Attention Mechanism for CVAE-based Response Generation

09/25/2020
by   Zhi Cui, et al.
0

To model diverse responses for a given post, one promising way is to introduce a latent variable into Seq2Seq models. The latent variable is supposed to capture the discourse-level information and encourage the informativeness of target responses. However, such discourse-level information is often too coarse for the decoder to be utilized. To tackle it, our idea is to transform the coarse-grained discourse-level information into fine-grained word-level information. Specifically, we firstly measure the semantic concentration of corresponding target response on the post words by introducing a fine-grained focus signal. Then, we propose a focus-constrained attention mechanism to take full advantage of focus in well aligning the input to the target response. The experimental results demonstrate that by exploiting the fine-grained signal, our model can generate more diverse and informative responses compared with several state-of-the-art models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/27/2020

Predict and Use Latent Patterns for Short-Text Conversation

Many neural network models nowadays have achieved promising performances...
research
06/18/2023

Focusing on Relevant Responses for Multi-modal Rumor Detection

In the absence of an authoritative statement about a rumor, people may e...
research
06/05/2019

Generating Multiple Diverse Responses with Multi-Mapping and Posterior Mapping Selection

In human conversation an input post is open to multiple potential respon...
research
04/03/2023

Identifiability of Cognitive Diagnosis Models with Polytomous Responses

Cognitive Diagnosis Models (CDMs) are a powerful statistical and psychom...
research
09/19/2020

Enhancing Dialogue Generation via Multi-Level Contrastive Learning

Most of the existing works for dialogue generation are data-driven model...
research
01/03/2023

Fast Parallel Algorithms for Enumeration of Simple, Temporal, and Hop-Constrained Cycles

Cycles are one of the fundamental subgraph patterns and being able to en...
research
06/30/2020

PLATO-2: Towards Building an Open-Domain Chatbot via Curriculum Learning

To build a high-quality open-domain chatbot, we introduce the effective ...

Please sign up or login with your details

Forgot password? Click here to reset