A Face-to-Face Neural Conversation Model

12/04/2018
by Hang Chu, et al.

Neural networks have recently become good at engaging in dialog. However, current approaches are based solely on verbal text and lack the richness of a real face-to-face conversation. We propose a neural conversation model that aims to read and generate facial gestures alongside text. This allows our model to adapt its response based on the "mood" of the conversation. In particular, we introduce an RNN encoder-decoder that exploits the movement of facial muscles as well as the verbal conversation. The decoder consists of two layers: the lower layer generates the verbal response and coarse facial expressions, while the upper layer fills in the subtle gestures, making the generated output smoother and more natural. We train our neural network by having it "watch" 250 movies. Through automatic metrics and a human study, we show that our joint face-text model generates more natural conversations. We demonstrate an example application with a face-to-face chatting avatar.
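The two-layer decoder described above can be illustrated with a minimal, untrained sketch. This is not the authors' implementation: the class name `FaceTextSketch`, the plain tanh RNN cells, the layer sizes, the greedy word choice, and the idea that the upper layer adds a residual to the coarse face vector are all assumptions made for illustration. It only shows the data flow: an encoder reads word and face features jointly, a lower decoder layer emits a word and a coarse face vector, and an upper layer refines the face output.

```python
import math
import random

def make_matrix(rows, cols, rng):
    # Small random weights; a real model would learn these from data.
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]

def matvec(m, v):
    return [sum(w * x for w, x in zip(row, v)) for row in m]

def rnn_step(w_in, w_rec, x, h):
    # Plain tanh RNN cell (an assumption; the abstract only says "RNN").
    a, b = matvec(w_in, x), matvec(w_rec, h)
    return [math.tanh(p + q) for p, q in zip(a, b)]

class FaceTextSketch:
    """Hypothetical joint face-text encoder-decoder, for illustration only."""

    def __init__(self, vocab=20, face_dim=4, hidden=8, seed=0):
        rng = random.Random(seed)
        self.vocab, self.face_dim, self.hidden = vocab, face_dim, hidden
        in_dim = vocab + face_dim  # one-hot word concatenated with face features
        self.enc_in = make_matrix(hidden, in_dim, rng)
        self.enc_rec = make_matrix(hidden, hidden, rng)
        self.low_in = make_matrix(hidden, in_dim, rng)
        self.low_rec = make_matrix(hidden, hidden, rng)
        self.word_out = make_matrix(vocab, hidden, rng)
        self.coarse_out = make_matrix(face_dim, hidden, rng)
        self.up_in = make_matrix(hidden, face_dim, rng)
        self.up_rec = make_matrix(hidden, hidden, rng)
        self.fine_out = make_matrix(face_dim, hidden, rng)

    def one_hot(self, token):
        return [1.0 if i == token else 0.0 for i in range(self.vocab)]

    def encode(self, tokens, faces):
        # Read the incoming utterance: one word and one face vector per step.
        h = [0.0] * self.hidden
        for tok, face in zip(tokens, faces):
            h = rnn_step(self.enc_in, self.enc_rec, self.one_hot(tok) + list(face), h)
        return h

    def decode(self, context, steps, start_token=0):
        h_low, h_up = context, [0.0] * self.hidden
        tok, face = start_token, [0.0] * self.face_dim
        words, gestures = [], []
        for _ in range(steps):
            # Lower layer: next word plus a coarse facial expression.
            h_low = rnn_step(self.low_in, self.low_rec, self.one_hot(tok) + face, h_low)
            logits = matvec(self.word_out, h_low)
            tok = max(range(self.vocab), key=logits.__getitem__)  # greedy pick
            coarse = matvec(self.coarse_out, h_low)
            # Upper layer: adds fine detail on top of the coarse expression
            # (modelled here as a residual, which is an assumption).
            h_up = rnn_step(self.up_in, self.up_rec, coarse, h_up)
            residual = matvec(self.fine_out, h_up)
            face = [c + r for c, r in zip(coarse, residual)]
            words.append(tok)
            gestures.append(face)
        return words, gestures

model = FaceTextSketch()
context = model.encode([1, 2, 3], [[0.1, 0.0, -0.1, 0.2]] * 3)
words, gestures = model.decode(context, steps=5)
```

With random weights the output is meaningless; the point is only that each decoding step yields one word index and one refined face vector, so the text and the facial gestures are generated in lockstep.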

research
08/21/2019

A Realistic Face-to-Face Conversation System based on Deep Neural Networks

To improve the experiences of face-to-face conversation with avatar, thi...
research
03/09/2015

Neural Responding Machine for Short-Text Conversation

We propose Neural Responding Machine (NRM), a neural network-based respo...
research
03/15/2022

DialogueNeRF: Towards Realistic Avatar Face-to-face Conversation Video Generation

Conversation is an essential component of virtual avatar activities in t...
research
06/11/2020

Let's face it: Probabilistic multi-modal interlocutor-aware generation of facial gestures in dyadic settings

To enable more natural face-to-face interactions, conversational agents ...
research
09/02/2021

Mechanical Chameleons: Evaluating the effects of a social robot's non-verbal behavior on social influence

In this paper we present a pilot study which investigates how non-verbal...
research
01/26/2021

Text2Gestures: A Transformer-Based Network for Generating Emotive Body Gestures for Virtual Agents

We present Text2Gestures, a transformer-based learning method to interac...
research
10/13/2020

Labeling the Phrase Set of the Conversation Agent, Rinna

Mapping spoken text to gestures is an important research area for robots...
