Animating an Autonomous 3D Talking Avatar

by   Dominik Borer, et al.

One of the main challenges with embodying a conversational agent is annotating how and when motions can be played and composed together in real-time, without any visual artifact. The inherent problem is to do so---for a large amount of motions---without introducing mistakes in the annotation. To our knowledge, there is no automatic method that can process animations and automatically label actions and compatibility between them. In practice, a state machine, where clips are the actions, is created manually by setting connections between the states with the timing parameters for these connections. Authoring this state machine for a large amount of motions leads to a visual overflow, and increases the amount of possible mistakes. In consequence, conversational agent embodiments are left with little variations and quickly become repetitive. In this paper, we address this problem with a compact taxonomy of chit chat behaviors, that we can utilize to simplify and partially automate the graph authoring process. We measured the time required to label actions of an embodiment using our simple interface, compared to the standard state machine interface in Unreal Engine, and found that our approach is 7 times faster. We believe that our labeling approach could be a path to automated labeling: once a sub-set of motions are labeled (using our interface), we could learn a prediction that could attribute a label to new clips---allowing to really scale up virtual agent embodiments.


page 1

page 4

page 6

page 7


Comparing Conventional and Conversational Search Interaction using Implicit Evaluation Methods

Conversational search applications offer the prospect of improved user e...

Annotating Motion Primitives for Simplifying Action Search in Reinforcement Learning

Reinforcement learning in large-scale environments is challenging due to...

Rates of convergence to the local time of Oscillating and Skew Brownian Motions

In this paper a class of statistics based on high frequency observations...

Autonomous Grounding of Visual Field Experience through Sensorimotor Prediction

In a developmental framework, autonomous robots need to explore the worl...

Motion Generation Considering Situation with Conditional Generative Adversarial Networks for Throwing Robots

When robots work in a cluttered environment, the constraints for motions...

Please sign up or login with your details

Forgot password? Click here to reset