Fine-grained Event Learning of Human-Object Interaction with LSTM-CRF

09/30/2017
by   Tuan Do, et al.
0

Event learning is one of the most important problems in AI. However, notwithstanding significant research efforts, it is still a very complex task, especially when the events involve the interaction of humans or agents with other objects, as it requires modeling human kinematics and object movements. This study proposes a methodology for learning complex human-object interaction (HOI) events, involving the recording, annotation and classification of event interactions. For annotation, we allow multiple interpretations of a motion capture by slicing over its temporal span, for classification, we use Long-Short Term Memory (LSTM) sequential models with Conditional Randon Field (CRF) for constraints of outputs. Using a setup involving captures of human-object interaction as three dimensional inputs, we argue that this approach could be used for event types involving complex spatio-temporal dynamics.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/01/2018

Hierarchical Long Short-Term Concurrent Memory for Human Interaction Recognition

In this paper, we aim to address the problem of human interaction recogn...
research
10/02/2017

Learning event representation: As sparse as possible, but not sparser

Selecting an optimal event representation is essential for event classif...
research
01/05/2020

Exploiting Event Cameras for Spatio-Temporal Prediction of Fast-Changing Trajectories

This paper investigates trajectory prediction for robotics, to improve t...
research
07/13/2022

Diverse Dance Synthesis via Keyframes with Transformer Controllers

Existing keyframe-based motion synthesis mainly focuses on the generatio...
research
02/14/2020

A Comparison of Pooling Methods on LSTM Models for Rare Acoustic Event Classification

Acoustic event classification (AEC) and acoustic event detection (AED) r...
research
01/10/2020

Matrix-LSTM: a Differentiable Recurrent Surface for Asynchronous Event-Based Data

Dynamic Vision Sensors (DVSs) asynchronously stream events in correspond...
research
02/07/2023

Fine-grained Affordance Annotation for Egocentric Hand-Object Interaction Videos

Object affordance is an important concept in hand-object interaction, pr...

Please sign up or login with your details

Forgot password? Click here to reset