Multimodal Generation of Novel Action Appearances for Synthetic-to-Real Recognition of Activities of Daily Living

08/03/2022
by   Zdravko Marinov, et al.
0

Domain shifts, such as appearance changes, are a key challenge in real-world applications of activity recognition models, which range from assistive robotics and smart homes to driver observation in intelligent vehicles. For example, while simulations are an excellent way of economical data collection, a Synthetic-to-Real domain shift leads to a > 60 recognizing activities of Daily Living (ADLs). We tackle this challenge and introduce an activity domain generation framework which creates novel ADL appearances (novel domains) from different existing activity modalities (source domains) inferred from video training data. Our framework computes human poses, heatmaps of body joints, and optical flow maps and uses them alongside the original RGB videos to learn the essence of source domains in order to generate completely new ADL domains. The model is optimized by maximizing the distance between the existing source appearances and the generated novel appearances while ensuring that the semantics of an activity is preserved through an additional classification loss. While source data multimodality is an important concept in this design, our setup does not rely on multi-sensor setups, (i.e., all source modalities are inferred from a single video only.) The newly created activity domains are then integrated in the training of the ADL classification networks, resulting in models far less susceptible to changes in data distributions. Extensive experiments on the Synthetic-to-Real benchmark Sims4Action demonstrate the potential of the domain generation paradigm for cross-domain ADL recognition, setting new state-of-the-art results. Our code is publicly available at https://github.com/Zrrr1997/syn2real_DG

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

research
07/12/2021

Let's Play for Action: Recognizing Activities of Daily Living by Learning from Life Simulation Video Games

Recognizing Activities of Daily Living (ADL) is a vital process for inte...
research
06/26/2018

Cross-position Activity Recognition with Stratified Transfer Learning

Human activity recognition aims to recognize the activities of daily liv...
research
03/27/2022

Audio-Adaptive Activity Recognition Across Video Domains

This paper strives for activity recognition under domain shift, for exam...
research
03/02/2022

TransDARC: Transformer-based Driver Activity Recognition with Latent Space Feature Calibration

Traditional video-based human activity recognition has experienced remar...
research
02/01/2022

Should I take a walk? Estimating Energy Expenditure from Video Data

We explore the problem of automatically inferring the amount of kilocalo...
research
05/17/2021

VPN++: Rethinking Video-Pose embeddings for understanding Activities of Daily Living

Many attempts have been made towards combining RGB and 3D poses for the ...
research
08/19/2022

ModSelect: Automatic Modality Selection for Synthetic-to-Real Domain Generalization

Modality selection is an important step when designing multimodal system...

Please sign up or login with your details

Forgot password? Click here to reset