Identity Preserve Transform: Understand What Activity Classification Models Have Learnt

12/13/2019
by Jialing Lyu, et al.

Activity classification has seen great success recently. Performance on small datasets is almost saturated, and the field is moving towards larger datasets. What leads to the performance gain, and what has the model learnt? In this paper we propose the identity preserve transform (IPT) to study this problem. IPT manipulates the nuisance factors of the data (background, viewpoint, etc.) while keeping the factors related to the task (human motion) unchanged. To our surprise, we found that popular models rely on highly correlated information (background, objects) to achieve high classification accuracy, rather than on the essential information (human motion). This can explain why an activity classification model usually fails to generalize to datasets it was not trained on. We implement IPT in two forms, an image-space transform and a 3D transform, using synthetic images. The tool will be made open-source to help study model and dataset design.
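To make the idea of an image-space transform concrete, here is a minimal sketch of one possible nuisance manipulation: replacing the background of a clip while leaving the person's pixels untouched. This is an illustration under stated assumptions, not the authors' implementation. The function name `swap_background`, the array layouts, and the assumption that a per-frame person mask is already available are all hypothetical.

```python
import numpy as np


def swap_background(frames, person_masks, new_background):
    """Composite the person pixels of each frame onto a new background.

    Assumed (hypothetical) inputs:
      frames:         (T, H, W, 3) uint8 video clip
      person_masks:   (T, H, W) bool, True where the person is
      new_background: (H, W, 3) uint8 replacement background
    """
    frames = np.asarray(frames)
    # Add a channel axis so the mask broadcasts over RGB.
    masks = np.asarray(person_masks, dtype=bool)[..., None]
    # Repeat the single background image for every frame.
    background = np.broadcast_to(new_background, frames.shape)
    # Keep person pixels (task-relevant), replace everything else (nuisance).
    return np.where(masks, frames, background)


if __name__ == "__main__":
    # Toy clip: 2 frames of 4x4 pixels, with a 2x2 "person" in the centre.
    clip = np.full((2, 4, 4, 3), 10, dtype=np.uint8)
    masks = np.zeros((2, 4, 4), dtype=bool)
    masks[:, 1:3, 1:3] = True
    new_bg = np.full((4, 4, 3), 200, dtype=np.uint8)

    transformed = swap_background(clip, masks, new_bg)
    print(transformed.shape)
```

If a classifier's prediction changes between `clip` and `transformed` even though the motion is identical, that suggests it is relying on background cues, which is the kind of behaviour the abstract reports.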


