Baidu-UTS Submission to the EPIC-Kitchens Action Recognition Challenge 2019

06/22/2019
by   Xiaohan Wang, et al.
0

In this report, we present the Baidu-UTS submission to the EPIC-Kitchens Action Recognition Challenge in CVPR 2019. This is the winning solution to this challenge. In this task, the goal is to predict verbs, nouns, and actions from the vocabulary for each video segment. The EPIC-Kitchens dataset contains various small objects, intense motion blur, and occlusions. It is challenging to locate and recognize the object that an actor interacts with. To address these problems, we utilize object detection features to guide the training of 3D Convolutional Neural Networks (CNN), which can significantly improve the accuracy of noun prediction. Specifically, we introduce a Gated Feature Aggregator module to learn from the clip feature and the object feature. This module can strengthen the interaction between the two kinds of activations and avoid gradient exploding. Experimental results demonstrate our approach outperforms other methods on both seen and unseen test set.

READ FULL TEXT
research
07/03/2020

Egocentric Action Recognition by Video Attention and Temporal Context

We present the submission of Samsung AI Centre Cambridge to the CVPR2020...
research
10/20/2022

Transformer-based Action recognition in hand-object interacting scenarios

This report describes the 2nd place solution to the ECCV 2022 Human Body...
research
04/19/2021

A Competitive Method to VIPriors Object Detection Challenge

In this report, we introduce the technical details of our submission to ...
research
06/28/2020

DHARI Report to EPIC-Kitchens 2020 Object Detection Challenge

In this report, we describe the technical details of oursubmission to th...
research
04/01/2021

Motion Guided Attention Fusion to Recognize Interactions from Videos

We present a dual-pathway approach for recognizing fine-grained interact...
research
08/22/2023

Opening the Vocabulary of Egocentric Actions

Human actions in egocentric videos are often hand-object interactions co...
research
11/12/2015

Hand-Object Interaction and Precise Localization in Transitive Action Recognition

Action recognition in still images has seen major improvement in recent ...

Please sign up or login with your details

Forgot password? Click here to reset