The Multi-Modal Video Reasoning and Analyzing Competition

08/18/2021
by Haoran Peng, et al.

In this paper, we introduce the Multi-Modal Video Reasoning and Analyzing Competition (MMVRAC) workshop, held in conjunction with ICCV 2021. The competition comprises four tracks: video question answering, skeleton-based action recognition, fisheye video-based action recognition, and person re-identification. The tracks are based on two datasets, SUTD-TrafficQA and UAV-Human. We summarize the top-performing methods submitted by the participants and report their results.
