Multi-Agent Reinforcement Learning Based Frame Sampling for Effective Untrimmed Video Recognition

07/31/2019
by   Wenhao Wu, et al.
14

Video Recognition has drawn great research interest and great progress has been made. A suitable frame sampling strategy can improve the accuracy and efficiency of recognition. However, mainstream solutions generally adopt hand-crafted frame sampling strategies for recognition. It could degrade the performance, especially in untrimmed videos, due to the variation of frame-level saliency. To this end, we concentrate on improving untrimmed video classification via developing a learning-based frame sampling strategy. We intuitively formulate the frame sampling procedure as multiple parallel Markov decision processes, each of which aims at picking out a frame/clip by gradually adjusting an initial sampling. Then we propose to solve the problems with multi-agent reinforcement learning (MARL). Our MARL framework is composed of a novel RNN-based context-aware observation network which jointly models context information among nearby agents and historical states of a specific agent, a policy network which generates the probability distribution over a predefined action space at each step and a classification network for reward calculation as well as final recognition. Extensive experimental results show that our MARL-based scheme remarkably outperforms hand-crafted strategies with various 2D and 3D baseline methods. Our single RGB model achieves a comparable performance of ActivityNet v1.3 champion submission with multi-modal multi-model fusion and new state-of-the-art results on YouTube Birds and YouTube Cars.

READ FULL TEXT

page 4

page 8

research
07/29/2020

Compare and Select: Video Summarization with Multi-Agent Reinforcement Learning

Video summarization aims at generating concise video summaries from the ...
research
04/10/2023

Improving ABR Performance for Short Video Streaming Using Multi-Agent Reinforcement Learning with Expert Guidance

In the realm of short video streaming, popular adaptive bitrate (ABR) al...
research
09/14/2023

Deep Multi-Agent Reinforcement Learning for Decentralized Active Hypothesis Testing

We consider a decentralized formulation of the active hypothesis testing...
research
03/03/2023

Toward Risk-based Optimistic Exploration for Cooperative Multi-Agent Reinforcement Learning

The multi-agent setting is intricate and unpredictable since the behavio...
research
05/19/2022

Beyond Greedy Search: Tracking by Multi-Agent Reinforcement Learning-based Beam Search

Existing trackers usually select a location or proposal with the maximum...
research
10/09/2017

Multitask training with unlabeled data for end-to-end sign language fingerspelling recognition

We address the problem of automatic American Sign Language fingerspellin...
research
08/07/2023

Mamba: Bringing Multi-Dimensional ABR to WebRTC

Contemporary real-time video communication systems, such as WebRTC, use ...

Please sign up or login with your details

Forgot password? Click here to reset