Actor and Action Video Segmentation from a Sentence

03/20/2018
by Kirill Gavrilyuk, et al.

This paper strives for pixel-level segmentation of actors and their actions in video content. Unlike existing works, which all learn to segment from a fixed vocabulary of actor and action pairs, we infer the segmentation from a natural language input sentence. This makes it possible to distinguish between fine-grained actors in the same super-category, identify actor and action instances, and segment pairs that are outside of the actor and action vocabulary. We propose a fully-convolutional model for pixel-level actor and action segmentation using an encoder-decoder architecture optimized for video. To show the potential of actor and action video segmentation from a sentence, we extend two popular actor and action datasets with more than 7,500 natural language descriptions. Experiments demonstrate the quality of the sentence-guided segmentations, the generalization ability of our model, and its advantage over the state of the art for traditional actor and action segmentation.

research
11/02/2020

Actor and Action Modular Network for Text-based Video Segmentation

The actor and action semantic segmentation is a challenging problem that...
research
12/02/2018

Multi-modal Capsule Routing for Actor and Action Video Segmentation Conditioned on Natural Language Queries

In this paper, we propose an end-to-end capsule network for pixel level ...
research
11/13/2013

A Study of Actor and Action Semantic Retention in Video Supervoxel Segmentation

Existing methods in the semantic computer vision community seem unable t...
research
11/22/2020

We don't Need Thousand Proposals: Single Shot Actor-Action Detection in Videos

We propose SSA2D, a simple yet effective end-to-end deep network for act...
research
05/14/2021

Collaborative Spatial-Temporal Modeling for Language-Queried Video Actor Segmentation

Language-queried video actor segmentation aims to predict the pixel-leve...
research
07/23/2018

Actor-Action Semantic Segmentation with Region Masks

In this paper, we study the actor-action semantic segmentation problem, ...
research
10/22/2018

Actor-Expert: A Framework for using Action-Value Methods in Continuous Action Spaces

Value-based approaches can be difficult to use in continuous action spac...
