Actor-Action Semantic Segmentation with Grouping Process Models

12/30/2015
by   Chenliang Xu, et al.
0

Actor-action semantic segmentation made an important step toward advanced video understanding problems: what action is happening; who is performing the action; and where is the action in space-time. Current models for this problem are local, based on layered CRFs, and are unable to capture long-ranging interaction of video parts. We propose a new model that combines these local labeling CRFs with a hierarchical supervoxel decomposition. The supervoxels provide cues for possible groupings of nodes, at various scales, in the CRFs to encourage adaptive, high-order groups for more effective labeling. Our model is dynamic and continuously exchanges information during inference: the local CRFs influence what supervoxels in the hierarchy are active, and these active nodes influence the connectivity in the CRF; we hence call it a grouping process model. The experimental results on a recent large-scale video dataset show a large margin of 60 demonstrates the effectiveness of the dynamic, bidirectional flow between labeling and grouping.

READ FULL TEXT

page 2

page 6

page 7

research
07/23/2018

Actor-Action Semantic Segmentation with Region Masks

In this paper, we study the actor-action semantic segmentation problem, ...
research
11/13/2013

A Study of Actor and Action Semantic Retention in Video Supervoxel Segmentation

Existing methods in the semantic computer vision community seem unable t...
research
11/02/2020

Actor and Action Modular Network for Text-based Video Segmentation

The actor and action semantic segmentation is a challenging problem that...
research
08/11/2016

Learning Dynamic Hierarchical Models for Anytime Scene Labeling

With increasing demand for efficient image and video analysis, test-time...
research
05/22/2023

HGFormer: Hierarchical Grouping Transformer for Domain Generalized Semantic Segmentation

Current semantic segmentation models have achieved great success under t...
research
05/26/2020

ALBA : Reinforcement Learning for Video Object Segmentation

We consider the challenging problem of zero-shot video object segmentati...
research
07/19/2018

Three for one and one for three: Flow, Segmentation, and Surface Normals

Optical flow, semantic segmentation, and surface normals represent diffe...

Please sign up or login with your details

Forgot password? Click here to reset