Learning to Segment Rigid Motions from Two Frames

01/11/2021
by   Gengshan Yang, et al.
5

Appearance-based detectors achieve remarkable performance on common scenes, but tend to fail for scenarios lack of training data. Geometric motion segmentation algorithms, however, generalize to novel scenes, but have yet to achieve comparable performance to appearance-based ones, due to noisy motion estimations and degenerate motion configurations. To combine the best of both worlds, we propose a modular network, whose architecture is motivated by a geometric analysis of what independent object motions can be recovered from an egomotion field. It takes two consecutive frames as input and predicts segmentation masks for the background and multiple rigidly moving objects, which are then parameterized by 3D rigid transformations. Our method achieves state-of-the-art performance for rigid motion segmentation on KITTI and Sintel. The inferred rigid motions lead to a significant improvement for depth and scene flow estimation. At the time of submission, our method ranked 1st on KITTI scene flow leaderboard, out-performing the best published method (scene flow error: 4.89

READ FULL TEXT

page 1

page 2

page 3

page 5

page 6

page 8

page 14

page 15

research
12/01/2020

RAFT-3D: Scene Flow using Rigid-Motion Embeddings

We address the problem of scene flow: given a pair of stereo or RGB-D vi...
research
07/26/2017

Cascaded Scene Flow Prediction using Semantic Segmentation

Given two consecutive frames from a pair of stereo cameras, 3D scene flo...
research
06/08/2023

Multi-body SE(3) Equivariance for Unsupervised Rigid Segmentation and Motion Estimation

A truly generalizable approach to rigid segmentation and motion estimati...
research
08/21/2022

Objects Can Move: 3D Change Detection by Geometric Transformation Constistency

AR/VR applications and robots need to know when the scene has changed. A...
research
03/24/2022

Quantum Motion Segmentation

Motion segmentation is a challenging problem that seeks to identify inde...
research
09/18/2022

SF2SE3: Clustering Scene Flow into SE(3)-Motions via Proposal and Selection

We propose SF2SE3, a novel approach to estimate scene dynamics in form o...
research
05/05/2021

Moving SLAM: Fully Unsupervised Deep Learning in Non-Rigid Scenes

We propose a method to train deep networks to decompose videos into 3D g...

Please sign up or login with your details

Forgot password? Click here to reset