Semantic Attention Flow Fields for Dynamic Scene Decomposition

03/02/2023
by   Yiqing Liang, et al.
0

We present SAFF: a dynamic neural volume reconstruction of a casual monocular video that consists of time-varying color, density, scene flow, semantics, and attention information. The semantics and attention let us identify salient foreground objects separately from the background in arbitrary spacetime views. We add two network heads to represent the semantic and attention information. For optimization, we design semantic attention pyramids from DINO-ViT outputs that trade detail with whole-image context. After optimization, we perform a saliency-aware clustering to decompose the scene. For evaluation on real-world dynamic scene decomposition across spacetime, we annotate object masks in the NVIDIA Dynamic Scene Dataset. We demonstrate that SAFF can decompose dynamic scenes without affecting RGB or depth reconstruction quality, that volume-integrated SAFF outperforms 2D baselines, and that SAFF improves foreground/background segmentation over recent static/dynamic split methods. Project Webpage: https://visual.cs.brown.edu/saff

READ FULL TEXT

page 5

page 6

page 7

page 8

page 13

page 14

page 17

page 18

research
03/20/2022

Stochastic Video Prediction with Structure and Motion

While stochastic video prediction models enable future prediction under ...
research
06/06/2022

Volumetric Disentanglement for 3D Scene Manipulation

Recently, advances in differential volumetric rendering enabled signific...
research
11/21/2022

Tensor4D : Efficient Neural 4D Decomposition for High-fidelity Dynamic Reconstruction and Rendering

We present Tensor4D, an efficient yet effective approach to dynamic scen...
research
03/15/2019

Live Reconstruction of Large-Scale Dynamic Outdoor Worlds

Standard 3D reconstruction pipelines assume stationary world, therefore ...
research
05/29/2023

Alignment-free HDR Deghosting with Semantics Consistent Transformer

High dynamic range (HDR) imaging aims to retrieve information from multi...
research
01/24/2023

K-Planes: Explicit Radiance Fields in Space, Time, and Appearance

We introduce k-planes, a white-box model for radiance fields in arbitrar...
research
11/30/2021

Hole-robust Wireframe Detection

"Wireframe" is a line segment based representation designed to well capt...

Please sign up or login with your details

Forgot password? Click here to reset