A Dynamic Multi-Scale Voxel Flow Network for Video Prediction

03/17/2023
by   Xiaotao Hu, et al.
0

The performance of video prediction has been greatly boosted by advanced deep neural networks. However, most of the current methods suffer from large model sizes and require extra inputs, e.g., semantic/depth maps, for promising performance. For efficiency consideration, in this paper, we propose a Dynamic Multi-scale Voxel Flow Network (DMVFN) to achieve better video prediction performance at lower computational costs with only RGB images, than previous methods. The core of our DMVFN is a differentiable routing module that can effectively perceive the motion scales of video frames. Once trained, our DMVFN selects adaptive sub-networks for different inputs at the inference stage. Experiments on several benchmarks demonstrate that our DMVFN is an order of magnitude faster than Deep Voxel Flow and surpasses the state-of-the-art iterative-based OPT on generated image quality. Our code and demo are available at https://huxiaotaostasy.github.io/DMVFN/.

READ FULL TEXT

page 4

page 6

page 7

page 12

research
02/08/2017

Video Frame Synthesis using Deep Voxel Flow

We address the problem of synthesizing new video frames in an existing v...
research
06/20/2023

Multi-Scale Occ: 4th Place Solution for CVPR 2023 3D Occupancy Prediction Challenge

In this report, we present the 4th place solution for CVPR 2023 3D occup...
research
05/12/2020

3DV: 3D Dynamic Voxel for Action Recognition in Depth Video

To facilitate depth-based 3D action recognition, 3D dynamic voxel (3DV) ...
research
04/06/2022

Multi-Scale Memory-Based Video Deblurring

Video deblurring has achieved remarkable progress thanks to the success ...
research
12/10/2019

Pillar in Pillar: Multi-Scale and Dynamic Feature Extraction for 3D Object Detection in Point Clouds

Sparsity and varied density are two of the main obstacles for 3D detecti...
research
10/25/2022

Salient Object Detection via Dynamic Scale Routing

Recent research advances in salient object detection (SOD) could largely...
research
12/26/2021

Learning Cross-Scale Prediction for Efficient Neural Video Compression

In this paper, we present the first neural video codec that can compete ...

Please sign up or login with your details

Forgot password? Click here to reset