Unsupervised Learning for Optical Flow Estimation Using Pyramid Convolution LSTM

07/26/2019
by   Shuosen Guan, et al.
25

Most of current Convolution Neural Network (CNN) based methods for optical flow estimation focus on learning optical flow on synthetic datasets with groundtruth, which is not practical. In this paper, we propose an unsupervised optical flow estimation framework named PCLNet. It uses pyramid Convolution LSTM (ConvLSTM) with the constraint of adjacent frame reconstruction, which allows flexibly estimating multi-frame optical flows from any video clip. Besides, by decoupling motion feature learning and optical flow representation, our method avoids complex short-cut connections used in existing frameworks while improving accuracy of optical flow estimation. Moreover, different from those methods using specialized CNN architectures for capturing motion, our framework directly learns optical flow from the features of generic CNNs and thus can be easily embedded in any CNN based frameworks for other tasks. Extensive experiments have verified that our method not only estimates optical flow effectively and accurately, but also obtains comparable performance on action recognition.

READ FULL TEXT

page 4

page 6

page 7

research
03/05/2021

Unsupervised Motion Representation Enhanced Network for Action Recognition

Learning reliable motion representation between consecutive frames, such...
research
08/21/2020

ATG-PVD: Ticketing Parking Violations on A Drone

In this paper, we introduce a novel suspect-and-investigate framework, w...
research
07/10/2020

STaRFlow: A SpatioTemporal Recurrent Cell for Lightweight Multi-Frame Optical Flow Estimation

We present a new lightweight CNN-based algorithm for multi-frame optical...
research
01/06/2022

EM-driven unsupervised learning for efficient motion segmentation

This paper presents a CNN-based fully unsupervised method for motion seg...
research
04/23/2023

TransFlow: Transformer as Flow Learner

Optical flow is an indispensable building block for various important co...
research
02/25/2020

ScopeFlow: Dynamic Scene Scoping for Optical Flow

We propose to modify the common training protocols of optical flow, lead...
research
01/25/2022

Semantically Video Coding: Instill Static-Dynamic Clues into Structured Bitstream for AI Tasks

Traditional media coding schemes typically encode image/video into a sem...

Please sign up or login with your details

Forgot password? Click here to reset