CRAFT: Cross-Attentional Flow Transformer for Robust Optical Flow

03/31/2022
by   Xiuchao Sui, et al.
4

Optical flow estimation aims to find the 2D motion field by identifying corresponding pixels between two images. Despite the tremendous progress of deep learning-based optical flow methods, it remains a challenge to accurately estimate large displacements with motion blur. This is mainly because the correlation volume, the basis of pixel matching, is computed as the dot product of the convolutional features of the two images. The locality of convolutional features makes the computed correlations susceptible to various noises. On large displacements with motion blur, noisy correlations could cause severe errors in the estimated flow. To overcome this challenge, we propose a new architecture "CRoss-Attentional Flow Transformer" (CRAFT), aiming to revitalize the correlation volume computation. In CRAFT, a Semantic Smoothing Transformer layer transforms the features of one frame, making them more global and semantically stable. In addition, the dot-product correlations are replaced with transformer Cross-Frame Attention. This layer filters out feature noises through the Query and Key projections, and computes more accurate correlations. On Sintel (Final) and KITTI (foreground) benchmarks, CRAFT has achieved new state-of-the-art performance. Moreover, to test the robustness of different models on large motions, we designed an image shifting attack that shifts input images to generate large artificial motions. Under this attack, CRAFT performs much more robustly than two representative methods, RAFT and GMA. The code of CRAFT is is available at https://github.com/askerlee/craft.

READ FULL TEXT

page 1

page 7

page 8

page 13

page 14

page 15

page 16

page 17

research
12/20/2022

CGCV:Context Guided Correlation Volume for Optical Flow Neural Networks

Optical flow, which computes the apparent motion from a pair of video fr...
research
03/04/2021

Optical Flow Estimation from a Single Motion-blurred Image

In most of computer vision applications, motion blur is regarded as an u...
research
03/26/2020

RAFT: Recurrent All-Pairs Field Transforms for Optical Flow

We introduce Recurrent All-Pairs Field Transforms (RAFT), a new deep net...
research
03/21/2022

Global Matching with Overlapping Attention for Optical Flow Estimation

Optical flow estimation is a fundamental task in computer vision. Recent...
research
04/23/2023

TransFlow: Transformer as Flow Learner

Optical flow is an indispensable building block for various important co...
research
01/15/2022

Learning Temporally and Semantically Consistent Unpaired Video-to-video Translation Through Pseudo-Supervision From Synthetic Optical Flow

Unpaired video-to-video translation aims to translate videos between a s...
research
05/27/2022

FlowNet-PET: Unsupervised Learning to Perform Respiratory Motion Correction in PET Imaging

To correct for breathing motion in PET imaging, an interpretable and uns...

Please sign up or login with your details

Forgot password? Click here to reset