Jointly Aligning Millions of Images with Deep Penalised Reconstruction Congealing

08/12/2019
by Roberto Annunziata, et al.

Extrapolating fine-grained pixel-level correspondences in a fully unsupervised manner from a large set of misaligned images can benefit several computer vision and graphics problems, e.g., co-segmentation, super-resolution, image edit propagation, structure-from-motion, and 3D reconstruction. Several joint image alignment and congealing techniques have been proposed to tackle this problem, but limited robustness to initialisation, difficulty scaling to large datasets, and limited alignment accuracy seem to hamper their wide applicability. To overcome these limitations, we propose an unsupervised joint alignment method that leverages a densely fused spatial transformer network to estimate the warping parameters for each image and a low-capacity auto-encoder whose reconstruction error is used as an auxiliary measure of joint alignment. Experimental results on digits from multiple versions of MNIST (i.e., original, perturbed, affNIST and infiMNIST) and faces from LFW show that our approach is capable of aligning millions of images with high accuracy and robustness to different levels and types of perturbation. Moreover, qualitative and quantitative results suggest that the proposed method outperforms state-of-the-art approaches in terms of both alignment quality and robustness to initialisation.
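
The idea of using reconstruction error as a measure of joint alignment can be illustrated with a toy sketch. This is not the method from the paper: no spatial transformer network is trained, a rank-1 linear auto-encoder (fitted by power iteration) stands in for the low-capacity auto-encoder, 1-D signals stand in for images, and circular shifts stand in for warps. All names (`shift`, `top_direction`, `reconstruction_error`, `template`) are hypothetical and chosen only for illustration.

```python
import math

def shift(signal, s):
    """Circularly shift a 1-D signal by s samples (a stand-in for a warp)."""
    n = len(signal)
    return [signal[(i - s) % n] for i in range(n)]

def top_direction(data, iters=200):
    """Power iteration for the leading right singular vector of `data`."""
    d = len(data[0])
    v = [1.0 / math.sqrt(d)] * d
    for _ in range(iters):
        # Project every sample onto v, then map back to input space.
        proj = [sum(x[j] * v[j] for j in range(d)) for x in data]
        w = [sum(proj[i] * data[i][j] for i in range(len(data))) for j in range(d)]
        norm = math.sqrt(sum(c * c for c in w))
        v = [c / norm for c in w]
    return v

def reconstruction_error(data):
    """Mean squared error of a rank-1 linear auto-encoder fitted to `data`."""
    v = top_direction(data)
    total = 0.0
    for x in data:
        code = sum(a * b for a, b in zip(x, v))  # 1-D bottleneck
        total += sum((a - code * b) ** 2 for a, b in zip(x, v))
    return total / len(data)

# Assumed toy data: one bump pattern, varied in gain and (optionally) position.
template = [0, 0, 1, 3, 1, 0, 0, 0, 0, 0, 0, 0]
shifts = [0, 1, 2, 3, 4, 5]
gains = [1.0, 0.8, 1.2, 0.9, 1.1, 1.0]

misaligned = [[g * p for p in shift(template, s)] for s, g in zip(shifts, gains)]
aligned = [[g * p for p in template] for g in gains]

err_misaligned = reconstruction_error(misaligned)
err_aligned = reconstruction_error(aligned)
print(err_aligned < err_misaligned)
```

Because the bottleneck can represent only one direction in input space, the aligned set (which differs only in gain) is reconstructed almost perfectly, while the shifted set is not. A low-capacity model therefore rewards alignment, which is the intuition behind using its reconstruction error as an alignment signal.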


