Transformable Bottleneck Networks

04/13/2019
by   Kyle Olszewski, et al.
10

We propose a novel approach to performing fine-grained 3D manipulation of image content via a convolutional neural network, which we call the Transformable Bottleneck Network (TBN). It applies given spatial transformations directly to a volumetric bottleneck within our encoder-bottleneck-decoder architecture. Multi-view supervision encourages the network to learn to spatially disentangle the feature space within the bottleneck. The resulting spatial structure can be manipulated with arbitrary spatial transformations. We demonstrate the efficacy of TBNs for novel view synthesis, achieving state-of-the-art results on a challenging benchmark. We demonstrate that the bottlenecks produced by networks trained for this task contain meaningful spatial structure that allows us to intuitively perform a variety of image manipulations in 3D, well beyond the rigid transformations seen during training. These manipulations include non-uniform scaling, non-rigid warping, and combining content from different images. Finally, we extract explicit 3D structure from the bottleneck, performing impressive 3D reconstruction from a single input image.

READ FULL TEXT

page 5

page 7

page 8

research
06/14/2021

Flow Guided Transformable Bottleneck Networks for Motion Retargeting

Human motion retargeting aims to transfer the motion of one person in a ...
research
01/11/2018

Non-Rigid Image Registration Using Self-Supervised Fully Convolutional Networks without Training Data

A novel non-rigid image registration algorithm is built upon fully convo...
research
07/18/2020

Volumetric Transformer Networks

Existing techniques to encode spatial invariance within deep convolution...
research
04/12/2020

Learning Spatial Relationships between Samples of Image Shapes

Many applications including image based classification and retrieval of ...
research
01/05/2016

Weakly-supervised Disentangling with Recurrent Transformations for 3D View Synthesis

An important problem for both graphics and vision is to synthesize novel...
research
10/06/2020

How Convolutional Neural Network Architecture Biases Learned Opponency and Colour Tuning

Recent work suggests that changing Convolutional Neural Network (CNN) ar...
research
08/16/2019

A Cooperative Autoencoder for Population-Based Regularization of CNN Image Registration

Spatial transformations are enablers in a variety of medical image analy...

Please sign up or login with your details

Forgot password? Click here to reset