Perspective Transformation Layer

01/14/2022
by Nishan Khatri, et al.

Incorporating geometric transformations that reflect the relative position changes between an observer and an object into computer vision and deep learning models has attracted much attention in recent years. However, existing proposals mainly focus on affine transformations, which cannot fully capture viewpoint changes. Furthermore, current solutions often use a neural network module to learn a single transformation matrix, which ignores the possibility of multiple viewpoints and adds extra module parameters that must be trained. In this paper, a perspective transformation layer (PT layer) is proposed to learn perspective transformations that not only model the geometries covered by affine transformations but also reflect viewpoint changes. In addition, the PT layer can be trained directly with gradient descent, like traditional layers such as convolutional layers, and a single PT layer can learn an adjustable number of viewpoints without training extra module parameters. The experiments and evaluations confirm the superiority of the proposed PT layer.
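To make the distinction between affine and perspective transformations concrete, the following is a minimal sketch in plain Python, not the paper's PT layer. It assumes the standard 3x3 homography formulation in homogeneous coordinates; the function name `apply_homography` and the two example matrices are hypothetical and chosen only for illustration. An affine matrix has last row `[0, 0, 1]`, so the divisor `w` is always 1 and parallel lines stay parallel. A perspective matrix has nonzero entries in the last row, so `w` varies with position and produces the foreshortening associated with a viewpoint change.

```python
def apply_homography(H, points):
    """Map 2D points through a 3x3 matrix H given as nested lists (row-major)."""
    out = []
    for x, y in points:
        u = H[0][0] * x + H[0][1] * y + H[0][2]
        v = H[1][0] * x + H[1][1] * y + H[1][2]
        w = H[2][0] * x + H[2][1] * y + H[2][2]  # equals 1 for any affine matrix
        out.append((u / w, v / w))               # perspective divide
    return out

# Corners of the unit square, listed counter-clockwise.
square = [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0), (0.0, 1.0)]

# Illustrative affine matrix (a horizontal shear): last row is [0, 0, 1].
affine = [[1.0, 0.5, 0.0],
          [0.0, 1.0, 0.0],
          [0.0, 0.0, 1.0]]

# Illustrative perspective matrix: the nonzero entry in the last row
# makes w depend on x, so points farther along x are scaled down.
perspective = [[1.0, 0.0, 0.0],
               [0.0, 1.0, 0.0],
               [0.5, 0.0, 1.0]]

affine_out = apply_homography(affine, square)
persp_out = apply_homography(perspective, square)

# Length of the left and right vertical edges after each transformation.
affine_left = affine_out[3][1] - affine_out[0][1]
affine_right = affine_out[2][1] - affine_out[1][1]
persp_left = persp_out[3][1] - persp_out[0][1]
persp_right = persp_out[2][1] - persp_out[1][1]
```

Under the shear, both vertical edges keep the same length, whereas under the perspective matrix the right edge comes out shorter than the left. That position-dependent scaling is what an affine matrix cannot express. A layer that learns several viewpoints could, under the same assumptions, hold several such matrices as trainable parameters, but how the paper parameterizes them is not stated in the abstract.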


