Differentiable Sensor Layouts for End-to-End Learning of Task-Specific Camera Parameters

04/28/2023
by   Hendrik Sommerhoff, et al.
0

The success of deep learning is frequently described as the ability to train all parameters of a network on a specific application in an end-to-end fashion. Yet, several design choices on the camera level, including the pixel layout of the sensor, are considered as pre-defined and fixed, and high resolution, regular pixel layouts are considered to be the most generic ones in computer vision and graphics, treating all regions of an image as equally important. While several works have considered non-uniform, , hexagonal or foveated, pixel layouts in hardware and image processing, the layout has not been integrated into the end-to-end learning paradigm so far. In this work, we present the first truly end-to-end trained imaging pipeline that optimizes the size and distribution of pixels on the imaging sensor jointly with the parameters of a given neural network on a specific task. We derive an analytic, differentiable approach for the sensor layout parameterization that allows for task-specific, local varying pixel resolutions. We present two pixel layout parameterization functions: rectangular and curvilinear grid shapes that retain a regular topology. We provide a drop-in module that approximates sensor simulation given existing high-resolution images to directly connect our method with existing deep learning models. We show that network predictions benefit from learnable pixel layouts for two different downstream tasks, classification and semantic segmentation.

READ FULL TEXT

page 2

page 6

page 8

research
07/26/2018

Superpixel Sampling Networks

Superpixels provide an efficient low/mid-level representation of image d...
research
04/21/2022

Physics vs. Learned Priors: Rethinking Camera and Algorithm Design for Task-Specific Imaging

Cameras were originally designed using physics-based heuristics to captu...
research
11/17/2016

DSAC - Differentiable RANSAC for Camera Localization

RANSAC is an important algorithm in robust optimization and a central bu...
research
02/12/2019

A system for generating complex physically accurate sensor images for automotive applications

We describe an open-source simulator that creates sensor irradiance and ...
research
08/21/2020

Beyond Fixed Grid: Learning Geometric Image Representation with a Deformable Grid

In modern computer vision, images are typically represented as a fixed u...
research
05/23/2016

Learning Sensor Multiplexing Design through Back-propagation

Recent progress on many imaging and vision tasks has been driven by the ...
research
04/09/2019

A Non-linear Differential CNN-Rendering Module for 3D Data Enhancement

In this work we introduce a differential rendering module which allows n...

Please sign up or login with your details

Forgot password? Click here to reset