See far with TPNET: a Tile Processor and a CNN Symbiosis

11/20/2018
by   Andrey Filippov, et al.
6

Throughout the evolution of the neural networks more specialized cells were added to the set of basic building blocks. These cells aim to improve training convergence, increase the overall performance, and reduce the number of required labels, all while preserving the expressive power of the universal network. Inspired by the partitioning of the human visual perception system between the eyes and the cerebral cortex, we present TPNET, which offloads universal and application-specific CNN from the bulk processing of the high resolution pixel data and performs the translation-variant image correction while delegating all non-linear decision making to the network. In this work, we explore application of TPNET to 3D perception with a narrow-baseline (0.0001-0.0025) quad stereo camera and prove that a trained network provides a disparity prediction from the 2D phase correlation output by the Tile Processor (TP) that is twice as accurate as the prediction from a carefully hand-crafted algorithm. The TP in turn reduces the dimensions of the input features of the network and provides instrument-invariant and translation-invariant data, making real-time high resolution stereo 3D perception feasible and easing the requirement to have a complete end-to-end network.

READ FULL TEXT

page 3

page 6

page 8

research
03/13/2021

ORStereo: Occlusion-Aware Recurrent Stereo Matching for 4K-Resolution Images

Stereo reconstruction models trained on small images do not generalize w...
research
11/23/2019

Deep-Learning Assisted High-Resolution Binocular Stereo Depth Reconstruction

This work presents dense stereo reconstruction using high-resolution ima...
research
06/06/2022

WHU-Stereo: A Challenging Benchmark for Stereo Matching of High-Resolution Satellite Images

Stereo matching of high-resolution satellite images (HRSI) is still a fu...
research
07/08/2018

Automatic Classification of Defective Photovoltaic Module Cells in Electroluminescence Images

Electroluminescence (EL) imaging is a useful modality for the inspection...
research
10/28/2021

Neural Disparity Refinement for Arbitrary Resolution Stereo

We introduce a novel architecture for neural disparity refinement aimed ...
research
04/11/2023

PixelRNN: In-pixel Recurrent Neural Networks for End-to-end-optimized Perception with Neural Sensors

Conventional image sensors digitize high-resolution images at fast frame...
research
06/30/2023

The Human Auditory System and Audio

This work reviews the human auditory system, elucidating some of the spe...

Please sign up or login with your details

Forgot password? Click here to reset