MiniNet: An extremely lightweight convolutional neural network for real-time unsupervised monocular depth estimation

06/27/2020
by   Jun Liu, et al.
69

Predicting depth from a single image is an attractive research topic since it provides one more dimension of information to enable machines to better perceive the world. Recently, deep learning has emerged as an effective approach to monocular depth estimation. As obtaining labeled data is costly, there is a recent trend to move from supervised learning to unsupervised learning to obtain monocular depth. However, most unsupervised learning methods capable of achieving high depth prediction accuracy will require a deep network architecture which will be too heavy and complex to run on embedded devices with limited storage and memory spaces. To address this issue, we propose a new powerful network with a recurrent module to achieve the capability of a deep network while at the same time maintaining an extremely lightweight size for real-time high performance unsupervised monocular depth prediction from video sequences. Besides, a novel efficient upsample block is proposed to fuse the features from the associated encoder layer and recover the spatial size of features with the small number of model parameters. We validate the effectiveness of our approach via extensive experiments on the KITTI dataset. Our new model can run at a speed of about 110 frames per second (fps) on a single GPU, 37 fps on a single CPU, and 2 fps on a Raspberry Pi 3. Moreover, it achieves higher depth accuracy with nearly 33 times fewer model parameters than state-of-the-art models. To the best of our knowledge, this work is the first extremely lightweight neural network trained on monocular video sequences for real-time unsupervised monocular depth estimation, which opens up the possibility of implementing deep learning-based real-time unsupervised monocular depth prediction on low-cost embedded devices.

READ FULL TEXT

page 10

page 13

page 23

page 25

page 27

page 30

research
06/29/2018

Towards real-time unsupervised monocular depth estimation on CPU

Unsupervised depth estimation from a single image is a very attractive t...
research
11/24/2021

MobileXNet: An Efficient Convolutional Neural Network for Monocular Depth Estimation

Depth is a vital piece of information for autonomous vehicles to perceiv...
research
03/08/2023

RM-Depth: Unsupervised Learning of Recurrent Monocular Depth in Dynamic Scenes

Unsupervised methods have showed promising results on monocular depth es...
research
03/17/2023

TKN: Transformer-based Keypoint Prediction Network For Real-time Video Prediction

Video prediction is a complex time-series forecasting task with great po...
research
08/24/2021

Real-Time Monocular Human Depth Estimation and Segmentation on Embedded Systems

Estimating a scene's depth to achieve collision avoidance against moving...
research
05/13/2019

Lightweight Monocular Depth Estimation Model by Joint End-to-End Filter pruning

Convolutional neural networks (CNNs) have emerged as the state-of-the-ar...
research
03/18/2023

Channel-Aware Distillation Transformer for Depth Estimation on Nano Drones

Autonomous navigation of drones using computer vision has achieved promi...

Please sign up or login with your details

Forgot password? Click here to reset