CReaM: Condensed Real-time Models for Depth Prediction using Convolutional Neural Networks

07/24/2018
by Andrew Spek, et al.

Since the resurgence of CNNs, the robotic vision community has developed a range of algorithms that perform classification, semantic segmentation and structure prediction (depths, normals, surface curvature) using neural networks. While some of these models achieve state-of-the-art results and superhuman performance, deploying them in a time-critical robotic environment remains an ongoing challenge. Real-time frameworks are of paramount importance for building a robotic society in which humans and robots integrate seamlessly. To this end, we present a novel real-time structure prediction framework that predicts depth at 30 fps on an NVIDIA-TX2. At the time of writing, this is the first work to showcase such a capability on a mobile platform. We also demonstrate through extensive experiments that neural networks with very large model capacities can be leveraged to train accurate condensed model architectures through "teacher-to-student" knowledge transfer.
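As a rough illustration of what teacher-to-student knowledge transfer can look like for depth regression, the sketch below blends a supervised term (student against ground truth) with an imitation term (student against the large teacher's prediction). The abstract does not specify the loss, so everything here is an assumption for illustration: the function names, the use of mean absolute error, and the weighting parameter `alpha` are hypothetical and are not taken from the paper.

```python
from typing import Sequence


def mean_abs_error(pred: Sequence[float], target: Sequence[float]) -> float:
    """Mean absolute difference between two equally sized depth vectors."""
    if len(pred) == 0 or len(pred) != len(target):
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(p - t) for p, t in zip(pred, target)) / len(pred)


def distillation_loss(
    student: Sequence[float],
    teacher: Sequence[float],
    ground_truth: Sequence[float],
    alpha: float = 0.5,
) -> float:
    """Hypothetical combined loss for a condensed (student) depth model.

    alpha weights the supervised term; (1 - alpha) weights the term that
    pulls the student towards the teacher's prediction. Both the weighting
    scheme and the L1 error are assumptions, not the paper's formulation.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    supervised = mean_abs_error(student, ground_truth)
    imitation = mean_abs_error(student, teacher)
    return alpha * supervised + (1.0 - alpha) * imitation


# Toy per-pixel depths in metres (made-up values for illustration only).
student_depth = [1.0, 2.0, 3.0]
teacher_depth = [1.5, 2.0, 2.5]
true_depth = [1.0, 2.5, 3.0]
loss = distillation_loss(student_depth, teacher_depth, true_depth, alpha=0.5)
```

Setting `alpha` to 1.0 recovers plain supervised training, while lower values lean more heavily on the teacher's output, which is one way a large-capacity model could guide a smaller one.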

