UberNet: Training a `Universal' Convolutional Neural Network for Low-, Mid-, and High-Level Vision using Diverse Datasets and Limited Memory

09/07/2016
by   Iasonas Kokkinos, et al.
0

In this work we introduce a convolutional neural network (CNN) that jointly handles low-, mid-, and high-level vision tasks in a unified architecture that is trained end-to-end. Such a universal network can act like a `swiss knife' for vision tasks; we call this architecture an UberNet to indicate its overarching nature. We address two main technical challenges that emerge when broadening up the range of tasks handled by a single CNN: (i) training a deep architecture while relying on diverse training sets and (ii) training many (potentially unlimited) tasks with a limited memory budget. Properly addressing these two problems allows us to train accurate predictors for a host of tasks, without compromising accuracy. Through these advances we train in an end-to-end manner a CNN that simultaneously addresses (a) boundary detection (b) normal estimation (c) saliency estimation (d) semantic segmentation (e) human part segmentation (f) semantic boundary detection, (g) region proposal generation and object detection. We obtain competitive performance while jointly addressing all of these tasks in 0.7 seconds per frame on a single GPU. A demonstration of this system can be found at http://cvn.ecp.fr/ubernet/.

READ FULL TEXT

page 1

page 11

page 12

research
04/23/2015

High-for-Low and Low-for-High: Efficient Boundary Detection from Deep Object Features and its Applications to High-Level Vision

Most of the current boundary detection systems rely exclusively on low-l...
research
05/07/2015

Object detection via a multi-region & semantic segmentation-aware CNN model

We propose an object detection system that relies on a multi-region deep...
research
01/17/2017

Convolutional Oriented Boundaries: From Image Segmentation to High-Level Tasks

We present Convolutional Oriented Boundaries (COB), which produces multi...
research
03/19/2020

HyNNA: Improved Performance for Neuromorphic Vision Sensor based Surveillance using Hybrid Neural Network Architecture

Applications in the Internet of Video Things (IoVT) domain have very tig...
research
04/19/2020

An end-to-end CNN framework for polarimetric vision tasks based on polarization-parameter-constructing network

Pixel-wise operations between polarimetric images are important for proc...
research
05/30/2016

End-to-End Instance Segmentation with Recurrent Attention

While convolutional neural networks have gained impressive success recen...
research
09/26/2022

Diversified Dynamic Routing for Vision Tasks

Deep learning models for vision tasks are trained on large datasets unde...

Please sign up or login with your details

Forgot password? Click here to reset