Minimizing the Effect of Noise and Limited Dataset Size in Image Classification Using Depth Estimation as an Auxiliary Task with Deep Multitask Learning

08/22/2022
by   Khashayar Namdar, et al.
10

Generalizability is the ultimate goal of Machine Learning (ML) image classifiers, for which noise and limited dataset size are among the major concerns. We tackle these challenges through utilizing the framework of deep Multitask Learning (dMTL) and incorporating image depth estimation as an auxiliary task. On a customized and depth-augmented derivation of the MNIST dataset, we show a) multitask loss functions are the most effective approach of implementing dMTL, b) limited dataset size primarily contributes to classification inaccuracy, and c) depth estimation is mostly impacted by noise. In order to further validate the results, we manually labeled the NYU Depth V2 dataset for scene classification tasks. As a contribution to the field, we have made the data in python native format publicly available as an open-source dataset and provided the scene labels. Our experiments on MNIST and NYU-Depth-V2 show dMTL improves generalizability of the classifiers when the dataset is noisy and the number of examples is limited.

READ FULL TEXT

page 4

page 5

page 6

page 8

page 10

research
04/13/2021

Single Image Depth Estimation: An Overview

We review solutions to the problem of depth estimation, arguably the mos...
research
09/20/2017

Estimated Depth Map Helps Image Classification

We consider image classification with estimated depth. This problem fall...
research
12/05/2020

Depth estimation from 4D light field videos

Depth (disparity) estimation from 4D Light Field (LF) images has been a ...
research
07/25/2019

MultiDepth: Single-Image Depth Estimation via Multi-Task Regression and Classification

We introduce MultiDepth, a novel training strategy and convolutional neu...
research
09/18/2023

Robust Geometry-Preserving Depth Estimation Using Differentiable Rendering

In this study, we address the challenge of 3D scene structure recovery f...
research
12/09/2019

Exploiting Dual-Gate Ambipolar CNFETs for Scalable Machine Learning Classification

Ambipolar carbon nanotube based field-effect transistors (AP-CNFETs) exh...
research
10/24/2011

Towards Holistic Scene Understanding: Feedback Enabled Cascaded Classification Models

Scene understanding includes many related sub-tasks, such as scene categ...

Please sign up or login with your details

Forgot password? Click here to reset