Master's Thesis: Deep Learning for Visual Recognition

10/18/2016
by Rémi Cadene, et al.

The goal of our research is to develop methods that advance automatic visual recognition. To predict the single label or multiple labels associated with an image, we study different kinds of Deep Neural Network architectures and methods for supervised feature learning. We first draw up a state-of-the-art review of Convolutional Neural Networks, aiming to understand the history behind this family of statistical models, the limits of modern architectures, and the novel techniques currently used to train deep CNNs. The originality of our work lies in our focus on tasks with a small amount of data. We introduce different models and techniques to achieve the best accuracy on several kinds of datasets, such as a medium-sized dataset of food recipes (100k images) for building a web API, or a small dataset of satellite images (6,000) for the DSG online challenge, which we won. We also review the state of the art in Weakly Supervised Learning, introducing different kinds of CNNs able to localize regions of interest. Our last contribution is a framework, built on top of Torch7, for training and testing deep models on any visual recognition task and on datasets of any scale.

research
11/30/2016

Attend in groups: a weakly-supervised deep learning framework for learning from web data

Large-scale datasets have driven the rapid development of deep neural ne...
research
03/23/2023

The effectiveness of MAE pre-pretraining for billion-scale pretraining

This paper revisits the standard pretrain-then-finetune paradigm used in...
research
05/17/2021

Rethinking "Batch" in BatchNorm

BatchNorm is a critical building block in modern convolutional neural ne...
research
05/22/2018

Training Convolutional Networks with Web Images

In this thesis we investigate the effect of using web images to build a ...
research
05/28/2022

Data Generation for Satellite Image Classification Using Self-Supervised Representation Learning

Supervised deep neural networks are the-state-of-the-art for many tasks ...
research
04/19/2015

DEEP-CARVING: Discovering Visual Attributes by Carving Deep Neural Nets

Most of the approaches for discovering visual attributes in images deman...
research
02/10/2021

Memory-Associated Differential Learning

Conventional Supervised Learning approaches focus on the mapping from in...
