Few-Shot Object Recognition from Machine-Labeled Web Images

12/19/2016
by   Zhongwen Xu, et al.
0

With the tremendous advances of Convolutional Neural Networks (ConvNets) on object recognition, we can now obtain reliable enough machine-labeled annotations easily by predictions from off-the-shelf ConvNets. In this work, we present an abstraction memory based framework for few-shot learning, building upon machine-labeled image annotations. Our method takes some large-scale machine-annotated datasets (e.g., OpenImages) as an external memory bank. In the external memory bank, the information is stored in the memory slots with the form of key-value, where image feature is regarded as key and label embedding serves as value. When queried by the few-shot examples, our model selects visually similar data from the external memory bank, and writes the useful information obtained from related external data into another memory bank, i.e., abstraction memory. Long Short-Term Memory (LSTM) controllers and attention mechanisms are utilized to guarantee the data written to the abstraction memory is correlated to the query example. The abstraction memory concentrates information from the external memory bank, so that it makes the few-shot recognition effective. In the experiments, we firstly confirm that our model can learn to conduct few-shot object recognition on clean human-labeled data from ImageNet dataset. Then, we demonstrate that with our model, machine-labeled image annotations are very effective and abundant resources to perform object recognition on novel categories. Experimental results show that our proposed model with machine-labeled annotations achieves great performance, only with a gap of 1

READ FULL TEXT

page 1

page 8

research
09/04/2015

Object Recognition from Short Videos for Robotic Perception

Deep neural networks have become the primary learning technique for obje...
research
12/10/2021

A Label Correction Algorithm Using Prior Information for Automatic and Accurate Geospatial Object Recognition

Thousands of scanned historical topographic maps contain valuable inform...
research
06/24/2016

Captioning Images with Diverse Objects

Recent captioning models are limited in their ability to scale and descr...
research
04/08/2023

Interpretable Multi Labeled Bengali Toxic Comments Classification using Deep Learning

This paper presents a deep learning-based pipeline for categorizing Beng...
research
02/18/2023

Neural Attention Memory

We propose a novel perspective of the attention mechanism by reinventing...
research
11/17/2016

Multimodal Memory Modelling for Video Captioning

Video captioning which automatically translates video clips into natural...
research
09/18/2017

Learning a Fully Convolutional Network for Object Recognition using very few Data

In recent years, data-driven methods have shown great success for extrac...

Please sign up or login with your details

Forgot password? Click here to reset