Self-tuned Visual Subclass Learning with Shared Samples: An Incremental Approach

05/22/2014
by Hossein Azizpour, et al.

Computer vision tasks are traditionally defined and evaluated using semantic categories. However, it is well known in the field that a semantic class does not necessarily correspond to a single visual class (e.g. the inside and the outside of a car). Furthermore, many of the feasible learning techniques at hand cannot model a visual class that appears consistent to the human eye. These problems have motivated the use of 1) unsupervised or supervised clustering as a preprocessing step to identify the visual subclasses to be used in a mixture-of-experts learning regime; 2) models such as the part model of Felzenszwalb et al. and related works, in which mixture assignment is represented by latent variables that are optimized during learning; and 3) highly non-linear classifiers, which are inherently capable of modelling a multi-modal input space but are inefficient at test time. In this work, we promote an incremental view of the recognition of semantic classes with varied appearances. We propose an optimization technique which incrementally finds maximal visual subclasses in a regularized risk minimization framework. Our proposed approach unifies the clustering and classification steps in a single algorithm. Its importance lies in its compliance with the classification task: it does not need to know a priori the number of clusters, the representation, or the similarity measures that clustering methods used as pre-processing require. Following this approach, we show both qualitatively and quantitatively significant results. We show that the visual subclasses follow a long-tail distribution. Finally, we show that state-of-the-art object detection methods (e.g. DPM) are unable to use the tails of this distribution, which comprise 50% of the training samples. In fact, we show that DPM performance slightly increases on average when this half of the data is removed.
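To make the motivation concrete, the following is a minimal sketch of baseline 1) from the abstract: cluster the positive samples as a preprocessing step, then train one linear expert per cluster. It is not the incremental method proposed in the paper, whose details the abstract does not give. The synthetic 2-D data, the k-means clustering, the ridge-regression classifier, and all function names are assumptions chosen for illustration. Note that the number of clusters `k` has to be supplied by hand here, which is exactly the prior knowledge the abstract says the proposed approach avoids.

```python
import numpy as np

def kmeans(X, k, iters=10):
    """Lloyd's k-means with deterministic farthest-point initialization (assumed choice)."""
    centers = [X[0]]
    for _ in range(1, k):
        # Squared distance from every point to its nearest already-chosen center.
        d = np.min([((X - c) ** 2).sum(axis=1) for c in centers], axis=0)
        centers.append(X[d.argmax()])
    centers = np.array(centers, dtype=float)
    for _ in range(iters):
        dists = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        for j in range(k):
            if np.any(labels == j):
                centers[j] = X[labels == j].mean(axis=0)
    return labels

def fit_ridge(X, y, lam=1.0):
    """Regularized least squares on +1/-1 targets; the bias term is regularized too."""
    Xb = np.hstack([X, np.ones((len(X), 1))])
    A = Xb.T @ Xb + lam * np.eye(Xb.shape[1])
    return np.linalg.solve(A, Xb.T @ y)

def scores(X, w):
    """Linear score w.x + b for each row of X."""
    return np.hstack([X, np.ones((len(X), 1))]) @ w

def accuracy(s, y):
    """Fraction of samples whose score sign matches the +1/-1 label."""
    return float(np.mean(np.where(s > 0, 1, -1) == y))

# Synthetic data (hypothetical): one semantic class with two visual modes,
# and a negative class lying between them.
rng = np.random.default_rng(0)
pos = np.vstack([rng.normal([3.0, 0.0], 0.2, size=(50, 2)),
                 rng.normal([-3.0, 0.0], 0.2, size=(50, 2))])
neg = rng.normal([0.0, 0.0], 0.2, size=(100, 2))
X = np.vstack([pos, neg])
y = np.concatenate([np.ones(len(pos)), -np.ones(len(neg))])

# A single linear classifier cannot place both positive modes on one side.
single_acc = accuracy(scores(X, fit_ridge(X, y)), y)

# Baseline 1): cluster the positives, then train one expert per cluster
# against all negatives and take the maximum expert score at test time.
labels = kmeans(pos, k=2)
experts = []
for j in range(2):
    Xj = np.vstack([pos[labels == j], neg])
    yj = np.concatenate([np.ones(int((labels == j).sum())), -np.ones(len(neg))])
    experts.append(fit_ridge(Xj, yj))
mixture_acc = accuracy(np.max([scores(X, w) for w in experts], axis=0), y)

print("single linear classifier accuracy:", single_acc)
print("mixture-of-experts accuracy:", mixture_acc)
```

On this toy data the per-cluster experts separate the classes while the single linear model does not, which illustrates why a semantic class with several visual modes is usually split into subclasses before or during learning.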


