Scalable Similarity Learning using Large Margin Neighborhood Embedding

04/24/2014
by   Zhaowen Wang, et al.
0

Classifying large-scale image data into object categories is an important problem that has received increasing research attention. Given the huge amount of data, non-parametric approaches such as nearest neighbor classifiers have shown promising results, especially when they are underpinned by a learned distance or similarity measurement. Although metric learning has been well studied in the past decades, most existing algorithms are impractical to handle large-scale data sets. In this paper, we present an image similarity learning method that can scale well in both the number of images and the dimensionality of image descriptors. To this end, similarity comparison is restricted to each sample's local neighbors and a discriminative similarity measure is induced from large margin neighborhood embedding. We also exploit the ensemble of projections so that high-dimensional features can be processed in a set of lower-dimensional subspaces in parallel without much performance compromise. The similarity function is learned online using a stochastic gradient descent algorithm in which the triplet sampling strategy is customized for quick convergence of classification performance. The effectiveness of our proposed model is validated on several data sets with scales varying from tens of thousands to one million images. Recognition accuracies competitive with the state-of-the-art performance are achieved with much higher efficiency and scalability.

READ FULL TEXT
research
05/25/2018

Large-scale Distance Metric Learning with Uncertainty

Distance metric learning (DML) has been studied extensively in the past ...
research
01/20/2022

Adaptive neighborhood Metric learning

In this paper, we reveal that metric learning would suffer from serious ...
research
03/02/2010

Scalable Large-Margin Mahalanobis Distance Metric Learning

For many machine learning algorithms such as k-Nearest Neighbor (k-NN) c...
research
07/04/2014

Improving Performance of Self-Organising Maps with Distance Metric Learning Method

Self-Organising Maps (SOM) are Artificial Neural Networks used in Patter...
research
03/30/2021

Structured Inverted-File k-Means Clustering for High-Dimensional Sparse Data

This paper presents an architecture-friendly k-means clustering algorith...
research
04/22/2014

Large Margin Image Set Representation and Classification

In this paper, we propose a novel image set representation and classific...
research
04/07/2015

Large Margin Nearest Neighbor Embedding for Knowledge Representation

Traditional way of storing facts in triplets ( head_entity, relation, ta...

Please sign up or login with your details

Forgot password? Click here to reset