Sparse online relative similarity learning

04/15/2021
by   Dezhong Yao, et al.
20

For many data mining and machine learning tasks, the quality of a similarity measure is the key for their performance. To automatically find a good similarity measure from datasets, metric learning and similarity learning are proposed and studied extensively. Metric learning will learn a Mahalanobis distance based on positive semi-definite (PSD) matrix, to measure the distances between objectives, while similarity learning aims to directly learn a similarity function without PSD constraint so that it is more attractive. Most of the existing similarity learning algorithms are online similarity learning method, since online learning is more scalable than offline learning. However, most existing online similarity learning algorithms learn a full matrix with d 2 parameters, where d is the dimension of the instances. This is clearly inefficient for high dimensional tasks due to its high memory and computational complexity. To solve this issue, we introduce several Sparse Online Relative Similarity (SORS) learning algorithms, which learn a sparse model during the learning process, so that the memory and computational cost can be significantly reduced. We theoretically analyze the proposed algorithms, and evaluate them on some real-world high dimensional datasets. Encouraging empirical results demonstrate the advantages of our approach in terms of efficiency and efficacy.

READ FULL TEXT

page 1

page 8

research
11/10/2014

Similarity Learning for High-Dimensional Sparse Data

A good measure of similarity between data points is crucial to many task...
research
04/05/2018

Large Scale Local Online Similarity/Distance Learning Framework based on Passive/Aggressive

Similarity/Distance measures play a key role in many machine learning, p...
research
10/07/2020

Low-Rank Robust Online Distance/Similarity Learning based on the Rescaled Hinge Loss

An important challenge in metric learning is scalability to both size an...
research
07/20/2018

Escaping the Curse of Dimensionality in Similarity Learning: Efficient Frank-Wolfe Algorithm and Generalization Bounds

Similarity and metric learning provides a principled approach to constru...
research
01/07/2017

Similarity Function Tracking using Pairwise Comparisons

Recent work in distance metric learning has focused on learning transfor...
research
05/05/2015

Learning Style Similarity for Searching Infographics

Infographics are complex graphic designs integrating text, images, chart...
research
06/27/2012

Similarity Learning for Provably Accurate Sparse Linear Classification

In recent years, the crucial importance of metrics in machine learning a...

Please sign up or login with your details

Forgot password? Click here to reset