Efficient Data Analytics on Augmented Similarity Triplets

12/27/2019
by Muhammad Ahmad, et al.

Many machine learning methods (classification, clustering, etc.) start with a known kernel that provides a similarity or distance measure between two objects. Recent work has extended this to settings where the only information available about the objects is a set of distance comparisons among three objects (triplets). Humans find this comparison task much easier than estimating absolute similarities, so such data can be obtained easily through crowd-sourcing. In this work, we first give an efficient method for augmenting triplet data by utilizing additional implicit information inferred from the existing triplets. Triplet augmentation improves the quality of both kernel-based and kernel-free data analytics tasks. Second, we propose a novel set of algorithms for common supervised and unsupervised machine learning tasks based on triplets. These methods work directly with triplets, avoiding kernel evaluations. Experimental evaluation on real and synthetic datasets shows that our methods are more accurate than the current best-known techniques.
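To make the idea of inferring implicit information from existing triplets concrete, the following is a minimal sketch and not the authors' algorithm, which the abstract does not describe. It assumes a triplet (a, b, c) means "a is closer to b than to c" and uses only the transitivity of that ordering for a fixed anchor a. The function name `augment_triplets` and the tuple encoding are hypothetical.

```python
from collections import defaultdict


def augment_triplets(triplets):
    """Return the input triplets plus those implied by transitivity.

    Assumption: each triplet (a, b, c) states d(a, b) < d(a, c).
    If d(a, b) < d(a, c) and d(a, c) < d(a, e), then d(a, b) < d(a, e).
    """
    # For each anchor a, store a directed graph with an edge b -> c
    # whenever the data says b is closer to a than c is.
    closer = defaultdict(lambda: defaultdict(set))
    for a, b, c in triplets:
        closer[a][b].add(c)

    augmented = set()
    for a, graph in closer.items():
        for start in list(graph):
            # Depth-first search: collect every object reachable from start,
            # i.e. every object known to be farther from a than start is.
            stack = list(graph[start])
            seen = set()
            while stack:
                node = stack.pop()
                if node in seen:
                    continue
                seen.add(node)
                stack.extend(graph.get(node, ()))
            # Skip far == start, which can only arise from contradictory input.
            augmented.update((a, start, far) for far in seen if far != start)
    return augmented
```

Given the triplets ("x", "p", "q") and ("x", "q", "r"), this sketch also yields ("x", "p", "r"). It does not detect or repair contradictory triplets, which crowd-sourced data may contain.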

research
10/17/2021

Noise-robust Clustering

This paper presents noise-robust clustering techniques in unsupervised m...
research
01/15/2020

Learning similarity measures from data

Defining similarity measures is a requirement for some machine learning ...
research
10/05/2018

Network Distance Based on Laplacian Flows on Graphs

Distance plays a fundamental role in measuring similarity between object...
research
05/31/2022

An optimal transport approach for selecting a representative subsample with application in efficient kernel density estimation

Subsampling methods aim to select a subsample as a surrogate for the obs...
research
10/02/2020

Attention-Based Clustering: Learning a Kernel from Context

In machine learning, no data point stands alone. We believe that context...
research
10/09/2019

Active ordinal tuplewise querying for similarity learning

Many machine learning tasks such as clustering, classification, and data...
research
07/02/2014

How Many Dissimilarity/Kernel Self Organizing Map Variants Do We Need?

In numerous applicative contexts, data are too rich and too complex to b...
