Active ordinal tuplewise querying for similarity learning

10/09/2019
by   Gregory Canal, et al.
0

Many machine learning tasks such as clustering, classification, and dataset search benefit from embedding data points in a space where distances reflect notions of relative similarity as perceived by humans. A common way to construct such an embedding is to request triplet similarity queries to an oracle, comparing two objects with respect to a reference. This work generalizes triplet queries to tuple queries of arbitrary size that ask an oracle to rank multiple objects against a reference, and introduces an efficient and robust adaptive selection method called InfoTuple that uses a novel approach to mutual information maximization. We show that the performance of InfoTuple at various tuple sizes exceeds that of the state-of-the-art adaptive triplet selection method on synthetic tests and new human response datasets, and empirically demonstrate the significant gains in efficiency and query consistency achieved by querying larger tuples instead of triplets.

READ FULL TEXT
research
11/06/2015

Active Perceptual Similarity Modeling with Auxiliary Information

Learning a model of perceptual similarity from a collection of objects i...
research
02/04/2022

Active metric learning and classification using similarity queries

Active learning is commonly used to train label-efficient models by adap...
research
06/02/2023

Fast Interactive Search with a Scale-Free Comparison Oracle

A comparison-based search algorithm lets a user find a target item t in ...
research
08/03/2020

Classification from Ambiguity Comparisons

Labeling data is an unavoidable pre-processing procedure for most machin...
research
10/05/2021

How to Query An Oracle? Efficient Strategies to Label Data

We consider the basic problem of querying an expert oracle for labeling ...
research
07/23/2022

Patent Search Using Triplet Networks Based Fine-Tuned SciBERT

In this paper, we propose a novel method for the prior-art search task. ...
research
12/27/2019

Efficient Data Analytics on Augmented Similarity Triplets

Many machine learning methods (classification, clustering, etc.) start w...

Please sign up or login with your details

Forgot password? Click here to reset