SoftSort: A Continuous Relaxation for the argsort Operator

06/29/2020
by   Sebastian Prillo, et al.
0

While sorting is an important procedure in computer science, the argsort operator - which takes as input a vector and returns its sorting permutation - has a discrete image and thus zero gradients almost everywhere. This prohibits end-to-end, gradient-based learning of models that rely on the argsort operator. A natural way to overcome this problem is to replace the argsort operator with a continuous relaxation. Recent work has shown a number of ways to do this, but the relaxations proposed so far are computationally complex. In this work we propose a simple continuous relaxation for the argsort operator which has the following qualities: it can be implemented in three lines of code, achieves state-of-the-art performance, is easy to reason about mathematically - substantially simplifying proofs - and is faster than competing approaches. We open source the code to reproduce all of the experiments and results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/21/2019

Stochastic Optimization of Sorting Networks via Continuous Relaxations

Sorting input objects is an important step in many machine learning pipe...
research
01/31/2023

Faster Predict-and-Optimize with Three-Operator Splitting

In many practical settings, a combinatorial problem must be repeatedly s...
research
12/12/2020

PiRank: Learning To Rank via Differentiable Sorting

A key challenge with machine learning approaches for ranking is the gap ...
research
02/15/2021

NeuralNDCG: Direct Optimisation of a Ranking Metric via Differentiable Relaxation of Sorting

Learning to Rank (LTR) algorithms are usually evaluated using Informatio...
research
02/23/2018

Learning Latent Permutations with Gumbel-Sinkhorn Networks

Permutations and matchings are core building blocks in a variety of late...
research
08/29/2013

GNCGCP - Graduated NonConvexity and Graduated Concavity Procedure

In this paper we propose the Graduated NonConvexity and Graduated Concav...
research
11/01/2019

Generalized Speedy Q-learning

In this paper, we derive a generalization of the Speedy Q-learning (SQL)...

Please sign up or login with your details

Forgot password? Click here to reset