
Efficient and Modular Implicit Differentiation
Automatic differentiation (autodiff) has revolutionized machine learning...
Implicit differentiation for fast hyperparameter selection in nonsmooth convex learning
Finding the optimal hyperparameters of a model can be cast as a bilevel ...
Self-Supervised Learning of Audio Representations from Permutations with Differentiable Ranking
Self-supervised pretraining using so-called "pretext" tasks has recentl...
Momentum Residual Neural Networks
The training of deep residual neural networks (ResNets) with backpropaga...
Differentiable Divergences Between Time Series
Computing the discrepancy between time series of variable sizes is notor...
Implicit differentiation of Lasso-type models for hyperparameter optimization
Setting regularization parameters for Lasso-type estimators is notorious...
Fast Differentiable Sorting and Ranking
The sorting operation is one of the most basic and commonly used buildin...
Learning with Differentiable Perturbed Optimizers
Machine learning pipelines often rely on optimization procedures to make...
Structured Prediction with Projection Oracles
We propose in this paper a general framework for deriving loss functions...
Geometric Losses for Distributional Learning
Building upon recent advances in entropy-regularized optimal transport, ...
Learning with Fenchel-Young Losses
Over the past decades, numerous loss functions have been proposed f...
Learning Classifiers with Fenchel-Young Losses: Generalized Entropies, Margins, and Algorithms
We study in this paper Fenchel-Young losses, a generic way to construct ...
Blind Source Separation with Optimal Transport Nonnegative Matrix Factorization
Optimal transport as a loss for machine learning optimization problems h...
SparseMAP: Differentiable Sparse Structured Inference
Structured prediction requires searching over a combinatorial number of ...
Differentiable Dynamic Programming for Structured Prediction and Attention
Dynamic programming (DP) solves a variety of structured combinatorial pr...
Large-Scale Optimal Transport and Mapping Estimation
This paper presents a novel two-step approach for the fundamental proble...
Smooth and Sparse Optimal Transport
Entropic regularization is quickly emerging as a new standard in optimal...
A Regularized Framework for Sparse and Structured Neural Attention
Modern neural networks are often augmented with an attention mechanism, ...
Multi-output Polynomial Networks and Factorization Machines
Factorization machines and polynomial networks are supervised polynomial...
Soft-DTW: a Differentiable Loss Function for Time-Series
We propose in this paper a differentiable learning loss between time ser...
Polynomial Networks and Factorization Machines: New Insights and Efficient Training Algorithms
Polynomial networks and factorization machines are two recently-proposed...
Higher-Order Factorization Machines
Factorization machines (FMs) are a supervised learning approach that can...
