Implicit MLE: Backpropagating Through Discrete Exponential Family Distributions

06/03/2021
by   Mathias Niepert, et al.
0

Integrating discrete probability distributions and combinatorial optimization problems into neural networks has numerous applications but poses several challenges. We propose Implicit Maximum Likelihood Estimation (I-MLE), a framework for end-to-end learning of models combining discrete exponential family distributions and differentiable neural components. I-MLE is widely applicable: it only requires the ability to compute the most probable states; and does not rely on smooth relaxations. The framework encompasses several approaches, such as perturbation-based implicit differentiation and recent methods to differentiate through black-box combinatorial solvers. We introduce a novel class of noise distributions for approximating marginals via perturb-and-MAP. Moreover, we show that I-MLE simplifies to maximum likelihood estimation when used in some recently studied learning settings that involve combinatorial solvers. Experiments on several datasets suggest that I-MLE is competitive with and often outperforms existing approaches which rely on problem-specific relaxations.

READ FULL TEXT

page 18

page 20

page 21

research
11/29/2019

Maximum likelihood estimation for discrete exponential families and random graphs

We characterize the existence of maximum likelihood estimators for discr...
research
09/11/2022

Adaptive Perturbation-Based Gradient Estimation for Discrete Latent Variable Models

The integration of discrete algorithmic components in deep learning arch...
research
03/04/2015

Bethe Learning of Conditional Random Fields via MAP Decoding

Many machine learning tasks can be formulated in terms of predicting str...
research
10/27/2022

Learning Discrete Directed Acyclic Graphs via Backpropagation

Recently continuous relaxations have been proposed in order to learn Dir...
research
06/06/2021

Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation

The shortcomings of maximum likelihood estimation in the context of mode...
research
05/19/2022

Classifying one-dimensional discrete models with maximum likelihood degree one

We propose a classification of all one-dimensional discrete statistical ...
research
10/21/2021

Generalization of Neural Combinatorial Solvers Through the Lens of Adversarial Robustness

End-to-end (geometric) deep learning has seen first successes in approxi...

Please sign up or login with your details

Forgot password? Click here to reset