On the Learning and Learnablity of Quasimetrics

06/30/2022
by   Tongzhou Wang, et al.
0

Our world is full of asymmetries. Gravity and wind can make reaching a place easier than coming back. Social artifacts such as genealogy charts and citation graphs are inherently directed. In reinforcement learning and control, optimal goal-reaching strategies are rarely reversible (symmetrical). Distance functions supported on these asymmetrical structures are called quasimetrics. Despite their common appearance, little research has been done on the learning of quasimetrics. Our theoretical analysis reveals that a common class of learning algorithms, including unconstrained multilayer perceptrons (MLPs), provably fails to learn a quasimetric consistent with training data. In contrast, our proposed Poisson Quasimetric Embedding (PQE) is the first quasimetric learning formulation that both is learnable with gradient-based optimization and enjoys strong performance guarantees. Experiments on random graphs, social graphs, and offline Q-learning demonstrate its effectiveness over many common baselines.

READ FULL TEXT
research
04/03/2023

Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning

In goal-reaching reinforcement learning (RL), the optimal value function...
research
06/15/2020

Neural Certificates for Safe Control Policies

This paper develops an approach to learn a policy of a dynamical system ...
research
06/07/2022

How Far I'll Go: Offline Goal-Conditioned Reinforcement Learning via f-Advantage Regression

Offline goal-conditioned reinforcement learning (GCRL) promises general-...
research
01/07/2019

Credit Assignment Techniques in Stochastic Computation Graphs

Stochastic computation graphs (SCGs) provide a formalism to represent st...
research
08/03/2016

Fast and Simple Optimization for Poisson Likelihood Models

Poisson likelihood models have been prevalently used in imaging, social ...
research
01/27/2019

Bipartitioning of directed and mixed random graphs

We show that an intricate relation of cluster properties and optimal bip...
research
09/13/2017

Mitigating Overexposure in Viral Marketing

In traditional models for word-of-mouth recommendations and viral market...

Please sign up or login with your details

Forgot password? Click here to reset