Measuring Generalization with Optimal Transport

06/07/2021
by   Ching-Yao Chuang, et al.
0

Understanding the generalization of deep neural networks is one of the most important tasks in deep learning. Although much progress has been made, theoretical error bounds still often behave disparately from empirical observations. In this work, we develop margin-based generalization bounds, where the margins are normalized with optimal transport costs between independent random subsets sampled from the training distribution. In particular, the optimal transport cost can be interpreted as a generalization of variance which captures the structural properties of the learned feature space. Our bounds robustly predict the generalization error, given training data and network parameters, on large scale datasets. Theoretically, we demonstrate that the concentration and separation of features play crucial roles in generalization, supporting empirical results in the literature. The code is available at <https://github.com/chingyaoc/kV-Margin>.

READ FULL TEXT
research
12/14/2021

Inductive Semi-supervised Learning Through Optimal Transport

In this paper, we tackle the inductive semi-supervised learning problem ...
research
11/02/2022

Instance-Dependent Generalization Bounds via Optimal Transport

Existing generalization bounds fail to explain crucial factors that driv...
research
06/05/2021

k-Mixup Regularization for Deep Learning via Optimal Transport

Mixup is a popular regularization technique for training deep neural net...
research
02/10/2023

Predicting Out-of-Distribution Error with Confidence Optimal Transport

Out-of-distribution (OOD) data poses serious challenges in deployed mach...
research
06/22/2020

An Optimal Transport Kernel for Feature Aggregation and its Relationship to Attention

We introduce a kernel for sets of features based on an optimal transport...
research
12/04/2020

Representation Based Complexity Measures for Predicting Generalization in Deep Learning

Deep Neural Networks can generalize despite being significantly overpara...
research
05/25/2023

Characterizing Out-of-Distribution Error via Optimal Transport

Out-of-distribution (OOD) data poses serious challenges in deployed mach...

Please sign up or login with your details

Forgot password? Click here to reset