How Does Independence Help Generalization? Sample Complexity of ERM on Product Distributions

12/13/2022
by Tao Lin, et al.

While many classical notions of learnability (e.g., PAC learnability) are distribution-free, exploiting the specific structure of an input distribution may improve learning performance. For example, a product distribution on a multi-dimensional input space has a much simpler structure than a correlated distribution. A recent paper [GHTZ21] shows that the sample complexity of a general learning problem on product distributions is polynomial in the input dimension, which is exponentially smaller than that on correlated distributions. However, the learning algorithm used there is not the standard Empirical Risk Minimization (ERM) algorithm. In this note, we characterize the sample complexity of ERM in a general learning problem on product distributions. We show that, even though product distributions are simpler than correlated distributions, ERM still needs exponentially many samples to learn on product distributions, rather than polynomially many. We conclude that a product distribution by itself does not make a learning problem easier: an algorithm designed specifically for product distributions is needed.
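
To make the setting concrete, the following is a minimal sketch of ERM run on samples from a product distribution. It is an illustration only, not the construction analyzed in the note: the Bernoulli coordinates, the finite hypothesis set, and the names `sample_product`, `erm`, and `abs_count_loss` are all assumptions introduced here.

```python
import random
from typing import Callable, List, Sequence, Tuple, TypeVar

Sample = Tuple[int, ...]
H = TypeVar("H")


def sample_product(ps: Sequence[float], m: int, rng: random.Random) -> List[Sample]:
    """Draw m i.i.d. samples from a product distribution on {0,1}^n.

    Coordinate i is Bernoulli(ps[i]) and is drawn independently of the
    other coordinates (an assumed, illustrative choice of distribution).
    """
    return [tuple(int(rng.random() < p) for p in ps) for _ in range(m)]


def erm(
    hypotheses: Sequence[H],
    samples: Sequence[Sample],
    loss: Callable[[H, Sample], float],
) -> H:
    """Return a hypothesis that minimizes the average loss on the samples."""

    def empirical_risk(h: H) -> float:
        return sum(loss(h, x) for x in samples) / len(samples)

    return min(hypotheses, key=empirical_risk)


def abs_count_loss(k: int, x: Sample) -> float:
    """Hypothetical loss: how far the guess k is from the number of ones in x."""
    return abs(k - sum(x))


if __name__ == "__main__":
    rng = random.Random(0)
    ps = [0.9, 0.5, 0.1]  # one marginal per input dimension
    samples = sample_product(ps, m=200, rng=rng)
    best = erm(range(len(ps) + 1), samples, abs_count_loss)
    print("ERM hypothesis:", best)
```

Note that `erm` never uses the fact that the coordinates are independent; it treats the samples exactly as it would samples from a correlated distribution, which is the sense in which ERM is not designed specifically for product distributions.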


research
07/15/2020

Downsampling for Testing and Learning in Product Distributions

We study the domain reduction problem of eliminating dependence on n fro...
research
11/27/2019

Generalizing Complex Hypotheses on Product Distributions: Auctions, Prophet Inequalities, and Pandora's Problem

This paper explores a theory of generalization for learning problems on ...
research
07/12/2021

Forster Decomposition and Learning Halfspaces with Noise

A Forster transform is an operation that turns a distribution into one w...
research
12/05/2019

On the Sample Complexity of Learning Sum-Product Networks

Sum-Product Networks (SPNs) can be regarded as a form of deep graphical ...
research
12/17/2018

Differentially Private Empirical Risk Minimization in Non-interactive Local Model via Polynomial of Inner Product Approximation

In this paper, we study the Empirical Risk Minimization problem in the n...
research
12/29/2020

Testing Product Distributions: A Closer Look

We study the problems of identity and closeness testing of n-dimensional...
research
05/23/2018

Tight Bounds for Collaborative PAC Learning via Multiplicative Weights

We study the collaborative PAC learning problem recently proposed in Blu...
