Tree Learning: Optimal Algorithms and Sample Complexity

02/09/2023
by   Dmitrii Avdiukhin, et al.

We study the problem of learning a hierarchical tree representation of data from labeled samples drawn from an arbitrary (and possibly adversarial) distribution. Consider a collection of data tuples labeled according to their hierarchical structure. The smallest number of such tuples needed to accurately label subsequent tuples is of interest for data collection in machine learning. We present optimal sample complexity bounds for this problem in several learning settings, including (agnostic) PAC learning and online learning. Our results are based on tight bounds on the Natarajan and Littlestone dimensions of the associated problem. The corresponding tree classifiers can be constructed efficiently in near-linear time.
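To make "tuples labeled according to their hierarchical structure" concrete, here is a minimal Python sketch. The abstract does not specify the tuple size or the labeling rule, so the sketch assumes triples whose label is the pair with the deepest lowest common ancestor in the tree. The toy hierarchy `PARENT` and the helper names are hypothetical and are not taken from the paper.

```python
from itertools import combinations

# Hypothetical toy hierarchy: each node maps to its parent; the root maps to None.
PARENT = {
    "root": None,
    "animals": "root",
    "plants": "root",
    "cat": "animals",
    "dog": "animals",
    "oak": "plants",
}

def ancestors(node):
    """Return the path from node up to the root, inclusive."""
    path = []
    while node is not None:
        path.append(node)
        node = PARENT[node]
    return path

def lca_depth(a, b):
    """Depth (root = 0) of the lowest common ancestor of a and b."""
    on_path_a = set(ancestors(a))
    for node in ancestors(b):
        if node in on_path_a:
            # First shared node on the way up from b is the lowest common ancestor.
            return len(ancestors(node)) - 1
    raise ValueError("nodes are not in the same tree")

def label_triple(a, b, c):
    """Assumed labeling rule: the pair whose lowest common ancestor is deepest."""
    pairs = list(combinations((a, b, c), 2))
    return max(pairs, key=lambda p: lca_depth(*p))
```

Under these assumptions, `label_triple("cat", "dog", "oak")` picks the pair `("cat", "dog")`, since those two leaves share the ancestor `animals` while either one meets `oak` only at the root. A learner in this setting would see such labeled triples and try to recover a tree that labels unseen triples consistently.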


research
03/01/2021

Robust learning under clean-label attack

We study the problem of robust learning under clean-label data-poisoning...
research
12/05/2022

Bagging is an Optimal PAC Learner

Determining the optimal sample complexity of PAC learning in the realiza...
research
05/22/2018

Improved Algorithms for Collaborative PAC Learning

We study a recent model of collaborative PAC learning where k players wi...
research
12/07/2020

VC Dimension and Distribution-Free Sample-Based Testing

We consider the problem of determining which classes of functions can be...
research
10/27/2014

Maximally Informative Hierarchical Representations of High-Dimensional Data

We consider a set of probabilistic functions of some input variables as ...
research
02/03/2021

Adversarially Robust Learning with Unknown Perturbation Sets

We study the problem of learning predictors that are robust to adversari...
research
06/07/2021

Chow-Liu++: Optimal Prediction-Centric Learning of Tree Ising Models

We consider the problem of learning a tree-structured Ising model from d...
