On the Sample Complexity of Learning Bayesian Networks

02/13/2013
by   Nir Friedman, et al.
0

In recent years there has been an increasing interest in learning Bayesian networks from data. One of the most effective methods for learning such networks is based on the minimum description length (MDL) principle. Previous work has shown that this learning procedure is asymptotically successful: with probability one, it will converge to the target distribution, given a sufficient number of samples. However, the rate of this convergence has been hitherto unknown. In this work we examine the sample complexity of MDL based learning procedures for Bayesian networks. We show that the number of samples needed to learn an epsilon-close approximation (in terms of entropy distance) with confidence delta is O((1/epsilon)^(4/3)log(1/epsilon)log(1/delta)loglog (1/delta)). This means that the sample complexity is a low-order polynomial in the error threshold and sub-linear in the confidence bound. We also discuss how the constants in this term depend on the complexity of the target distribution. Finally, we address questions of asymptotic minimality and propose a method for using the sample complexity results to speed up the learning process.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

research
06/27/2012

On the Number of Samples Needed to Learn the Correct Structure of a Bayesian Network

Bayesian Networks (BNs) are useful tools giving a natural and compact re...
research
03/12/2018

Learning Binary Bayesian Networks in Polynomial Time and Sample Complexity

We consider the problem of structure learning for binary Bayesian networ...
research
03/25/2021

Active Structure Learning of Bayesian Networks in an Observational Setting

We study active structure learning of Bayesian networks in an observatio...
research
03/15/2023

On the Benefits of Leveraging Structural Information in Planning Over the Learned Model

Model-based Reinforcement Learning (RL) integrates learning and planning...
research
05/12/2015

Incorporating Type II Error Probabilities from Independence Tests into Score-Based Learning of Bayesian Network Structure

We give a new consistent scoring function for structure learning of Baye...
research
07/13/2017

A Brief Study of In-Domain Transfer and Learning from Fewer Samples using A Few Simple Priors

Domain knowledge can often be encoded in the structure of a network, suc...
research
07/26/2022

The Sample Complexity of Forecast Aggregation

We consider a Bayesian forecast aggregation model where n experts, after...

Please sign up or login with your details

Forgot password? Click here to reset