Learning Bayesian Network Structure from Massive Datasets: The "Sparse Candidate" Algorithm

01/23/2013
by   Nir Friedman, et al.
0

Learning Bayesian networks is often cast as an optimization problem, where the computational task is to find a structure that maximizes a statistically motivated score. By and large, existing learning tools address this optimization problem using standard heuristic search techniques. Since the search space is extremely large, such search procedures can spend most of the time examining candidates that are extremely unreasonable. This problem becomes critical when we deal with data sets that are large either in the number of instances, or the number of attributes. In this paper, we introduce an algorithm that achieves faster learning by restricting the search space. This iterative algorithm restricts the parents of each variable to belong to a small subset of candidates. We then search for a network that satisfies these constraints. The learned network is then used for selecting better candidates for the next iteration. We evaluate this algorithm both on synthetic and real-life data. Our results show that it is significantly faster than alternative search procedures without loss of quality in the learned structures.

READ FULL TEXT

page 1

page 2

page 3

page 5

page 6

page 8

page 9

page 10

research
07/04/2012

Ordering-Based Search: A Simple and Effective Algorithm for Learning Bayesian Networks

One of the basic tasks for Bayesian networks (BNs) is that of learning a...
research
08/08/2020

Learning (Re-)Starting Solutions for Vehicle Routing Problems

A key challenge in solving a combinatorial optimization problem is how t...
research
07/19/2017

Entropy-based Pruning for Learning Bayesian Networks using BIC

For decomposable score-based structure learning of Bayesian networks, ex...
research
06/27/2012

Smoothness and Structure Learning by Proxy

As data sets grow in size, the ability of learning methods to find struc...
research
08/27/2020

Learning All Credible Bayesian Network Structures for Model Averaging

A Bayesian network is a widely used probabilistic graphical model with a...
research
11/18/2021

Interactive Set Discovery

We study the problem of set discovery where given a few example tuples o...
research
03/14/2016

Rapid building detection using machine learning

This work describes algorithms for performing discrete object detection,...

Please sign up or login with your details

Forgot password? Click here to reset