MIxBN: library for learning Bayesian networks from mixed data

06/24/2021
by   Anna V. Bubnova, et al.
0

This paper describes a new library for learning Bayesian networks from data containing discrete and continuous variables (mixed data). In addition to the classical learning methods on discretized data, this library proposes its algorithm that allows structural learning and parameters learning from mixed data without discretization since data discretization leads to information loss. This algorithm based on mixed MI score function for structural learning, and also linear regression and Gaussian distribution approximation for parameters learning. The library also offers two algorithms for enumerating graph structures - the greedy Hill-Climbing algorithm and the evolutionary algorithm. Thus the key capabilities of the proposed library are as follows: (1) structural and parameters learning of a Bayesian network on discretized data, (2) structural and parameters learning of a Bayesian network on mixed data using the MI mixed score function and Gaussian approximation, (3) launching learning algorithms on one of two algorithms for enumerating graph structures - Hill-Climbing and the evolutionary algorithm. Since the need for mixed data representation comes from practical necessity, the advantages of our implementations are evaluated in the context of solving approximation and gap recovery problems on synthetic data and real datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/30/2013

A Multivariate Discretization Method for Learning Bayesian Networks from Mixed Data

In this paper we address the problem of discretization in the context of...
research
08/29/2022

Approach of variable clustering and compression for learning large Bayesian networks

This paper describes a new approach for learning structures of large Bay...
research
01/23/2013

Bayesian Control for Concentrating Mixed Nuclear Waste

A control algorithm for batch processing of mixed waste is proposed base...
research
03/02/2021

Oil and Gas Reservoirs Parameters Analysis Using Mixed Learning of Bayesian Networks

In this paper, a multipurpose Bayesian-based method for data analysis, c...
research
01/22/2019

Solving All Regression Models For Learning Gaussian Networks Using Givens Rotations

Score based learning (SBL) is a promising approach for learning Bayesian...
research
09/18/2023

Generating and Imputing Tabular Data via Diffusion and Flow-based Gradient-Boosted Trees

Tabular data is hard to acquire and is subject to missing values. This p...

Please sign up or login with your details

Forgot password? Click here to reset