Consistent Second-Order Conic Integer Programming for Learning Bayesian Networks

05/29/2020
by   Simge Küçükyavuz, et al.
7

Bayesian Networks (BNs) represent conditional probability relations among a set of random variables (nodes) in the form of a directed acyclic graph (DAG), and have found diverse applications in knowledge discovery. We study the problem of learning the sparse DAG structure of a BN from continuous observational data. The central problem can be modeled as a mixed-integer program with an objective function composed of a convex quadratic loss function and a regularization penalty subject to linear constraints. The optimal solution to this mathematical program is known to have desirable statistical properties under certain conditions. However, the state-of-the-art optimization solvers are not able to obtain provably optimal solutions to the existing mathematical formulations for medium-size problems within reasonable computational times. To address this difficulty, we tackle the problem from both computational and statistical perspectives. On the one hand, we propose a concrete early stopping criterion to terminate the branch-and-bound process in order to obtain a near-optimal solution to the mixed-integer program, and establish the consistency of this approximate solution. On the other hand, we improve the existing formulations by replacing the linear "big-M" constraints that represent the relationship between the continuous and binary indicator variables with second-order conic constraints. Our numerical results demonstrate the effectiveness of the proposed approaches.

READ FULL TEXT

page 18

page 19

page 20

research
04/23/2019

Integer Programming for Learning Directed Acyclic Graphs from Continuous Data

Learning directed acyclic graphs (DAGs) from data is a challenging task ...
research
04/29/2015

Incorporating Road Networks into Territory Design

Given a set of basic areas, the territory design problem asks to create ...
research
07/11/2022

On SOCP-based disjunctive cuts for solving a class of integer bilevel nonlinear programs

We study a class of integer bilevel programs with second-order cone cons...
research
08/04/2020

No Cross-Validation Required: An Analytical Framework for Regularized Mixed-Integer Problems (Extended Version)

This paper develops a method to obtain the optimal value for the regular...
research
07/19/2022

Multi-parametric Analysis for Mixed Integer Linear Programming: An Application to Transmission Planning and Congestion Control

Enhancing existing transmission lines is a useful tool to combat transmi...
research
07/12/2023

Outlier detection in regression: conic quadratic formulations

In many applications, when building linear regression models, it is impo...
research
01/11/2021

Mixed-Integer Approaches to Constrained Optimum Communication Spanning Tree Problem

Several novel mixed-integer linear and bilinear formulations are propose...

Please sign up or login with your details

Forgot password? Click here to reset