Learning the Finer Things: Bayesian Structure Learning at the Instantiation Level

03/08/2023
by Chase Yakaboski, et al.

Successful machine learning methods require a trade-off between memorization and generalization. With too much memorization, the model cannot generalize to unobserved examples. With too much generalization, we risk under-fitting the data. While we commonly measure the performance of these methods through cross-validation and accuracy metrics, how should they cope in extremely under-determined domains where accuracy is always unsatisfactory? We present a novel probabilistic graphical model structure learning approach that can learn, generalize, and explain in these elusive domains by operating at the random variable instantiation level. Using Minimum Description Length (MDL) analysis, we propose a new decomposition of the learning problem over all training exemplars, fusing together minimal entropy inferences to construct a final knowledge base. By leveraging Bayesian Knowledge Bases (BKBs), a framework that operates at the instantiation level and inherently subsumes Bayesian Networks (BNs), we develop both a theoretical MDL score and an associated structure learning algorithm that demonstrate significant improvements over learned BNs on 40 benchmark datasets. Further, our algorithm incorporates recent off-the-shelf DAG learning techniques, enabling tractable results even on large problems. We then demonstrate the utility of our approach in a significantly under-determined domain by learning gene regulatory networks on breast cancer gene mutational data available from The Cancer Genome Atlas (TCGA).
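
To make the memorization versus generalization trade-off behind MDL concrete, the following is a minimal sketch of the textbook variable-level MDL score for choosing the parents of one discrete variable in a Bayesian Network. It is not the instantiation-level BKB score developed in the paper. The function names (`conditional_entropy`, `mdl_score`), the toy data, and the `0.5 * log2(N)` bits-per-parameter penalty are assumptions chosen for illustration.

```python
import math
from collections import Counter

def conditional_entropy(rows, child, parents):
    """Empirical H(child | parents) in bits, estimated from a list of dict rows."""
    n = len(rows)
    joint = Counter((tuple(r[p] for p in parents), r[child]) for r in rows)
    marginal = Counter(tuple(r[p] for p in parents) for r in rows)
    return -sum(c / n * math.log2(c / marginal[pa]) for (pa, _), c in joint.items())

def mdl_score(rows, child, parents):
    """Illustrative MDL score in bits (lower is better): model cost + data cost."""
    n = len(rows)

    def arity(var):
        # Number of distinct states observed for a variable.
        return len({r[var] for r in rows})

    # Free parameters of the conditional table P(child | parents).
    n_params = (arity(child) - 1) * math.prod(arity(p) for p in parents)
    model_bits = 0.5 * math.log2(n) * n_params               # cost of describing the model
    data_bits = n * conditional_entropy(rows, child, parents)  # cost of the data given the model
    return model_bits + data_bits

# Toy data (hypothetical): B always copies A, while C is independent of both.
rows = [{"A": a, "B": a, "C": c} for a in (0, 1) for c in (0, 1)] * 5

for parents in ([], ["A"], ["C"], ["A", "C"]):
    print(parents, round(mdl_score(rows, "B", parents), 2))
```

On this toy data the parent set `["A"]` gets the lowest score: it removes all uncertainty about `B` at a small model cost. Adding `C` as well only increases the model cost, and using no parents or only `C` leaves the data expensive to encode.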


