Graph model selection by edge probability sequential inference

06/25/2021
by   Louis Duvivier, et al.
0

Graphs are widely used for describing systems made up of many interacting components and for understanding the structure of their interactions. Various statistical models exist, which describe this structure as the result of a combination of constraints and randomness. automatically identify the best model, and the best set of parameters for a given graph. To do so, most authors rely on the minimum description length paradigm, and apply it to graphs by considering the entropy of probability distributions defined on graph ensembles. In this paper, we introduce edge probability sequential inference, a new approach to perform model selection, which relies on probability distributions on edge ensembles. From a theoretical point of view, we show that this methodology provides a more consistent ground for statistical inference with respect to existing techniques, due to the fact that it relies on multiple realizations of the random variable. It also provides better guarantees against overfitting, by making it possible to lower the number of parameters of the model below the number of observations. Experimentally, we illustrate the benefits of this methodology in two situations: to infer the partition of a stochastic blockmodel, and to identify the most relevant model for a given graph between the stochastic blockmodel and the configuration model.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/17/2019

Minimum entropy stochastic block models neglect edge distribution heterogeneity

The statistical inference of stochastic block models as emerged as a mat...
research
10/07/2020

Stochastic parameterization with VARX processes

In this study we investigate a data-driven stochastic methodology to par...
research
08/29/2020

Modeling of Daily Precipitation Amounts Using the Mixed Gamma Weibull Distribution

By recognizing that the main difficulty of the modeling of daily precipi...
research
10/02/2021

Graph Compression with Application to Model Selection

Many multivariate data such as social and biological data exhibit comple...
research
08/02/2020

Statistical Inference of Minimally Complex Models

Finding the best model that describes a high dimensional dataset, is a d...
research
05/10/2023

Occam Factor for Random Graphs: Erdös-Rényi, Independent Edge, and a Uniparametric Stochastic Blockmodel

We investigate the evidence/flexibility (i.e., "Occam") paradigm and dem...
research
04/29/2021

A stochastic framework for atomistic fracture

We present a stochastic modeling framework for atomistic propagation of ...

Please sign up or login with your details

Forgot password? Click here to reset