Structure Learning in Inverse Ising Problems Using ℓ_2-Regularized Linear Estimator

by   Xiangming Meng, et al.

Inferring interaction parameters from observed data is a ubiquitous requirement in various fields of science and engineering. Recent studies have shown that the pseudolikelihood (PL) method is highly effective in meeting this requirement even though the maximum likelihood method is computationally intractable when used directly. To the best of our knowledge, most existing studies assume that the postulated model used in the inference stage covers the true model that generates the data. However, such an assumption does not necessarily hold in practical situations. From this perspective, we discuss the utility of the PL method in model mismatch cases. Specifically, we examine the inference performance of the PL method when ℓ_2-regularized (ridge) linear regression is applied to data generated from sparse Boltzmann machines of Ising spins using methods of statistical mechanics. Our analysis indicates that despite the model mismatch, one can perfectly identify the network topology using naive linear regression without regularization when the dataset size M is greater than the number of Ising spins, N. Further, even when M < N, perfect identification is possible using a two-stage estimator with much better quantitative performance compared to naive usage of the PL method. Results of extensive numerical experiments support our findings.



There are no comments yet.


page 1

page 2

page 3

page 4


Ising Model Selection Using ℓ_1-Regularized Linear Regression

We theoretically investigate the performance of ℓ_1-regularized linear r...

Analysis of overfitting in the regularized Cox model

The Cox proportional hazards model is ubiquitous in the analysis of time...

Takeuchi's Information Criteria as a form of Regularization

Takeuchi's Information Criteria (TIC) is a linearization of maximum like...

The Effects of Adaptation on Inference for Non-Linear Regression Models with Normal Errors

In this work, we assume that a response variable is explained by several...

Inject Machine Learning into Significance Test for Misspecified Linear Models

Due to its strong interpretability, linear regression is widely used in ...

Linear regression and its inference on noisy network-linked data

Linear regression on a set of observations linked by a network has been ...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.