Limit theorems for dependent combinatorial data, with applications in statistical inference

08/27/2021
by   Somabha Mukherjee, et al.
0

The Ising model is a celebrated example of a Markov random field, introduced in statistical physics to model ferromagnetism. This is a discrete exponential family with binary outcomes, where the sufficient statistic involves a quadratic term designed to capture correlations arising from pairwise interactions. However, in many situations the dependencies in a network arise not just from pairs, but from peer-group effects. A convenient mathematical framework for capturing higher-order dependencies, is the p-tensor Ising model, where the sufficient statistic consists of a multilinear polynomial of degree p. This thesis develops a framework for statistical inference of the natural parameters in p-tensor Ising models. We begin with the Curie-Weiss Ising model, where we unearth various non-standard phenomena in the asymptotics of the maximum-likelihood (ML) estimates of the parameters, such as the presence of a critical curve in the interior of the parameter space on which these estimates have a limiting mixture distribution, and a surprising superefficiency phenomenon at the boundary point(s) of this curve. ML estimation fails in more general p-tensor Ising models due to the presence of a computationally intractable normalizing constant. To overcome this issue, we use the popular maximum pseudo-likelihood (MPL) method, which avoids computing the inexplicit normalizing constant based on conditional distributions. We derive general conditions under which the MPL estimate is √(N)-consistent, where N is the size of the underlying network. Finally, we consider a more general Ising model, which incorporates high-dimensional covariates at the nodes of the network, that can also be viewed as a logistic regression model with dependent observations. In this model, we show that the parameters can be estimated consistently under sparsity assumptions on the true covariate vector.

READ FULL TEXT
research
05/07/2020

Phase Transitions of the Maximum Likelihood Estimates in the Tensor Curie-Weiss Model

The p-tensor Curie-Weiss model is a two-parameter discrete exponential f...
research
10/07/2021

High Dimensional Logistic Regression Under Network Dependence

Logistic regression is one of the most fundamental methods for modeling ...
research
03/18/2020

Logistic-Regression with peer-group effects via inference in higher order Ising models

Spin glass models, such as the Sherrington-Kirkpatrick, Hopfield and Isi...
research
08/29/2020

Estimation in Tensor Ising Models

The p-tensor Ising model is a one-parameter discrete exponential family ...
research
07/03/2023

Limit Theorems and Phase Transitions in the Tensor Curie-Weiss Potts Model

In this paper, we derive results about the limiting distribution of the ...
research
09/24/2021

Bahadur Efficiency in Tensor Curie-Weiss Models

The tensor Ising model is a discrete exponential family used for modelin...
research
04/02/2023

Tensor Recovery in High-Dimensional Ising Models

The k-tensor Ising model is an exponential family on a p-dimensional bin...

Please sign up or login with your details

Forgot password? Click here to reset