A Sparse β-Model with Covariates for Networks

10/26/2020
by Stefan Stein, et al.

Data in the form of networks are increasingly encountered in the modern sciences and humanities. This paper concerns a new generative model, suitable for the sparse networks commonly observed in practice, that captures degree heterogeneity and homophily, two stylized features of a typical network. The former is achieved by assigning different parameters to individual nodes, the latter by incorporating covariates. Similar models in the literature for heterogeneity often include as many nodal parameters as there are nodes, leading to over-parametrization and, as a result, strong requirements on the density of the network. For parameter estimation, we propose the penalized likelihood method with an ℓ_1 penalty on the nodal parameters, giving rise to a convex optimization formulation that immediately connects our estimation procedure to the LASSO literature. We highlight the differences between our approach and the LASSO method for logistic regression, emphasizing that our model makes inference feasible for sparse networks; study finite-sample error bounds on the excess risk and the ℓ_1-error of the resulting estimator; and develop a central limit theorem for the parameter associated with the covariates. Simulations and data analysis corroborate the developed theory. As a by-product of our main theory, we study what we call the Erdős-Rényi model with covariates and develop the associated statistical inference for sparse networks, which can be of independent interest.
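
To make the estimation idea concrete, here is a minimal Python sketch of an ℓ_1-penalized likelihood fit by proximal gradient descent. It rests on assumptions the abstract does not spell out: a logistic link with linear predictor mu + beta_i + beta_j + gamma·z_ij for each node pair, a penalty applied to the averaged negative log-likelihood, and no sign or identifiability constraints on the nodal parameters. The function names (`soft_threshold`, `penalized_nll`, `fit_sparse_beta_model`) and the simulated data are hypothetical illustrations, not the authors' implementation or algorithm.

```python
import numpy as np

def soft_threshold(x, t):
    """Proximal operator of t * ||x||_1 (elementwise shrinkage toward zero)."""
    return np.sign(x) * np.maximum(np.abs(x) - t, 0.0)

def _upper_pairs(A, Z):
    """Flatten the pairs i < j of an undirected network into regression form."""
    iu, ju = np.triu_indices(A.shape[0], k=1)
    return iu, ju, A[iu, ju].astype(float), Z[iu, ju, :]

def penalized_nll(A, Z, beta, mu, gamma, lam):
    """Averaged logistic negative log-likelihood plus lam * ||beta||_1."""
    iu, ju, y, z = _upper_pairs(A, Z)
    eta = mu + beta[iu] + beta[ju] + z @ gamma  # assumed linear predictor
    return np.mean(np.logaddexp(0.0, eta) - y * eta) + lam * np.abs(beta).sum()

def fit_sparse_beta_model(A, Z, lam, n_iter=500):
    """Proximal gradient (ISTA) fit; only the nodal parameters beta are penalized.

    A: (n, n) symmetric 0/1 adjacency matrix with zero diagonal.
    Z: (n, n, p) symmetric array of edge covariates.
    """
    n, p = A.shape[0], Z.shape[2]
    iu, ju, y, z = _upper_pairs(A, Z)
    m = y.size
    # Each design row has squared norm 1 (mu) + 2 (beta_i, beta_j) + ||z_ij||^2,
    # and the logistic curvature is at most 1/4, so this step is at most 1/L.
    step = 4.0 / (3.0 + np.max(np.sum(z ** 2, axis=1)))
    beta, mu, gamma = np.zeros(n), 0.0, np.zeros(p)
    for _ in range(n_iter):
        eta = mu + beta[iu] + beta[ju] + z @ gamma
        resid = 1.0 / (1.0 + np.exp(-eta)) - y  # d(neg. log-lik.)/d(eta)
        # Each pair (i, j) contributes its residual to both beta_i and beta_j.
        g_beta = (np.bincount(iu, weights=resid, minlength=n)
                  + np.bincount(ju, weights=resid, minlength=n)) / m
        beta = soft_threshold(beta - step * g_beta, step * lam)
        mu = mu - step * resid.mean()
        gamma = gamma - step * (z.T @ resid) / m
    return beta, mu, gamma

# Simulated usage: a sparse network in which a few nodes have extra propensity
# to connect and similar nodes (small |x_i - x_j|) link more often.
rng = np.random.default_rng(0)
n = 40
x = rng.normal(size=n)
Z = np.abs(x[:, None] - x[None, :])[:, :, None]
beta_true = np.zeros(n)
beta_true[:4] = 2.0
eta_true = -2.5 + beta_true[:, None] + beta_true[None, :] - 0.5 * Z[:, :, 0]
edges = rng.random((n, n)) < 1.0 / (1.0 + np.exp(-eta_true))
A = np.triu(edges.astype(int), k=1)
A = A + A.T

beta_hat, mu_hat, gamma_hat = fit_sparse_beta_model(A, Z, lam=0.002, n_iter=300)
print("nonzero nodal parameters:", int(np.count_nonzero(beta_hat)))
print("estimated covariate effect:", gamma_hat)
```

The soft-thresholding step is what sets most nodal parameters exactly to zero, which is the sense in which an ℓ_1 penalty avoids fitting one free parameter per node. The penalty level `lam` here is an arbitrary illustrative value, not a recommendation from the paper.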

research
08/21/2021

A Sparse Random Graph Model for Sparse Directed Networks

An increasingly urgent task in analysis of networks is to develop statis...
research
08/08/2019

Analysis of Networks via the Sparse β-Model

Data in the form of networks are increasingly available in a variety of ...
research
08/22/2021

Convex Latent Effect Logit Model via Sparse and Low-rank Decomposition

In this paper, we propose a convex formulation for learning logistic reg...
research
11/26/2020

Generative Learning of Heterogeneous Tail Dependence

We propose a multivariate generative model to capture the complex depend...
research
06/07/2021

A sparse p_0 model with covariates for directed networks

We are concerned here with unrestricted maximum likelihood estimation in...
research
12/20/2022

Simultaneous Factors Selection and Fusion of Their Levels in Penalized Logistic Regression

Nowadays, several data analysis problems require for complexity reductio...
research
02/20/2023

Conformal Prediction for Network-Assisted Regression

An important problem in network analysis is predicting a node attribute ...