Non-parametric regression for networks

09/30/2020
by   Katie E. Severn, et al.
0

Network data are becoming increasingly available, and so there is a need to develop suitable methodology for statistical analysis. Networks can be represented as graph Laplacian matrices, which are a type of manifold-valued data. Our main objective is to estimate a regression curve from a sample of graph Laplacian matrices conditional on a set of Euclidean covariates, for example in dynamic networks where the covariate is time. We develop an adapted Nadaraya-Watson estimator which has uniform weak consistency for estimation using Euclidean and power Euclidean metrics. We apply the methodology to the Enron email corpus to model smooth trends in monthly networks and highlight anomalous networks. Another motivating application is given in corpus linguistics, which explores trends in an author's writing style over time based on word co-occurrence networks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/21/2019

Manifold valued data analysis of samples of networks, with applications in corpus linguistics

Networks can be used in many applications, such as in the analysis of te...
research
07/16/2021

Non-Parametric Manifold Learning

We introduce an estimator for manifold distances based on graph Laplacia...
research
09/07/2021

Dynamic Network Regression

Network data are increasingly available in various research fields, moti...
research
09/06/2022

Rates of Convergence for Regression with the Graph Poly-Laplacian

In the (special) smoothing spline problem one considers a variational pr...
research
01/29/2017

Random Forest regression for manifold-valued responses

An increasing array of biomedical and computer vision applications requi...
research
04/21/2019

Total Variation Regularized Fréchet Regression for Metric-Space Valued Data

Non-Euclidean data that are indexed with a scalar predictor such as time...

Please sign up or login with your details

Forgot password? Click here to reset