Inference for Dependent Data with Learned Clusters

07/30/2021
by   Jianfei Cao, et al.
0

This paper presents and analyzes an approach to cluster-based inference for dependent data. The primary setting considered here is with spatially indexed data in which the dependence structure of observed random variables is characterized by a known, observed dissimilarity measure over spatial indices. Observations are partitioned into clusters with the use of an unsupervised clustering algorithm applied to the dissimilarity measure. Once the partition into clusters is learned, a cluster-based inference procedure is applied to a statistical hypothesis testing procedure. The procedure proposed in the paper allows the number of clusters to depend on the data, which gives researchers a principled method for choosing an appropriate clustering level. The paper gives conditions under which the proposed procedure asymptotically attains correct size. A simulation study shows that the proposed procedure attains near nominal size in finite samples in a variety of statistical testing problems with dependent data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/08/2015

Minimax Optimal Variable Clustering in G-Block Correlation Models via Cord

The goal of variable clustering is to partition a random vector X∈ R^p ...
research
10/24/2022

Post-clustering difference testing: valid inference and practical considerations

Clustering is part of unsupervised analysis methods that consist in grou...
research
03/02/2021

Network Cluster-Robust Inference

Since network data commonly consists of observations on a single large n...
research
01/30/2023

Selective inference for clustering with unknown variance

In many modern statistical problems, the limited available data must be ...
research
09/30/2021

A flexible and robust non-parametric test of exchangeability

Many statistical analyses assume that the data points within a sample ar...
research
04/10/2010

New Clustering Algorithm for Vector Quantization using Rotation of Error Vector

The paper presents new clustering algorithm. The proposed algorithm give...
research
09/20/2019

Consensual aggregation of clusters based on Bregman divergences to improve predictive models

A new procedure to construct predictive models in supervised learning pr...

Please sign up or login with your details

Forgot password? Click here to reset