Goodness-of-fit Test for Latent Block Models

06/10/2019
by Chihiro Watanabe, et al.

Latent Block Models are used for probabilistic biclustering, which has been shown to be an effective method for analyzing various relational data sets. However, there has been no statistical test for determining the numbers of row and column clusters in Latent Block Models. Recent studies have constructed statistical-test-based methods for Stochastic Block Models, which assume that the observed matrix is square and symmetric and that the cluster assignments are the same for rows and columns. In this paper, we develop a goodness-of-fit test for Latent Block Models, which tests whether an observed data matrix fits a given set of row and column cluster numbers or whether it consists of more clusters in at least one of the row and column directions. To construct the test, we use a result from random matrix theory for a sample covariance matrix. We demonstrate the effectiveness of the proposed method experimentally by examining the asymptotic behavior of the test statistic and the test accuracy.
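
As a rough illustration of the kind of statistic such a test can be built on, the sketch below standardizes a data matrix by hypothesized block means and a common standard deviation, then normalizes the largest eigenvalue of the resulting sample covariance-type matrix. It is not the paper's procedure. It assumes Gaussian entries and, for simplicity, plugs in known (oracle) cluster assignments and block parameters, where a practical test would have to estimate them. The centering and scaling constants are the standard ones for the largest eigenvalue of a white Wishart matrix, whose limit is the Tracy-Widom distribution. All function and variable names are hypothetical.

```python
import numpy as np


def sample_lbm(row_labels, col_labels, means, sd, rng):
    """Draw a Gaussian block-structured matrix (illustrative generator)."""
    mean_matrix = means[np.ix_(row_labels, col_labels)]
    return mean_matrix + sd * rng.standard_normal(mean_matrix.shape)


def largest_eig_statistic(A, row_labels, col_labels, means, sd):
    """Normalized largest eigenvalue of Z^T Z for the standardized matrix Z.

    If the hypothesized block structure is correct, Z is approximately
    i.i.d. standard normal and the statistic stays of order one. Extra
    unmodeled clusters leave low-rank structure in Z and inflate it.
    """
    n, p = A.shape
    Z = (A - means[np.ix_(row_labels, col_labels)]) / sd
    lam_max = np.linalg.eigvalsh(Z.T @ Z)[-1]  # eigenvalues are ascending
    # Centering and scaling for a white Wishart matrix (assumed here).
    root_sum = np.sqrt(n - 1) + np.sqrt(p)
    center = root_sum ** 2
    scale = root_sum * (1 / np.sqrt(n - 1) + 1 / np.sqrt(p)) ** (1 / 3)
    return (lam_max - center) / scale


rng = np.random.default_rng(0)
n, p = 200, 150
rows = np.repeat([0, 1], n // 2)   # true row clusters
cols = np.repeat([0, 1], p // 2)   # true column clusters
true_means = np.array([[0.0, 2.0], [2.0, 0.0]])
A = sample_lbm(rows, cols, true_means, 1.0, rng)

# Correct hypothesis: 2 x 2 blocks with the true parameters.
t_null = largest_eig_statistic(A, rows, cols, true_means, 1.0)

# Too few clusters: a single block fitted by the overall mean and spread.
one_row = np.zeros(n, dtype=int)
one_col = np.zeros(p, dtype=int)
t_alt = largest_eig_statistic(A, one_row, one_col,
                              np.array([[A.mean()]]), A.std())
```

In this toy setting `t_null` should be of order one while `t_alt` is far larger, which is the qualitative behavior a test of "given cluster numbers versus more clusters" relies on. A rejection threshold would come from a Tracy-Widom quantile, which is omitted here.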

research
05/27/2020

Selective Inference for Latent Block Models

Model selection in latent block models has been a challenging but import...
research
02/23/2021

Goodness-of-fit Test on the Number of Biclusters in Relational Data Matrix

Biclustering is a method for detecting homogeneous submatrices in a give...
research
02/08/2020

Conjoined Dirichlet Process

Biclustering is a class of techniques that simultaneously clusters the r...
research
06/16/2022

Variational Estimators of the Degree-corrected Latent Block Model for Bipartite Networks

Biclustering on bipartite graphs is an unsupervised learning task that s...
research
01/31/2018

Coupling geometry on binary bipartite networks: hypotheses testing on pattern geometry and nestedness

Upon a matrix representation of a binary bipartite network, via the perm...
research
03/26/2021

Deep Two-Way Matrix Reordering for Relational Data Analysis

Matrix reordering is a task to permute the rows and columns of a given o...
research
12/05/2022

Matrix-valued Network Autoregression Model with Latent Group Structure

Matrix-valued time series data are frequently observed in a broad range ...
