Provable Convex Co-clustering of Tensors

03/17/2018
by   Eric C. Chi, et al.
0

Cluster analysis is a fundamental tool for pattern discovery of complex heterogeneous data. Prevalent clustering methods mainly focus on vector or matrix-variate data and are not applicable to general-order tensors, which arise frequently in modern scientific and business applications. Moreover, there is a gap between statistical guarantees and computational efficiency for existing tensor clustering solutions due to the nature of their non-convex formulations. In this work, we bridge this gap by developing a provable convex formulation of tensor co-clustering. Our convex co-clustering (CoCo) estimator enjoys stability guarantees and is both computationally and storage efficient. We further establish a non-asymptotic error bound for the CoCo estimator, which reveals a surprising "blessing of dimensionality" phenomenon that does not exist in vector or matrix-variate cluster analysis. Our theoretical findings are supported by extensive simulated studies. Finally, we apply the CoCo estimator to the cluster analysis of advertisement click tensor data from a major online company. Our clustering results provide meaningful business insights to improve advertising effectiveness.

READ FULL TEXT
research
08/24/2017

Dynamic Tensor Clustering

Dynamic tensor data are becoming prevalent in numerous applications. Exi...
research
03/31/2019

Sparse Tensor Additive Regression

Tensors are becoming prevalent in modern applications such as medical im...
research
09/15/2016

STORE: Sparse Tensor Response Regression and Neuroimaging Analysis

Motivated by applications in neuroimaging analysis, we propose a new reg...
research
12/18/2020

A Doubly-Enhanced EM Algorithm for Model-Based Tensor Clustering

Modern scientific studies often collect data sets in the forms of tensor...
research
07/09/2022

Error Analysis of Tensor-Train Cross Approximation

Tensor train decomposition is widely used in machine learning and quantu...
research
05/20/2022

Multidimensional heterogeneity learning for count value tensor data with applications to field goal attempt analysis of NBA players

We propose a multidimensional tensor clustering approach for studying ho...

Please sign up or login with your details

Forgot password? Click here to reset