Incrementally Updated Spectral Embeddings

09/03/2019
by   Vasileios Charisopoulos, et al.
0

Several fundamental tasks in data science rely on computing an extremal eigenspace of size r ≪ n, where n is the underlying problem dimension. For example, spectral clustering and PCA both require the computation of the leading r-dimensional subspace. Often, this process is repeated over time due to the possible temporal nature of the data; e.g., graphs representing relations in a social network may change over time, and feature vectors may be added, removed or updated in a dataset. Therefore, it is important to efficiently carry out the computations involved to keep up with frequent changes in the underlying data and also to dynamically determine a reasonable size for the subspace of interest. We present a complete computational pipeline for efficiently updating spectral embeddings in a variety of contexts. Our basic approach is to "seed" iterative methods for eigenproblems with the most recent subspace estimate to significantly reduce the computations involved, in contrast with a naïve approach which recomputes the subspace of interest from scratch at every step. In this setting, we provide various bounds on the number of iterations common eigensolvers need to perform in order to update the extremal eigenspace to a sufficient tolerance. We also incorporate a criterion for determining the size of the subspace based on successive eigenvalue ratios. We demonstrate the merits of our approach on the tasks of spectral clustering of temporally evolving graphs and PCA of an incrementally updated data matrix.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/23/2020

Higher-Order Spectral Clustering for Geometric Graphs

The present paper is devoted to clustering geometric graphs. While the s...
research
03/15/2018

Fast Subspace Clustering Based on the Kronecker Product

Subspace clustering is a useful technique for many computer vision appli...
research
04/27/2022

Nonbacktracking spectral clustering of nonuniform hypergraphs

Spectral methods offer a tractable, global framework for clustering in g...
research
10/28/2015

Fast Landmark Subspace Clustering

Kernel methods obtain superb performance in terms of accuracy for variou...
research
03/12/2020

Bringing in the outliers: A sparse subspace clustering approach to learn a dictionary of mouse ultrasonic vocalizations

Mice vocalize in the ultrasonic range during social interactions. These ...
research
03/05/2019

A Novel Efficient Approach with Data-Adaptive Capability for OMP-based Sparse Subspace Clustering

Orthogonal Matching Pursuit (OMP) plays an important role in data scienc...
research
09/09/2018

Clustering of graph vertex subset via Krylov subspace model reduction

Clustering via graph-Laplacian spectral imbedding is ubiquitous in data ...

Please sign up or login with your details

Forgot password? Click here to reset