Cloud K-SVD: A Collaborative Dictionary Learning Algorithm for Big, Distributed Data

12/25/2014
by Haroon Raja, et al.

This paper studies the problem of data-adaptive representations for big, distributed data. It is assumed that a number of geographically distributed, interconnected sites hold massive local data and are interested in collaboratively learning a low-dimensional geometric structure underlying these data. In contrast to previous works on subspace-based data representations, this paper focuses on the geometric structure of a union of subspaces (UoS). In this regard, it proposes a distributed algorithm, termed cloud K-SVD, for collaborative learning of a UoS structure underlying the distributed data of interest. The goal of cloud K-SVD is to learn a common overcomplete dictionary at each individual site such that every sample in the distributed data can be represented through a small number of atoms of the learned dictionary. Cloud K-SVD accomplishes this goal without requiring the exchange of individual samples between sites. This makes it suitable for applications where sharing raw data is discouraged because of either privacy concerns or large data volumes. The paper also provides an analysis of cloud K-SVD that gives insights into its properties, as well as into how far the dictionaries learned at individual sites deviate from a centralized solution, expressed in terms of different measures of the local and global data and the topology of the interconnections. Finally, the paper numerically illustrates the efficacy of cloud K-SVD on real and synthetic distributed data.
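The abstract does not spell out how sites collaborate without exchanging samples. As a rough illustration only, the sketch below assumes one plausible mechanism: each site keeps its own data, and the sites run power iterations in which only short vectors are averaged between neighbors (averaging consensus) to estimate the dominant direction of the pooled data. The function names, the ring topology, and the weight matrix `W` are hypothetical choices made for this example, not details taken from the paper.

```python
import numpy as np

def consensus_average(values, W, rounds):
    """Mix one row per site with a doubly stochastic weight matrix W.

    After enough rounds every row approaches the average of the
    initial rows; only these vectors travel between sites.
    """
    Z = values.copy()
    for _ in range(rounds):
        Z = W @ Z
    return Z

def distributed_power_method(local_mats, W, power_iters=50,
                             consensus_rounds=50, seed=0):
    """Estimate the dominant eigenvector of sum(local_mats) at every site.

    local_mats[i] is the matrix Y_i @ Y_i.T kept privately at site i
    (an assumption of this sketch). Returns an (N, n) array whose
    i-th row is site i's estimate.
    """
    rng = np.random.default_rng(seed)
    n_sites = len(local_mats)
    dim = local_mats[0].shape[0]
    q0 = rng.standard_normal(dim)
    q0 /= np.linalg.norm(q0)
    Q = np.tile(q0, (n_sites, 1))  # same starting vector at every site
    for _ in range(power_iters):
        # Each site multiplies by its own matrix, using local data only.
        local = np.stack([M @ q for M, q in zip(local_mats, Q)])
        # Neighbors exchange vectors until they roughly agree on the mean.
        mixed = consensus_average(local, W, consensus_rounds)
        Q = mixed / np.linalg.norm(mixed, axis=1, keepdims=True)
    return Q

# Toy setup: 4 sites on a ring, data concentrated along one direction u.
rng = np.random.default_rng(1)
u = np.ones(5) / np.sqrt(5)
local_mats = []
for _ in range(4):
    Y = np.outer(u, 3 * rng.standard_normal(20)) + 0.1 * rng.standard_normal((5, 20))
    local_mats.append(Y @ Y.T)

# Ring adjacency; (I + A) / 3 is symmetric and doubly stochastic here.
A = np.array([[0, 1, 0, 1],
              [1, 0, 1, 0],
              [0, 1, 0, 1],
              [1, 0, 1, 0]], dtype=float)
W = (np.eye(4) + A) / 3

Q = distributed_power_method(local_mats, W)
_, vecs = np.linalg.eigh(sum(local_mats))
v_central = vecs[:, -1]  # centralized answer, for comparison only
print(np.abs(Q @ v_central))  # alignment of each site's estimate
```

In this toy setting each site ends up with nearly the same direction that a centralized eigendecomposition of the pooled data would give, even though only 5-dimensional vectors ever leave a site. How closely the local and centralized results agree in general depends on the data and the network topology, which is the kind of question the paper's analysis addresses.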
