Sequential Learning of Principal Curves: Summarizing Data Streams on the Fly

05/18/2018
by Benjamin Guedj, et al.

When confronted with massive data streams, summarizing data with dimension reduction methods such as PCA raises theoretical and algorithmic pitfalls. Principal curves act as a nonlinear generalization of PCA, and this paper proposes a novel algorithm to learn principal curves automatically and sequentially from data streams. We show that our procedure is supported by regret bounds with optimal sublinear remainder terms. We also present a greedy local search implementation that incorporates both sleeping experts and multi-armed bandit ingredients, along with its regret bound and its performance on a toy example and on seismic data.
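
To make the notion of regret concrete, the following is a minimal sketch and not the paper's algorithm. It assumes, purely for illustration, that a candidate curve is a polygonal line in the plane and that the loss of a point is its squared Euclidean distance to that line; the abstract specifies neither choice, and all function names here are hypothetical. Under these assumptions, regret would be the cumulative loss of the sequentially chosen curves minus the cumulative loss of the best fixed curve in hindsight.

```python
from typing import Sequence, Tuple

Point = Tuple[float, float]


def sq_dist_to_segment(x: Point, a: Point, b: Point) -> float:
    """Squared Euclidean distance from point x to the segment [a, b]."""
    dx, dy = b[0] - a[0], b[1] - a[1]
    seg_len_sq = dx * dx + dy * dy
    if seg_len_sq == 0.0:
        t = 0.0  # degenerate segment: a and b coincide
    else:
        # Orthogonal projection parameter, clamped so the foot stays on the segment.
        t = ((x[0] - a[0]) * dx + (x[1] - a[1]) * dy) / seg_len_sq
        t = max(0.0, min(1.0, t))
    px, py = a[0] + t * dx, a[1] + t * dy
    return (x[0] - px) ** 2 + (x[1] - py) ** 2


def sq_dist_to_polyline(x: Point, vertices: Sequence[Point]) -> float:
    """Squared distance from x to a polygonal line (assumed curve model)."""
    return min(sq_dist_to_segment(x, a, b) for a, b in zip(vertices, vertices[1:]))


def cumulative_loss(stream: Sequence[Point], vertices: Sequence[Point]) -> float:
    """Total squared distance of a stream of points to one fixed polygonal line."""
    return sum(sq_dist_to_polyline(x, vertices) for x in stream)


# Hypothetical toy stream and a single fixed candidate curve.
stream = [(0.0, 1.0), (1.0, -1.0), (3.0, 0.0)]
curve = [(0.0, 0.0), (2.0, 0.0)]
total = cumulative_loss(stream, curve)
```

A sequential learner would pick a (possibly different) curve before each point arrives; comparing its summed losses with `cumulative_loss` for the best fixed curve gives the regret whose growth the abstract describes as sublinear.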

research
01/04/2021

Be Greedy in Multi-Armed Bandits

The Greedy algorithm is the simplest heuristic in sequential decision pr...
research
03/11/2018

Incentives in the Dark: Multi-armed Bandits for Evolving Users with Unknown Type

Design of incentives or recommendations to users is becoming more common...
research
03/05/2020

Spherical Principal Curves

This paper presents a new approach for dimension reduction of data obser...
research
11/15/2019

Estimation via length-constrained generalized empirical principal curves under small noise

In this paper, we propose a method to build a sequence of generalized em...
research
03/31/2021

Dimension reduction of open-high-low-close data in candlestick chart based on pseudo-PCA

The (open-high-low-close) OHLC data is the most common data form in the ...
research
06/02/2016

Sequential Principal Curves Analysis

This work includes all the technical details of the Sequential Principal...
research
05/21/2003

Neural network modeling of data with gaps: method of principal curves, Carleman's formula, and other

A method of modeling data with gaps by a sequence of curves has been dev...
