Robust Factorization of Real-world Tensor Streams with Patterns, Missing Values, and Outliers

02/16/2021
by   Dongjin Lee, et al.
0

Consider multiple seasonal time series being collected in real-time, in the form of a tensor stream. Real-world tensor streams often include missing entries (e.g., due to network disconnection) and at the same time unexpected outliers (e.g., due to system errors). Given such a real-world tensor stream, how can we estimate missing entries and predict future evolution accurately in real-time? In this work, we answer this question by introducing SOFIA, a robust factorization method for real-world tensor streams. In a nutshell, SOFIA smoothly and tightly integrates tensor factorization, outlier removal, and temporal-pattern detection, which naturally reinforce each other. Moreover, SOFIA integrates them in linear time, in an online manner, despite the presence of missing entries. We experimentally show that SOFIA is (a) robust and accurate: yielding up to 76 error; (b) fast: up to 935X faster than the second-most accurate competitor; and (c) scalable: scaling linearly with the number of new entries per time step.

READ FULL TEXT
research
12/16/2020

Time-Aware Tensor Decomposition for Missing Entry Prediction

Given a time-evolving tensor with missing entries, how can we effectivel...
research
05/15/2023

Scalable and Robust Tensor Ring Decomposition for Large-scale Data

Tensor ring (TR) decomposition has recently received increased attention...
research
04/04/2019

VeST: Very Sparse Tucker Factorization of Large-Scale Tensors

Given a large tensor, how can we decompose it to sparse core tensor and ...
research
10/09/2014

Bayesian Robust Tensor Factorization for Incomplete Multiway Data

We propose a generative model for robust tensor factorization in the pre...
research
01/29/2019

A Robust Time Series Model with Outliers and Missing Entries

This paper studies the problem of robustly learning the correlation func...
research
08/29/2017

Fast, Accurate, and Scalable Method for Sparse Coupled Matrix-Tensor Factorization

How can we capture the hidden properties from a tensor and a matrix data...
research
05/10/2020

Non-recurrent Traffic Congestion Detection with a Coupled Scalable Bayesian Robust Tensor Factorization Model

Non-recurrent traffic congestion (NRTC) usually brings unexpected delays...

Please sign up or login with your details

Forgot password? Click here to reset