Sketches for Time-Dependent Machine Learning

08/26/2021
by   Jesus Antonanzas, et al.
0

Time series data can be subject to changes in the underlying process that generates them and, because of these changes, models built on old samples can become obsolete or perform poorly. In this work, we present a way to incorporate information about the current data distribution and its evolution across time into machine learning algorithms. Our solution is based on efficiently maintaining statistics, particularly the mean and the variance, of data features at different time resolutions. These data summarisations can be performed over the input attributes, in which case they can then be fed into the model as additional input features, or over latent representations learned by models, such as those of Recurrent Neural Networks. In classification tasks, the proposed techniques can significantly outperform the prediction capabilities of equivalent architectures with no feature / latent summarisations. Furthermore, these modifications do not introduce notable computational and memory overhead when properly adjusted.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/13/2019

Local Score Dependent Model Explanation for Time Dependent Covariates

The use of deep neural networks to make high risk decisions creates a ne...
research
06/08/2022

Classification of Stochastic Processes with Topological Data Analysis

In this study, we examine if engineered topological features can disting...
research
09/06/2018

From FATS to feets: Further improvements to an astronomical feature extraction tool based on machine learning

Machine learning algorithms are highly useful for the classification of ...
research
04/18/2019

Explaining Deep Classification of Time-Series Data with Learned Prototypes

The emergence of deep learning networks raises a need for algorithms to ...
research
01/07/2022

Churn prediction in online gambling

In business retention, churn prevention has always been a major concern....
research
09/30/2021

Two ways towards combining Sequential Neural Network and Statistical Methods to Improve the Prediction of Time Series

Statistic modeling and data-driven learning are the two vital fields tha...
research
05/11/2020

Deep Latent Variable Model for Longitudinal Group Factor Analysis

In many scientific problems such as video surveillance, modern genomic a...

Please sign up or login with your details

Forgot password? Click here to reset