AMIC: An Adaptive Information Theoretic Method to Identify Multi-Scale Temporal Correlations in Big Time Series Data -- Accepted Version

06/24/2019
by   Nguyen Ho, et al.
0

Recent development in computing, sensing and crowd-sourced data have resulted in an explosion in the availability of quantitative information. The possibilities of analyzing this so-called Big Data to inform research and the decision-making process are virtually endless. In general, analyses have to be done across multiple data sets in order to bring out the most value of Big Data. A first important step is to identify temporal correlations between data sets. Given the characteristics of Big Data in terms of volume and velocity, techniques that identify correlations not only need to be fast and scalable, but also need to help users in ordering the correlations across temporal scales so that they can focus on important relationships. In this paper, we present AMIC (Adaptive Mutual Information-based Correlation), a method based on mutual information to identify correlations at multiple temporal scales in large time series. Discovered correlations are suggested to users in an order based on the strength of the relationships. Our method supports an adaptive streaming technique that minimizes duplicated computation and is implemented on top of Apache Spark for scalability. We also provide a comprehensive evaluation on the effectiveness and the scalability of AMIC using both synthetic and real-world data sets.

READ FULL TEXT
research
06/24/2019

AMIC: An Adaptive Information Theoretic Method to Identify Multi-Scale Temporal Correlations in Big Time Series Data

Recent development in computing, sensing and crowd-sourced data have res...
research
04/19/2022

A Unified Approach for Multi-Scale Synchronous Correlation Search in Big Time Series – Full Version

The wide deployment of IoT sensors has enabled the collection of very bi...
research
12/24/2021

Toeplitz Least Squares Problems, Fast Algorithms and Big Data

In time series analysis, when fitting an autoregressive model, one must ...
research
05/30/2019

Efficient Covariance Estimation from Temporal Data

Estimating the covariance structure of multivariate time series is a fun...
research
06/09/2021

Sirius: A Mutual Information Tool for Exploratory Visualization of Mixed Data

Data scientists across disciplines are increasingly in need of explorato...
research
07/04/2016

Neighborhood Features Help Detecting Non-Technical Losses in Big Data Sets

Electricity theft is a major problem around the world in both developed ...
research
11/12/2021

GraSSNet: Graph Soft Sensing Neural Networks

In the era of big data, data-driven based classification has become an e...

Please sign up or login with your details

Forgot password? Click here to reset