Data Generating Process to Evaluate Causal Discovery Techniques for Time Series Data

04/16/2021
by   Andrew R. Lawrence, et al.
0

Going beyond correlations, the understanding and identification of causal relationships in observational time series, an important subfield of Causal Discovery, poses a major challenge. The lack of access to a well-defined ground truth for real-world data creates the need to rely on synthetic data for the evaluation of these methods. Existing benchmarks are limited in their scope, as they either are restricted to a "static" selection of data sets, or do not allow for a granular assessment of the methods' performance when commonly made assumptions are violated. We propose a flexible and simple to use framework for generating time series data, which is aimed at developing, evaluating, and benchmarking time series causal discovery methods. In particular, the framework can be used to fine tune novel methods on vast amounts of data, without "overfitting" them to a benchmark, but rather so they perform well in real-world use cases. Using our framework, we evaluate prominent time series causal discovery methods and demonstrate a notable degradation in performance when their assumptions are invalidated and their sensitivity to choice of hyperparameters. Finally, we propose future research directions and how our framework can support both researchers and practitioners.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/19/2020

Neural Additive Vector Autoregression Models for Causal Discovery in Time Series Data

Causal structure discovery in complex dynamical systems is an important ...
research
01/31/2023

Evaluating Temporal Observation-Based Causal Discovery Techniques Applied to Road Driver Behaviour

Autonomous robots are required to reason about the behaviour of dynamic ...
research
06/23/2021

Beyond Predictions in Neural ODEs: Identification and Interventions

Spurred by tremendous success in pattern matching and prediction tasks, ...
research
01/29/2023

Statistical evaluation of a long-memory process using the generalized entropic Value-at-Risk

The modeling and identification of time series data with a long memory a...
research
02/20/2023

Enhancing Causal Discovery from Robot Sensor Data in Dynamic Scenarios

Identifying the main features and learning the causal relationships of a...
research
10/29/2021

A Demonstration of Benchmarking Time Series Management Systems in the Cloud

Time Series Management Systems (TSMS) are Database Management Systems th...
research
01/29/2015

Particle swarm optimization for time series motif discovery

Efficiently finding similar segments or motifs in time series data is a ...

Please sign up or login with your details

Forgot password? Click here to reset