Extract Dynamic Information To Improve Time Series Modeling: a Case Study with Scientific Workflow

05/19/2022
by   Jeeyung Kim, et al.
0

In modeling time series data, we often need to augment the existing data records to increase the modeling accuracy. In this work, we describe a number of techniques to extract dynamic information about the current state of a large scientific workflow, which could be generalized to other types of applications. The specific task to be modeled is the time needed for transferring a file from an experimental facility to a data center. The key idea of our approach is to find recent past data transfer events that match the current event in some ways. Tests showed that we could identify recent events matching some recorded properties and reduce the prediction error by about 12 models with only static features. We additionally explored an application specific technique to extract information about the data production process, and was able to reduce the average prediction error by 44

READ FULL TEXT
research
01/31/2020

Two-Sample Testing for Event Impacts in Time Series

In many application domains, time series are monitored to detect extreme...
research
03/09/2020

Temporal Attribute Prediction via Joint Modeling of Multi-Relational Structure Evolution

Time series prediction is an important problem in machine learning. Prev...
research
02/01/2022

Semantic of Cloud Computing services for Time Series workflows

Time series (TS) are present in many fields of knowledge, research, and ...
research
05/20/2020

Modeling Physical/Digital Systems: Formal Event-B vs. Diagrammatic Thinging Machine

Models are centrally important in many scientific fields. A model is a r...
research
10/02/2017

KV-match: An Efficient Subsequence Matching Approach for Large Scale Time Series

Time series data have exploded due to the popularity of new applications...
research
07/19/2018

Indexing Execution Patterns in Workflow Provenance Graphs through Generalized Trie Structures

Over the last years, scientific workflows have become mature enough to b...
research
01/29/2019

A new tidy data structure to support exploration and modeling of temporal data

Mining temporal data for information is often inhibited by a multitude o...

Please sign up or login with your details

Forgot password? Click here to reset