cleanTS: Automated (AutoML) Tool to Clean Univariate Time Series at Microscales

10/22/2021
by   Mayur Kishor Shende, et al.
0

Data cleaning is one of the most important tasks in data analysis processes. One of the perennial challenges in data analytics is the detection and handling of non-valid data. Failing to do so can result in inaccurate analytics and unreliable decisions. The process of properly cleaning such data takes much time. Errors are prevalent in time series data. It is usually found that real world data is unclean and requires some pre-processing. The analysis of large amounts of data is difficult. This paper is intended to provide an easy to use and reliable system which automates the cleaning process of univariate time series data. Automating the process greatly reduces the time required. Visualizing a large amount of data at once is not very effective. To tackle this issue, an R package cleanTS is proposed. The proposed system provides a way to analyze data on different scales and resolutions. Also, it provides users with tools and a benchmark system for comparing various techniques used in data cleaning.

READ FULL TEXT

page 5

page 9

page 27

page 29

page 34

research
12/07/2018

Time Series Featurization via Topological Data Analysis

We develop a novel algorithm for feature extraction in time series data ...
research
08/05/2021

Local Exceptionality Detection in Time Series Using Subgroup Discovery

In this paper, we present a novel approach for local exceptionality dete...
research
11/11/2022

Data Quality Over Quantity: Pitfalls and Guidelines for Process Analytics

A significant portion of the effort involved in advanced process control...
research
08/14/2017

Computational Topology Techniques for Characterizing Time-Series Data

Topological data analysis (TDA), while abstract, allows a characterizati...
research
09/09/2020

tsBNgen: A Python Library to Generate Time Series Data from an Arbitrary Dynamic Bayesian Network Structure

Synthetic data is widely used in various domains. This is because many m...
research
03/16/2021

Deep Time Series Models for Scarce Data

Time series data have grown at an explosive rate in numerous domains and...
research
09/23/2022

WordStream Maker: A Lightweight End-to-end Visualization Platform for Qualitative Time-series Data

Whether it is in the form of transcribed conversations, blog posts, or t...

Please sign up or login with your details

Forgot password? Click here to reset