Plotly-Resampler: Effective Visual Analytics for Large Time Series

06/17/2022
by Jonas Van Der Donckt, et al.

Visual analytics is arguably the most important step in getting acquainted with your data. This is especially true for time series, as this data type is hard to describe and cannot be fully understood through, for example, summary statistics alone. To realize effective time series visualization, four requirements have to be met: a tool should be (1) interactive, (2) scalable to millions of data points, (3) integrable in conventional data science environments, and (4) highly configurable. We observe that open source Python visualization toolkits empower data scientists in most visual analytics tasks, but lack the combination of scalability and interactivity needed for effective time series visualization. To meet these requirements, we created Plotly-Resampler, an open source Python library. Plotly-Resampler is an add-on for Plotly's Python bindings that enhances line chart scalability on top of an interactive toolkit by aggregating the underlying data depending on the current graph view. Plotly-Resampler is built to be snappy, as the reactivity of a tool qualitatively affects how analysts visually explore and analyze data. A benchmark task highlights how our toolkit scales better than alternatives in terms of the number of samples and the number of time series. Additionally, Plotly-Resampler's flexible data aggregation functionality paves the way for research into novel aggregation techniques. Plotly-Resampler's integrability, together with its configurability, convenience, and high scalability, allows analysts to effectively analyze high-frequency data in their day-to-day Python environment.
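The idea of aggregating data depending on the current graph view can be illustrated with a minimal, standard-library-only sketch. This is not Plotly-Resampler's implementation or API: the function names `minmax_downsample` and `aggregate_view`, the `n_out` parameter, and the choice of a simple min-max aggregator are all assumptions made for illustration. The sketch assumes `xs` is sorted in ascending order and that `n_out` is an even number of at least 2.

```python
import bisect
import math


def minmax_downsample(xs, ys, n_out):
    """Illustrative aggregator: keep the min and max sample of each bin.

    The series is split into n_out // 2 equally sized index bins, so at most
    n_out points are returned. Series that already fit are returned as-is.
    """
    n = len(xs)
    if n <= n_out:
        return list(xs), list(ys)
    n_bins = max(1, n_out // 2)
    out_x, out_y = [], []
    for b in range(n_bins):
        start = b * n // n_bins
        stop = (b + 1) * n // n_bins
        idx = range(start, stop)
        i_min = min(idx, key=ys.__getitem__)
        i_max = max(idx, key=ys.__getitem__)
        # Emit the two extremes in their original (time) order.
        for i in sorted({i_min, i_max}):
            out_x.append(xs[i])
            out_y.append(ys[i])
    return out_x, out_y


def aggregate_view(xs, ys, x_start, x_end, n_out=1000):
    """Aggregate only the samples inside the visible range [x_start, x_end]."""
    lo = bisect.bisect_left(xs, x_start)
    hi = bisect.bisect_right(xs, x_end)
    return minmax_downsample(xs[lo:hi], ys[lo:hi], n_out)


# Hypothetical high-frequency series with 100,000 samples.
xs = list(range(100_000))
ys = [math.sin(x / 1000) for x in xs]

# Full view: the series is reduced to at most n_out points.
full_x, full_y = aggregate_view(xs, ys, 0, 99_999, n_out=1000)

# Zoomed view: few enough samples are visible that raw data is returned.
zoom_x, zoom_y = aggregate_view(xs, ys, 2000, 2500, n_out=1000)
```

In an interactive tool, a function like `aggregate_view` would be re-run on every zoom or pan event with the new axis range, so the browser only ever receives a bounded number of points while the full-resolution data stays in the Python process.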

research
02/08/2023

DeepVATS: Deep Visual Analytics for Time Series

The field of Deep Visual Analytics (DVA) has recently arisen from the id...
research
04/29/2023

MinMaxLTTB: Leveraging MinMax-Preselection to Scale LTTB

Visualization plays an important role in analyzing and exploring time se...
research
07/29/2021

Mobilkit: A Python Toolkit for Urban Resilience and Disaster Risk Management Analytics using High Frequency Human Mobility Data

Increasingly available high-frequency location datasets derived from sma...
research
07/05/2023

tsdownsample: high-performance time series downsampling for scalable visualization

Interactive line chart visualizations greatly enhance the effective expl...
research
07/27/2020

LineSmooth: An Analytical Framework for Evaluating the Effectiveness of Smoothing Techniques on Line Charts

We present a comprehensive framework for evaluating line chart smoothing...
research
09/12/2019

A tale of two toolkits, report the first: benchmarking time series classification algorithms for correctness and efficiency

sktime is an open source, Python based, sklearn compatible toolkit for t...
research
09/01/2020

MultiSegVA: Using Visual Analytics to Segment Biologging Time Series on Multiple Scales

Segmenting biologging time series of animals on multiple temporal scales...
