Visualizing probability distributions across bivariate cyclic temporal granularities

10/02/2020
by   Sayani Gupta, et al.
0

Deconstructing a time index into time granularities can assist in exploration and automated analysis of large temporal data sets. This paper describes classes of time deconstructions using linear and cyclic time granularities. Linear granularities respect the linear progression of time such as hours, days, weeks and months. Cyclic granularities can be circular such as hour-of-the-day, quasi-circular such as day-of-the-month, and aperiodic such as public holidays. The hierarchical structure of granularities creates a nested ordering: hour-of-the-day and second-of-the-minute are single-order-up. Hour-of-the-week is multiple-order-up, because it passes over day-of-the-week. Methods are provided for creating all possible granularities for a time index. A recommendation algorithm provides an indication whether a pair of granularities can be meaningfully examined together (a "harmony"), or when they cannot (a "clash"). Time granularities can be used to create data visualizations to explore for periodicities, associations and anomalies. The granularities form categorical variables (ordered or unordered) which induce groupings of the observations. Assuming a numeric response variable, the resulting graphics are then displays of distributions compared across combinations of categorical variables. The methods implemented in the open source R package `gravitas` are consistent with a tidy workflow, with probability distributions examined using the range of graphics available in `ggplot2`.

READ FULL TEXT

page 8

page 23

page 27

research
02/16/2019

Projected Pólya Tree

One way of defining probability distributions for circular variables (di...
research
02/27/2013

Ignorance and the Expressiveness of Single- and Set-Valued Probability Models of Belief

Over time, there have hen refinements in the way that probability distri...
research
10/23/2018

Calendar-based graphics for visualizing people's daily schedules

Calendars are broadly used in society to display temporal information, a...
research
07/24/2020

Cycles in Causal Learning

In the causal learning setting, we wish to learn cause-and-effect relati...
research
01/09/2023

Multivariate Nonnegative Trigonometric Sums Distributions for High-Dimensional Multivariate Cirular Data

Fernández-Durán and Gregorio-Domínguez (2014) defined a family of probab...

Please sign up or login with your details

Forgot password? Click here to reset