StreamingHub: Interactive Stream Analysis Workflows

05/01/2022
by   Yasith Jayawardana, et al.
0

Reusable data/code and reproducible analyses are foundational to quality research. This aspect, however, is often overlooked when designing interactive stream analysis workflows for time-series data (e.g., eye-tracking data). A mechanism to transmit informative metadata alongside data may allow such workflows to intelligently consume data, propagate metadata to downstream tasks, and thereby auto-generate reusable, reproducible analytic outputs with zero supervision. Moreover, a visual programming interface to design, develop, and execute such workflows may allow rapid prototyping for interdisciplinary research. Capitalizing on these ideas, we propose StreamingHub, a framework to build metadata propagating, interactive stream analysis workflows using visual programming. We conduct two case studies to evaluate the generalizability of our framework. Simultaneously, we use two heuristics to evaluate their computational fluidity and data growth. Results show that our framework generalizes to multiple tasks with a minimal performance overhead.

READ FULL TEXT

page 4

page 6

page 7

research
09/21/2022

Designing PIDs for Reproducible Science Using Time-Series Data

As part of the investigation done by the IEEE Standards Association P295...
research
07/30/2020

ConceptExplorer: Visual Analysis of Concept Driftsin Multi-source Time-series Data

Time-series data is widely studied in various scenarios, like weather fo...
research
03/02/2022

Nemo: Guiding and Contextualizing Weak Supervision for Interactive Data Programming

Weak Supervision (WS) techniques allow users to efficiently create large...
research
05/26/2023

Cluster Analysis of Open Research Data and a Case for Replication Metadata

Research data are often released upon journal publication to enable resu...
research
03/05/2021

BOPI: A Programming Interface For Reuse Of Research Data Available On DSpace Repositories

A recent study showed that more than 70 their peers's experiments and mo...
research
09/17/2019

Multimodal Multitask Representation Learning for Pathology Biobank Metadata Prediction

Metadata are general characteristics of the data in a well-curated and c...
research
08/10/2014

Modeling Creativity: Case Studies in Python

Modeling Creativity (doctoral dissertation, 2013) explores how creativit...

Please sign up or login with your details

Forgot password? Click here to reset