Hillview: A trillion-cell spreadsheet for big data

07/10/2019
by Mihai Budiu, et al.

Hillview is a distributed spreadsheet for browsing very large datasets that cannot be handled by a single machine. As a spreadsheet, Hillview provides a high degree of interactivity that permits data analysts to explore information quickly along many dimensions while switching visualizations on a whim. To provide the required responsiveness, Hillview introduces visualization sketches, or vizketches, a simple idea for producing compact data visualizations. Vizketches combine algorithmic techniques for data summarization with computer graphics principles for efficient rendering. While simple, vizketches are effective at scaling the spreadsheet by parallelizing computation, reducing communication, providing progressive visualizations, and offering precise accuracy guarantees. Using Hillview running on eight servers, we can navigate and visualize datasets of tens of billions of rows and trillions of cells, well beyond the published capabilities of competing systems.
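As a rough illustration of the idea, and not the paper's implementation, the sketch below assumes a vizketch can be pictured as a fixed-size histogram summary that each server computes over its own partition and that can be merged with the summaries from other servers. The names `HistogramSketch`, `summarize`, and the sample data are hypothetical. The bucket count would be chosen from the display resolution, so the summary size does not grow with the number of rows.

```python
from dataclasses import dataclass
from functools import reduce
from typing import Iterable, List


@dataclass
class HistogramSketch:
    """Fixed-size summary: one counter per bucket, independent of row count."""
    lo: float
    hi: float
    counts: List[int]

    @classmethod
    def empty(cls, lo: float, hi: float, buckets: int) -> "HistogramSketch":
        return cls(lo, hi, [0] * buckets)

    def add(self, value: float) -> None:
        # Values outside [lo, hi] are ignored in this sketch.
        if not (self.lo <= value <= self.hi):
            return
        n = len(self.counts)
        # Map the value to a bucket; the upper bound falls in the last bucket.
        idx = min(int((value - self.lo) / (self.hi - self.lo) * n), n - 1)
        self.counts[idx] += 1

    def merge(self, other: "HistogramSketch") -> "HistogramSketch":
        # Merging adds counters bucket by bucket, so partial results can be
        # combined in any order (and shown progressively as they arrive).
        assert (self.lo, self.hi, len(self.counts)) == (
            other.lo, other.hi, len(other.counts))
        merged = [a + b for a, b in zip(self.counts, other.counts)]
        return HistogramSketch(self.lo, self.hi, merged)


def summarize(partition: Iterable[float], lo: float, hi: float,
              buckets: int) -> HistogramSketch:
    """Summarize one partition; in a cluster this would run on each server."""
    sketch = HistogramSketch.empty(lo, hi, buckets)
    for value in partition:
        sketch.add(value)
    return sketch


# Hypothetical data split across three "servers".
partitions = [[1.0, 2.5, 9.9], [0.0, 5.0, 5.1, 10.0], [7.2]]
partials = [summarize(p, 0.0, 10.0, buckets=5) for p in partitions]
total = reduce(lambda a, b: a.merge(b), partials)
print(total.counts)
```

Only the small per-partition summaries cross the network here, which is one way to read the abstract's claims about parallelizing computation and reducing communication.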


research
09/29/2022

Summarizing text to embed qualitative data into visualizations

Qualitative data can be conveyed with strings of text. Fitting longer te...
research
06/28/2019

DIEL: Transparent Scaling for Interactive Visualization

We live in an era of big data and rich data visualization. As data sets ...
research
10/20/2020

What is Data Visualization and Why Is It Important?

#Datavisualization is the act of taking information (data) and placing i...
research
10/23/2008

Interchanging Interactive 3-d Graphics for Astronomy

We demonstrate how interactive, three-dimensional (3-d) scientific visua...
research
10/05/2017

InfiniViz: Interactive Visual Exploration using Progressive Bin Refinement

Interactive visualizations can accelerate the data analysis loop through...
research
12/27/2022

Using Large Language Models to Generate Engaging Captions for Data Visualizations

Creating compelling captions for data visualizations has been a longstan...
research
08/13/2022

VegaFusion: Automatic Server-Side Scaling for Interactive Vega Visualizations

The Vega grammar has been broadly adopted by a growing ecosystem of brow...
