Polytope: An Algorithm for Efficient Feature Extraction on Hypercubes

06/20/2023
by Mathilde Leuridan, et al.

Data extraction algorithms on data hypercubes, or datacubes, are traditionally only capable of cutting boxes of data along the datacube axes. For many use cases, however, this approach is insufficient and returns more data than users actually need. This not only forces users to post-process the data after extraction but, more importantly, consumes more I/O resources than necessary. When users want to extract only small non-rectangular subsets from very large datacubes, the box approach does not scale well: I/O systems quickly reach capacity reading and returning unwanted data. In this paper, we propose a novel technique, based on computational geometry concepts, which instead pre-selects the precise bytes of data that the user needs and then reads only those from the datacube. As we discuss later on, this extraction method will considerably help scale access to petabyte-scale data hypercubes in a variety of scientific fields.
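To make the contrast between box extraction and shape-based pre-selection concrete, here is a minimal sketch on a two-dimensional integer grid. It is not the Polytope algorithm described in the paper: it uses a plain ray-casting point-in-polygon test as a stand-in, and every name in it (`point_in_polygon`, `bounding_box_indices`, `polygon_indices`, the example triangle) is a hypothetical chosen for illustration.

```python
import math


def point_in_polygon(x, y, polygon):
    """Ray-casting test: True if (x, y) lies strictly inside the polygon.

    `polygon` is a list of (x, y) vertices. Points exactly on an edge are
    not handled specially, so the example below avoids them.
    """
    inside = False
    n = len(polygon)
    for k in range(n):
        x1, y1 = polygon[k]
        x2, y2 = polygon[(k + 1) % n]
        # Does this edge straddle the horizontal line through y?
        if (y1 > y) != (y2 > y):
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


def bounding_box_indices(polygon):
    """All integer grid indices in the axis-aligned box around the polygon.

    This mimics the traditional 'cut a box along the axes' extraction.
    """
    xs = [p[0] for p in polygon]
    ys = [p[1] for p in polygon]
    i_range = range(math.ceil(min(xs)), math.floor(max(xs)) + 1)
    j_range = range(math.ceil(min(ys)), math.floor(max(ys)) + 1)
    return [(i, j) for i in i_range for j in j_range]


def polygon_indices(polygon):
    """Only the grid indices that fall inside the requested shape."""
    return [(i, j) for (i, j) in bounding_box_indices(polygon)
            if point_in_polygon(i, j, polygon)]


# Hypothetical request: a triangle whose edges avoid integer grid points.
triangle = [(-0.5, -0.5), (10.0, -0.5), (-0.5, 10.0)]
box = bounding_box_indices(triangle)
selected = polygon_indices(triangle)
print(f"box reads {len(box)} points, shape needs {len(selected)}")
```

In this toy case the box contains more than twice as many grid points as the triangle itself. Note that the sketch still enumerates the whole box in memory before filtering, which a real system would want to avoid; it only illustrates the difference in how much data ends up being read.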


