TensorBank:Tensor Lakehouse for Foundation Model Training

09/05/2023
by   Romeo Kienzler, et al.
0

Storing and streaming high dimensional data for foundation model training became a critical requirement with the rise of foundation models beyond natural language. In this paper we introduce TensorBank, a petabyte scale tensor lakehouse capable of streaming tensors from Cloud Object Store (COS) to GPU memory at wire speed based on complex relational queries. We use Hierarchical Statistical Indices (HSI) for query acceleration. Our architecture allows to directly address tensors on block level using HTTP range reads. Once in GPU memory, data can be transformed using PyTorch transforms. We provide a generic PyTorch dataset type with a corresponding dataset factory translating relational queries and requested transformations as an instance. By making use of the HSI, irrelevant blocks can be skipped without reading them as those indices contain statistics on their content at different hierarchical resolution levels. This is an opinionated architecture powered by open standards and making heavy use of open-source technology. Although, hardened for production use using geospatial-temporal data, this architecture generalizes to other use case like computer vision, computational neuroscience, biological sequence analysis and more.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/29/2018

Data-parallel distributed training of very large models beyond GPU capacity

GPUs have limited memory and it is difficult to train wide and/or deep m...
research
09/22/2022

Deep Lake: a Lakehouse for Deep Learning

Traditional data lakes provide critical data infrastructure for analytic...
research
08/22/2023

Tensor Regression

Regression analysis is a key area of interest in the field of data analy...
research
08/04/2020

Distributed Non-Negative Tensor Train Decomposition

The era of exascale computing opens new venues for innovations and disco...
research
05/25/2023

Automated Tensor Model Parallelism with Overlapped Communication for Efficient Foundation Model Training

Deep learning is experiencing a rise in foundation models that are expec...
research
03/23/2020

Efficient Tensor Kernel methods for sparse regression

Recently, classical kernel methods have been extended by the introductio...
research
12/03/2020

AugSplicing: Synchronized Behavior Detection in Streaming Tensors

How can we track synchronized behavior in a stream of time-stamped tuple...

Please sign up or login with your details

Forgot password? Click here to reset