Non-Volatile Memory Accelerated Geometric Multi-Scale Resolution Analysis

02/21/2022
by   Andrew Wood, et al.
2

Dimensionality reduction algorithms are standard tools in a researcher's toolbox. Dimensionality reduction algorithms are frequently used to augment downstream tasks such as machine learning, data science, and also are exploratory methods for understanding complex phenomena. For instance, dimensionality reduction is commonly used in Biology as well as Neuroscience to understand data collected from biological subjects. However, dimensionality reduction techniques are limited by the von-Neumann architectures that they execute on. Specifically, data intensive algorithms such as dimensionality reduction techniques often require fast, high capacity, persistent memory which historically hardware has been unable to provide at the same time. In this paper, we present a re-implementation of an existing dimensionality reduction technique called Geometric Multi-Scale Resolution Analysis (GMRA) which has been accelerated via novel persistent memory technology called Memory Centric Active Storage (MCAS). Our implementation uses a specialized version of MCAS called PyMM that provides native support for Python datatypes including NumPy arrays and PyTorch tensors. We compare our PyMM implementation against a DRAM implementation, and show that when data fits in DRAM, PyMM offers competitive runtimes. When data does not fit in DRAM, our PyMM implementation is still able to process the data.

READ FULL TEXT

page 1

page 2

page 3

page 5

research
01/31/2022

Topology-Preserving Dimensionality Reduction via Interleaving Optimization

Dimensionality reduction techniques are powerful tools for data preproce...
research
06/14/2021

Improving Metric Dimensionality Reduction with Distributed Topology

We propose a novel approach to dimensionality reduction combining techni...
research
10/01/2019

TriMap: Large-scale Dimensionality Reduction Using Triplets

We introduce "TriMap"; a dimensionality reduction technique based on tri...
research
06/24/2020

Extracting the main trend in a dataset: the Sequencer algorithm

Scientists aim to extract simplicity from observations of the complex wo...
research
05/01/2019

Coordinatizing Data With Lens Spaces and Persistent Cohomology

We introduce here a framework to construct coordinates in finite Lens sp...
research
07/02/2016

Text comparison using word vector representations and dimensionality reduction

This paper describes a technique to compare large text sources using wor...
research
07/06/2022

Multi-scale Sinusoidal Embeddings Enable Learning on High Resolution Mass Spectrometry Data

Small molecules in biological samples are studied to provide information...

Please sign up or login with your details

Forgot password? Click here to reset