Snap Machine Learning

03/16/2018
by Celestine Dünner, et al.

We describe an efficient, scalable machine learning library that enables very fast training of generalized linear models. We demonstrate that our library can remove training time as a bottleneck for machine learning workloads, opening the door to a range of new applications. For instance, it allows more agile development and faster, more fine-grained exploration of the hyper-parameter space, enables scaling to massive datasets, and makes frequent re-training of models possible in order to adapt to events as they occur. Our library, named Snap Machine Learning (Snap ML), combines recent advances in machine learning systems and algorithms in a nested manner that reflects the hierarchical architecture of modern distributed systems. This allows us to effectively leverage the available network, memory and heterogeneous compute resources. On a terabyte-scale, publicly available dataset for click-through-rate prediction in computational advertising, we demonstrate the training of a logistic regression classifier in 1.53 minutes, a 46x improvement over the fastest reported performance.
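To make the idea of training a generalized linear model across several workers more concrete, here is a minimal two-level sketch in pure Python. It is not Snap ML's API or algorithm: it substitutes plain local SGD with averaged updates for whatever solver the library uses, and the names `local_update`, `train_nested`, `make_toy_data` and all parameter values are hypothetical, chosen only for illustration.

```python
import math
import random


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def local_update(weights, shard, lr, local_steps):
    """Inner level: one worker runs SGD passes over its own shard
    and returns the change it would apply to the shared weights."""
    w = list(weights)
    for _ in range(local_steps):
        for x, y in shard:
            p = sigmoid(sum(wi * xi for wi, xi in zip(w, x)))
            for j, xj in enumerate(x):
                w[j] -= lr * (p - y) * xj  # gradient of the logistic loss
    return [wl - wg for wl, wg in zip(w, weights)]


def train_nested(data, n_workers=4, rounds=50, lr=0.1, local_steps=1):
    """Outer level: split the data into shards, collect each worker's
    update, and average the updates into the shared model."""
    dim = len(data[0][0])
    weights = [0.0] * dim
    shards = [data[i::n_workers] for i in range(n_workers)]
    for _ in range(rounds):
        deltas = [local_update(weights, s, lr, local_steps) for s in shards]
        for j in range(dim):
            weights[j] += sum(d[j] for d in deltas) / n_workers
    return weights


def accuracy(weights, data):
    hits = 0
    for x, y in data:
        p = sigmoid(sum(wi * xi for wi, xi in zip(weights, x)))
        hits += int((p >= 0.5) == (y == 1))
    return hits / len(data)


def make_toy_data(n=400, seed=0):
    # Hypothetical toy problem: label is 1 when x1 + x2 > 0.
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        x1, x2 = rng.uniform(-1, 1), rng.uniform(-1, 1)
        data.append(([x1, x2, 1.0], 1 if x1 + x2 > 0 else 0))  # 1.0 is the bias feature
    return data


if __name__ == "__main__":
    toy = make_toy_data()
    model = train_nested(toy)
    print("training accuracy:", accuracy(model, toy))
```

In this sketch the workers run sequentially in one process; in a real distributed setting each shard would live on a separate node or accelerator, and the averaging step would be the only point of communication per round.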


research
03/24/2023

An investigation of licensing of datasets for machine learning based on the GQM model

Dataset licensing is currently an issue in the development of machine le...
research
09/20/2023

Towards a Prediction of Machine Learning Training Time to Support Continuous Learning Systems Development

The problem of predicting the training time of machine learning (ML) mod...
research
08/17/2017

Efficient Use of Limited-Memory Accelerators for Linear Learning on Heterogeneous Systems

We propose a generic algorithmic building block to accelerate training o...
research
10/23/2012

MLPACK: A Scalable C++ Machine Learning Library

MLPACK is a state-of-the-art, scalable, multi-platform C++ machine learn...
research
12/07/2020

SpotTune: Leveraging Transient Resources for Cost-efficient Hyper-parameter Tuning in the Public Cloud

Hyper-parameter tuning (HPT) is crucial for many machine learning (ML) a...
research
06/09/2022

HDTorch: Accelerating Hyperdimensional Computing with GP-GPUs for Design Space Exploration

HyperDimensional Computing (HDC) as a machine learning paradigm is highl...
research
05/21/2018

Super Learning in the SAS system

Background and objective: Stacking is an ensemble machine learning metho...
