When Lempel-Ziv-Welch Meets Machine Learning: A Case Study of Accelerating Machine Learning using Coding

02/22/2017
by   Fengan Li, et al.
0

In this paper we study the use of coding techniques to accelerate machine learning (ML). Coding techniques, such as prefix codes, have been extensively studied and used to accelerate low-level data processing primitives such as scans in a relational database system. However, there is little work on how to exploit them to accelerate ML algorithms. In fact, applying coding techniques for faster ML faces a unique challenge: one needs to consider both how the codes fit into the optimization algorithm used to train a model, and the interplay between the model structure and the coding scheme. Surprisingly and intriguingly, our study demonstrates that a slight variant of the classical Lempel-Ziv-Welch (LZW) coding scheme is a good fit for several popular ML algorithms, resulting in substantial runtime savings. Comprehensive experiments on several real-world datasets show that our LZW-based ML algorithms exhibit speedups of up to 31x compared to a popular and state-of-the-art ML library, with no changes to ML accuracy, even though the implementations of our LZW variants are not heavily tuned. Thus, our study reveals a new avenue for accelerating ML algorithms using coding techniques and we hope this opens up a new direction for more research.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/10/2020

Managing Data Lineage of O G Machine Learning Models: The Sweet Spot for Shale Use Case

Machine Learning (ML) has increased its role, becoming essential in seve...
research
08/13/2019

Requirements Engineering for Machine Learning: Perspectives from Data Scientists

Machine learning (ML) is used increasingly in real-world applications. I...
research
01/13/2021

MLGO: a Machine Learning Guided Compiler Optimizations Framework

Leveraging machine-learning (ML) techniques for compiler optimizations h...
research
06/09/2022

HDTorch: Accelerating Hyperdimensional Computing with GP-GPUs for Design Space Exploration

HyperDimensional Computing (HDC) as a machine learning paradigm is highl...
research
08/27/2019

A Framework for Model Search Across Multiple Machine Learning Implementations

Several recently devised machine learning (ML) algorithms have shown imp...
research
08/01/2023

An Introduction to Bi-level Optimization: Foundations and Applications in Signal Processing and Machine Learning

Recently, bi-level optimization (BLO) has taken center stage in some ver...
research
01/28/2022

Task-Aware Network Coding Over Butterfly Network

Network coding allows distributed information sources such as sensors to...

Please sign up or login with your details

Forgot password? Click here to reset