Deep Learning with Apache SystemML

02/08/2018
by Niketan Pansare, et al.
Enterprises operate large data lakes using the Hadoop and Spark frameworks. These data lakes (1) run a plethora of tools to automate powerful data preparation and transformation pipelines, (2) run on large, shared clusters, and (3) perform many different analytics tasks, from model preparation and building to evaluation and tuning, for both machine learning and deep learning. Developing machine learning and deep learning models on data in such shared environments is challenging. Apache SystemML provides a unified framework for implementing machine learning and deep learning algorithms in a variety of shared deployment scenarios. SystemML's novel compilation approach automatically generates runtime execution plans for these algorithms. The plans are composed of single-node and distributed runtime operations, chosen according to data and cluster characteristics such as data size, data sparsity, cluster size, and memory configuration, while still exploiting the capabilities of the underlying big data frameworks.
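
The idea of choosing between single-node and distributed operations from data and memory characteristics can be illustrated with a toy sketch. This is not SystemML's implementation or API: the names MatrixStats, estimate_bytes, and choose_exec_type, the 12-bytes-per-non-zero sparse estimate, and the single memory budget are all assumptions made for illustration only.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass
class MatrixStats:
    """Hypothetical summary of a matrix: dimensions and fraction of non-zeros."""
    rows: int
    cols: int
    sparsity: float  # fraction of non-zero cells, in [0, 1]


def estimate_bytes(m: MatrixStats) -> float:
    """Rough in-memory size estimate (an assumption, not SystemML's formula).

    Dense: 8 bytes per cell (double precision).
    Sparse: 8-byte value plus 4-byte column index per non-zero cell.
    The smaller of the two estimates is returned.
    """
    cells = m.rows * m.cols
    dense = 8.0 * cells
    sparse = 12.0 * cells * m.sparsity
    return min(dense, sparse)


def choose_exec_type(inputs: Iterable[MatrixStats],
                     output: MatrixStats,
                     memory_budget_bytes: float) -> str:
    """Pick single-node execution if all operands fit in the budget."""
    total = sum(estimate_bytes(m) for m in inputs) + estimate_bytes(output)
    return "SINGLE_NODE" if total <= memory_budget_bytes else "DISTRIBUTED"


if __name__ == "__main__":
    budget = 1e9  # assumed 1 GB memory budget for one operation
    small = MatrixStats(1_000, 1_000, 1.0)
    big_dense = MatrixStats(1_000_000, 10_000, 1.0)
    big_sparse = MatrixStats(1_000_000, 10_000, 0.0001)
    for m in (small, big_dense, big_sparse):
        print(m, choose_exec_type([m, m], m, budget))
```

In this sketch the same logical operation lands on a different runtime purely because the size estimate changes: a large but very sparse matrix may fit on one node while a dense matrix of the same dimensions does not.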
