Rafiki: Machine Learning as an Analytics Service System

04/17/2018
by Wei Wang, et al.

Big data analytics has gained massive momentum in the last few years. Applying machine learning models to big data has become an implicit requirement or expectation for most analysis tasks, especially in high-stakes applications. Typical applications include sentiment analysis of reviews for evaluating online products, image classification in food-logging applications for monitoring users' daily intake, and stock movement prediction. Extending traditional database systems to support such analysis is intriguing but challenging. First, it is almost impossible to implement all machine learning models in the database engines. Second, expert knowledge is required to optimize the training and inference procedures in terms of efficiency and effectiveness, which imposes a heavy burden on system users. In this paper, we develop and present a system, called Rafiki, that provides training and inference services for machine learning models and facilitates complex analytics on top of cloud platforms. Rafiki provides distributed hyper-parameter tuning for the training service, and online ensemble modeling for the inference service, which trades off latency against accuracy. Experimental results confirm the efficiency, effectiveness, scalability and usability of Rafiki.


