Type-Driven Automated Learning with Lale

05/24/2019
by   Martin Hirzel, et al.
0

Machine-learning automation tools, ranging from humble grid-search to hyperopt, auto-sklearn, and TPOT, help explore large search spaces of possible pipelines. Unfortunately, each of these tools has a different syntax for specifying its search space, leading to lack of portability, missed relevant points, and spurious points that are inconsistent with error checks and documentation of the searchable base components. This paper proposes using types (such as enum, float, or dictionary) both for checking the correctness of, and for automatically searching over, hyperparameters and pipeline configurations. Using types for both of these purposes guarantees consistency. We present Lale, an embedded language that resembles scikit learn but provides better automation, correctness checks, and portability. Lale extends the reach of existing automation tools across data modalities (tables, text, images, time-series) and programming languages (Python, Java, R). Thus, data scientists can leverage automation while remaining in control of their work.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/04/2020

Lale: Consistent Automated Machine Learning

Automated machine learning makes it easier for data scientists to develo...
research
06/30/2020

Mining Documentation to Extract Hyperparameter Schemas

AI automation tools need machine-readable hyperparameter schemas to defi...
research
07/01/2019

On consistency types for lattice-based distributed programming languages

Distributed systems address an increasing demand for fast access to reso...
research
07/28/2020

Automated Unit Test Generation for Python

Automated unit test generation is an established research field, and mat...
research
01/13/2021

Whither AutoML? Understanding the Role of Automation in Machine Learning Workflows

Efforts to make machine learning more widely accessible have led to a ra...
research
07/05/2021

Automating Generative Deep Learning for Artistic Purposes: Challenges and Opportunities

We present a framework for automating generative deep learning with a sp...
research
12/20/2017

Kayak: Safe Semantic Refactoring to Java Streams

Refactorings are structured changes to existing software that leave its ...

Please sign up or login with your details

Forgot password? Click here to reset