Mining Documentation to Extract Hyperparameter Schemas

06/30/2020
by   Guillaume Baudart, et al.
0

AI automation tools need machine-readable hyperparameter schemas to define their search spaces. At the same time, AI libraries often come with good human-readable documentation. While such documentation contains most of the necessary information, it is unfortunately not ready to consume by tools. This paper describes how to automatically mine Python docstrings in AI libraries to extract JSON Schemas for their hyperparameters. We evaluate our approach on 119 transformers and estimators from three different libraries and find that it is effective at extracting machine-readable schemas. Our vision is to reduce the burden to manually create and maintain such schemas for AI automation tools and broaden the reach of automation to larger libraries and richer schemas.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/24/2019

Type-Driven Automated Learning with Lale

Machine-learning automation tools, ranging from humble grid-search to hy...
research
02/05/2019

How to "DODGE" Complex Software Analytics?

AI software is still software. Software engineers need better tools to m...
research
05/22/2020

MANGO: A Python Library for Parallel Hyperparameter Tuning

Tuning hyperparameters for machine learning algorithms is a tedious task...
research
08/09/2021

Experiences with the Introduction of AI-based Tools for Moderation Automation of Voice-based Participatory Media Forums

Voice-based discussion forums where users can record audio messages whic...
research
03/28/2020

Towards Automating the AI Operations Lifecycle

Today's AI deployments often require significant human involvement and s...
research
01/28/2020

An Adaptive and Near Parameter-free Evolutionary Computation Approach Towards True Automation in AutoML

A common claim of evolutionary computation methods is that they can achi...
research
01/16/2022

Social Networks as a Collective Intelligence: An Examination of the Python Ecosystem

The Python ecosystem represents a global, data rich, technology-enabled ...

Please sign up or login with your details

Forgot password? Click here to reset