PyRelationAL: A Library for Active Learning Research and Development

05/23/2022
by   Paul Scherer, et al.
0

In constrained real-world scenarios where it is challenging or costly to generate data, disciplined methods for acquiring informative new data points are of fundamental importance for the efficient training of machine learning (ML) models. Active learning (AL) is a subfield of ML focused on the development of methods to iteratively and economically acquire data through strategically querying new data points that are the most useful for a particular task. Here, we introduce PyRelationAL, an open source library for AL research. We describe a modular toolkit that is compatible with diverse ML frameworks (e.g. PyTorch, Scikit-Learn, TensorFlow, JAX). Furthermore, to help accelerate research and development in the field, the library implements a number of published methods and provides API access to wide-ranging benchmark datasets and AL task configurations based on existing literature. The library is supplemented by an expansive set of tutorials, demos, and documentation to help users get started. We perform experiments on the PyRelationAL collection of benchmark datasets and showcase the considerable economies that AL can provide. PyRelationAL is maintained using modern software engineering practices - with an inclusive contributor code of conduct - to promote long term library quality and utilisation.

READ FULL TEXT
research
10/16/2020

ALdataset: a benchmark for pool-based active learning

Active learning (AL) is a subfield of machine learning (ML) in which a l...
research
06/13/2021

Active Learning for Network Traffic Classification: A Technical Study

Network Traffic Classification (NTC) has become an important feature in ...
research
06/22/2020

ASReview: Open Source Software for Efficient and Transparent Active Learning for Systematic Reviews

For many tasks – including guideline development for medical doctors and...
research
06/25/2022

Making Look-Ahead Active Learning Strategies Feasible with Neural Tangent Kernels

We propose a new method for approximating active learning acquisition st...
research
06/09/2022

HDTorch: Accelerating Hyperdimensional Computing with GP-GPUs for Design Space Exploration

HyperDimensional Computing (HDC) as a machine learning paradigm is highl...
research
06/02/2023

Active Code Learning: Benchmarking Sample-Efficient Training of Code Models

The costly human effort required to prepare the training data of machine...
research
02/21/2020

Towards Robust and Reproducible Active Learning Using Neural Networks

Active learning (AL) is a promising ML paradigm that has the potential t...

Please sign up or login with your details

Forgot password? Click here to reset