Small-text: Active Learning for Text Classification in Python

07/21/2021
by   Christopher Schröder, et al.
0

We present small-text, a simple modular active learning library, which offers pool-based active learning for text classification in Python. It comes with various pre-implemented state-of-the-art query strategies, including some which can leverage the GPU. Clearly defined interfaces allow to combine a multitude of such query strategies with different classifiers, thereby facilitating a quick mix and match, and enabling a rapid development of both active learning experiments and applications. To make various classifiers accessible in a consistent way, it integrates several well-known machine learning libraries, namely, scikit-learn, PyTorch, and huggingface transformers – for which the latter integrations are available as optionally installable extensions. The library is available under the MIT License at https://github.com/webis-de/small-text.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/12/2021

Uncertainty-based Query Strategies for Active Learning with Transformers

Active learning is the iterative construction of a classification model ...
research
05/02/2018

modAL: A modular active learning framework for Python

modAL is a modular active learning framework for Python, aimed to make a...
research
07/16/2021

The Application of Active Query K-Means in Text Classification

Active learning is a state-of-art machine learning approach to deal with...
research
11/24/2022

PyTAIL: Interactive and Incremental Learning of NLP Models with Human in the Loop for Online Data

Online data streams make training machine learning models hard because o...
research
04/19/2020

ktrain: A Low-Code Library for Augmented Machine Learning

We present ktrain, a low-code Python library that makes machine learning...
research
05/12/2021

Mining Legacy Issues in Open Pit Mining Sites: Innovation Support of Renaturalization and Land Utilization

Open pit mines left many regions worldwide inhospitable or uninhabitable...
research
01/31/2021

A Simple yet Brisk and Efficient Active Learning Platform for Text Classification

In this work, we propose the use of a fully managed machine learning ser...

Please sign up or login with your details

Forgot password? Click here to reset