A Simple yet Brisk and Efficient Active Learning Platform for Text Classification

01/31/2021
by   Teja Kanchinadam, et al.
0

In this work, we propose the use of a fully managed machine learning service, which utilizes active learning to directly build models from unstructured data. With this tool, business users can quickly and easily build machine learning models and then directly deploy them into a production ready hosted environment without much involvement from data scientists. Our approach leverages state-of-the-art text representation like OpenAI's GPT2 and a fast implementation of the active learning workflow that relies on a simple construction of incremental learning using linear models, thus providing a brisk and efficient labeling experience for the users. Experiments on both publicly available and real-life insurance datasets empirically show why our choices of simple and fast classification algorithms are ideal for the task at hand.

READ FULL TEXT
POST COMMENT

Comments

There are no comments yet.

Authors

page 7

07/21/2021

Small-text: Active Learning for Text Classification in Python

We present small-text, a simple modular active learning library, which o...
07/16/2021

The Application of Active Query K-Means in Text Classification

Active learning is a state-of-art machine learning approach to deal with...
10/18/2017

Can Machine Learning Create an Advocate for Foster Youth?

Statistics are bleak for youth aging out of the United States foster car...
05/12/2021

Mining Legacy Issues in Open Pit Mining Sites: Innovation Support of Renaturalization and Land Utilization

Open pit mines left many regions worldwide inhospitable or uninhabitable...
09/03/2021

Sample Noise Impact on Active Learning

This work explores the effect of noisy sample selection in active learni...
10/29/2019

Understand customer reviews with less data and in short time: pretrained language representation and active learning

In this paper, we address customer review understanding problems by usin...
06/02/2020

Toward Optimal Probabilistic Active Learning Using a Bayesian Approach

Gathering labeled data to train well-performing machine learning models ...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.