The Application of Active Query K-Means in Text Classification

07/16/2021
by   Yukun Jiang, et al.
0

Active learning is a state-of-art machine learning approach to deal with an abundance of unlabeled data. In the field of Natural Language Processing, typically it is costly and time-consuming to have all the data annotated. This inefficiency inspires out our application of active learning in text classification. Traditional unsupervised k-means clustering is first modified into a semi-supervised version in this research. Then, a novel attempt is applied to further extend the algorithm into active learning scenario with Penalized Min-Max-selection, so as to make limited queries that yield more stable initial centroids. This method utilizes both the interactive query results from users and the underlying distance representation. After tested on a Chinese news dataset, it shows a consistent increase in accuracy while lowering the cost in training.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/21/2021

Small-text: Active Learning for Text Classification in Python

We present small-text, a simple modular active learning library, which o...
research
02/08/2023

CRL+: A Novel Semi-Supervised Deep Active Contrastive Representation Learning-Based Text Classification Model for Insurance Data

Financial sector and especially the insurance industry collect vast volu...
research
07/29/2021

Semi-Supervised Active Learning with Temporal Output Discrepancy

While deep learning succeeds in a wide range of tasks, it highly depends...
research
01/31/2021

A Simple yet Brisk and Efficient Active Learning Platform for Text Classification

In this work, we propose the use of a fully managed machine learning ser...
research
08/22/2019

Active Learning for Chinese Word Segmentation in Medical Text

Electronic health records (EHRs) stored in hospital information systems ...
research
04/27/2021

Multi-class Text Classification using BERT-based Active Learning

Text Classification finds interesting applications in the pickup and del...
research
05/12/2021

Mining Legacy Issues in Open Pit Mining Sites: Innovation Support of Renaturalization and Land Utilization

Open pit mines left many regions worldwide inhospitable or uninhabitable...

Please sign up or login with your details

Forgot password? Click here to reset