Reinforced Co-Training

04/17/2018
by   Jiawei Wu, et al.
0

Co-training is a popular semi-supervised learning framework to utilize a large amount of unlabeled data in addition to a small labeled set. Co-training methods exploit predicted labels on the unlabeled data and select samples based on prediction confidence to augment the training. However, the selection of samples in existing co-training methods is based on a predetermined policy, which ignores the sampling bias between the unlabeled and the labeled subsets, and fails to explore the data space. In this paper, we propose a novel method, Reinforced Co-Training, to select high-quality unlabeled samples to better co-train on. More specifically, our approach uses Q-learning to learn a data selection policy with a small labeled dataset, and then exploits this policy to train the co-training classifiers automatically. Experimental results on clickbait detection and generic text classification tasks demonstrate that our proposed method can obtain more accurate text classification results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/13/2018

Mixture of Expert/Imitator Networks: Scalable Semi-supervised Learning Framework

The current success of deep neural networks (DNNs) in an increasingly br...
research
01/21/2022

Pseudo-Labeled Auto-Curriculum Learning for Semi-Supervised Keypoint Localization

Localizing keypoints of an object is a basic visual problem. However, su...
research
02/04/2020

Iterative Data Programming for Expanding Text Classification Corpora

Real-world text classification tasks often require many labeled training...
research
02/16/2019

CruzAffect at AffCon 2019 Shared Task: A feature-rich approach to characterize happiness

We present our system, CruzAffect, for the CL-Aff Shared Task 2019. Cruz...
research
06/27/2020

Uncertainty-aware Self-training for Text Classification with Few Labels

Recent success of large-scale pre-trained language models crucially hing...
research
08/18/2013

Reference Distance Estimator

A theoretical study is presented for a simple linear classifier called r...
research
05/11/2018

Textual Membership Queries

Human labeling of textual data can be very time-consuming and expensive,...

Please sign up or login with your details

Forgot password? Click here to reset