Mixture of Expert/Imitator Networks: Scalable Semi-supervised Learning Framework

10/13/2018
by   Shun Kiyono, et al.
0

The current success of deep neural networks (DNNs) in an increasingly broad range of tasks for the artificial intelligence strongly depends on the quality and quantity of labeled training data. In general, the scarcity of labeled data, which is often observed in many natural language processing tasks, is one of the most important issues to be addressed. Semi-supervised learning (SSL) is a promising approach to overcome this issue by incorporating a large amount of unlabeled data. In this paper, we propose a novel scalable method of SSL for text classification tasks. The unique property of our method, Mixture of Expert/Imitator Networks, is that imitator networks learn to "imitate" the estimated label distribution of the expert network over the unlabeled data, which potentially contributes as a set of features for the classification. Our experiments demonstrate that the proposed method consistently improves the performance of several types of baseline DNNs. We also demonstrate that our method has the more data, better performance property with promising scalability to the unlabeled data.

READ FULL TEXT

page 10

page 11

research
04/17/2018

Reinforced Co-Training

Co-training is a popular semi-supervised learning framework to utilize a...
research
02/27/2018

Semi-Supervised Learning Enabled by Multiscale Deep Neural Network Inversion

Deep Neural Networks (DNNs) provide state-of-the-art solutions in severa...
research
03/23/2016

A Tutorial on Deep Neural Networks for Intelligent Systems

Developing Intelligent Systems involves artificial intelligence approach...
research
11/05/2022

Learning to Infer from Unlabeled Data: A Semi-supervised Learning Approach for Robust Natural Language Inference

Natural Language Inference (NLI) or Recognizing Textual Entailment (RTE)...
research
07/22/2022

Efficient Testing of Deep Neural Networks via Decision Boundary Analysis

Deep learning plays a more and more important role in our daily life due...
research
05/31/2023

A rule-general abductive learning by rough sets

In real-world tasks, there is usually a large amount of unlabeled data a...
research
03/14/2021

Semi-Supervised Video Deraining with Dynamic Rain Generator

While deep learning (DL)-based video deraining methods have achieved sig...

Please sign up or login with your details

Forgot password? Click here to reset