DOC: Deep Open Classification of Text Documents

09/25/2017
by   Lei Shu, et al.
0

Traditional supervised learning makes the closed-world assumption that the classes appeared in the test data must have appeared in training. This also applies to text learning or text classification. As learning is used increasingly in dynamic open environments where some new/test documents may not belong to any of the training classes, identifying these novel documents during classification presents an important problem. This problem is called open-world classification or open classification. This paper proposes a novel deep learning based approach. It outperforms existing state-of-the-art techniques dramatically.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/24/2017

HDLTex: Hierarchical Deep Learning for Text Classification

The continually increasing number of documents produced each year necess...
research
11/04/2020

Handwriting Classification for the Analysis of Art-Historical Documents

Digitized archives contain and preserve the knowledge of generations of ...
research
11/17/2020

Measuring the Novelty of Natural Language Text Using the Conjunctive Clauses of a Tsetlin Machine Text Classifier

Most supervised text classification approaches assume a closed world, co...
research
09/23/2020

Text Classification with Novelty Detection

This paper studies the problem of detecting novel or unexpected instance...
research
03/23/2020

Fast(er) Reconstruction of Shredded Text Documents via Self-Supervised Deep Asymmetric Metric Learning

The reconstruction of shredded documents consists in arranging the piece...
research
06/27/2021

Deep Learning for Technical Document Classification

In large technology companies, the requirements for managing and organiz...
research
04/07/2021

OpenGAN: Open-Set Recognition via Open Data Generation

Real-world machine learning systems need to analyze novel testing data t...

Please sign up or login with your details

Forgot password? Click here to reset