Doc2Im: document to image conversion through self-attentive embedding

11/08/2018
by   Mithun Das Gupta, et al.
4

Text classification is a fundamental task in NLP applications. Latest research in this field has largely been divided into two major sub-fields. Learning representations is one sub-field and learning deeper models, both sequential and convolutional, which again connects back to the representation is the other side. We posit the idea that the stronger the representation is, the simpler classifier models are needed to achieve higher performance. In this paper we propose a completely novel direction to text classification research, wherein we convert text to a representation very similar to images, such that any deep network able to handle images is equally able to handle text. We take a deeper look at the representation of documents as an image and subsequently utilize very simple convolution based models taken as is from computer vision domain. This image can be cropped, re-scaled, re-sampled and augmented just like any other image to work with most of the state-of-the-art large convolution based models which have been designed to handle large image datasets. We show impressive results with some of the latest benchmarks in the related fields. We perform transfer learning experiments, both from text to text domain and also from image to text domain. We believe this is a paradigm shift from the way document understanding and text classification has been traditionally done, and will drive numerous novel research ideas in the community.

READ FULL TEXT

page 5

page 6

research
08/30/2019

Sequential Learning of Convolutional Features for Effective Text Classification

Text classification has been one of the major problems in natural langua...
research
08/31/2018

Seeing Colors: Learning Semantic Text Encoding for Classification

The question we answer with this work is: can we convert a text document...
research
05/31/2020

Improve Document Embedding for Text Categorization Through Deep Siamese Neural Network

Due to the increasing amount of data on the internet, finding a highly-i...
research
11/28/2016

Improving Multi-Document Summarization via Text Classification

Developed so far, multi-document summarization has reached its bottlenec...
research
07/27/2023

Gzip versus bag-of-words for text classification with KNN

The effectiveness of compression distance in KNN-based text classificati...
research
08/03/2023

Food Classification using Joint Representation of Visual and Textual Data

Food classification is an important task in health care. In this work, w...
research
08/19/2019

A novel text representation which enables image classifiers to perform text classification, applied to name disambiguation

Patent data are often used to study the process of innovation and resear...

Please sign up or login with your details

Forgot password? Click here to reset