Hierarchical Taxonomy-Aware and Attentional Graph Capsule RCNNs for Large-Scale Multi-Label Text Classification

06/09/2019
by Hao Peng, et al.

CNNs, RNNs, GCNs, and CapsNets have proven effective for representation learning and are widely used in text mining tasks such as large-scale multi-label text classification. However, most existing deep models for multi-label text classification capture either non-consecutive, long-distance semantics or sequential semantics, and how to capture both coherently has received less attention. In addition, most existing methods treat output labels as independent and ignore the hierarchical relations among them, which loses useful semantic information. In this paper, we propose a novel hierarchical taxonomy-aware and attentional graph capsule recurrent CNNs framework for large-scale multi-label text classification. Specifically, we first model each document as a word-order-preserved graph-of-words and normalize it into a corresponding words-matrix representation that preserves non-consecutive, long-distance, and local sequential semantics. The words-matrix is then fed to the proposed attentional graph capsule recurrent CNNs to learn semantic features more effectively. To leverage the hierarchical relations among the class labels, we propose a hierarchical taxonomy embedding method to learn label representations, and define a novel weighted margin loss that incorporates label representation similarity. Extensive evaluations on three datasets show that our model significantly improves the performance of large-scale multi-label text classification compared with state-of-the-art approaches.
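To make the first step more concrete, here is a minimal sketch of one common way to build a graph-of-words that keeps word order: a sliding co-occurrence window over the token sequence. The abstract does not specify the construction, so the function name `build_graph_of_words`, the `window` parameter, and the choice to order nodes by first occurrence are all assumptions made for illustration, not the authors' method.

```python
from collections import defaultdict


def build_graph_of_words(tokens, window=3):
    """Build a directed co-occurrence graph from a token list.

    Assumption (not from the paper): each word is linked to the words that
    follow it within `window` positions, and nodes are ordered by the
    position of their first occurrence to retain word order.
    """
    first_pos = {}            # word -> index of its first occurrence
    edges = defaultdict(int)  # (source word, target word) -> co-occurrence count

    for i, word in enumerate(tokens):
        first_pos.setdefault(word, i)
        # Link the current word to the next (window - 1) words.
        for j in range(i + 1, min(i + window, len(tokens))):
            if tokens[j] != word:  # skip self-loops
                edges[(word, tokens[j])] += 1

    # Nodes sorted by first occurrence preserve the original word order.
    nodes = sorted(first_pos, key=first_pos.get)
    return nodes, dict(edges)


tokens = "the cat sat on the mat".split()
nodes, edges = build_graph_of_words(tokens, window=3)
```

Edges between words that are not adjacent in the text stand in for non-consecutive semantics, while the node ordering keeps the local sequence. A larger `window` would link more distant words at the cost of a denser graph.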

