Large-Scale Multi-Label Text Classification on EU Legislation

06/05/2019
by   Ilias Chalkidis, et al.
0

We consider Large-Scale Multi-Label Text Classification (LMTC) in the legal domain. We release a new dataset of 57k legislative documents from EURLEX, annotated with 4.3k EUROVOC labels, which is suitable for LMTC, few- and zero-shot learning. Experimenting with several neural classifiers, we show that BIGRUs with label-wise attention perform better than other current state of the art methods. Domain-specific WORD2VEC and context-sensitive ELMO embeddings further improve performance. We also find that considering only particular zones of the documents is sufficient. This allows us to bypass BERT's maximum text length limit and fine-tune BERT, obtaining the best results in all but zero-shot learning cases.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/26/2019

Extreme Multi-Label Legal Text Classification: A case study in EU Legislation

We consider the task of Extreme Multi-Label Text Classification (XMTC) i...
research
10/04/2020

An Empirical Study on Large-Scale Multi-Label Text Classification Including Few and Zero-Shot Labels

Large-scale Multi-label Text Classification (LMTC) has a wide range of N...
research
07/21/2023

DEFTri: A Few-Shot Label Fused Contextual Representation Learning For Product Defect Triage in e-Commerce

Defect Triage is a time-sensitive and critical process in a large-scale ...
research
03/02/2023

Adopting the Multi-answer Questioning Task with an Auxiliary Metric for Extreme Multi-label Text Classification Utilizing the Label Hierarchy

Extreme multi-label text classification utilizes the label hierarchy to ...
research
08/30/2022

Flexible Job Classification with Zero-Shot Learning

Using a taxonomy to organize information requires classifying objects (d...
research
09/28/2019

Generalized Zero-shot ICD Coding

The International Classification of Diseases (ICD) is a list of classifi...
research
04/01/2020

Deep Learning Based Multi-Label Text Classification of UNGA Resolutions

The main goal of this research is to produce a useful software for Unite...

Please sign up or login with your details

Forgot password? Click here to reset