ML-Net: multi-label classification of biomedical texts with deep neural networks

11/13/2018
by   Jingcheng Du, et al.
0

Background: Multi-label text classification is one type of text classification where each text can be assigned with one or more labels. Multi-label text classification, which has broad applications in biomedical domain, is often considered harder than other types of text classification, as each textual document can be assigned with indeterminate number of labels. Methods: In this work, we propose ML-Net, a novel end-to-end deep learning framework, for multi-label classification of biomedical tasks. ML-Net combines the label prediction network with a label count prediction network, which can determine the output labels based on both label confidence scores and document context in an end-to-end manner. We evaluated the ML-Net on publicly available multi-label biomedical text classification tasks from both biomedical literature domain and clinical domain. Example-based metrics including precision, recall and f-measure were calculated. We compared the ML-NET with both traditional machine learning baseline models as well as classic deep learning models. The codes are available at: https://github.com/jingcheng-du/ML_Net Results & Conclusions: The deep neural network based approaches showed superiority compared to traditional machine learning algorithms in multi-label classification of biomedical texts. Deep neural networks alleviate human efforts for feature engineering, and avoid building individual classifiers for each label. ML-NET is able to dynamically estimate the label count based on the document context in a more accurate manner.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/14/2023

Label Dependencies-aware Set Prediction Networks for Multi-label Text Classification

Multi-label text classification aims to extract all the related labels f...
research
08/21/2022

Automatic tagging of knowledge points for K12 math problems

Automatic tagging of knowledge points for practice problems is the basis...
research
06/09/2016

Large scale biomedical texts classification: a kNN and an ESA-based approaches

With the large and increasing volume of textual data, automated methods ...
research
01/13/2017

Deep Neural Networks for Czech Multi-label Document Classification

This paper is focused on automatic multi-label document classification o...
research
12/03/2022

Harnessing label semantics to extract higher performance under noisy label for Company to Industry matching

Assigning appropriate industry tag(s) to a company is a critical task in...
research
09/10/2021

CoPHE: A Count-Preserving Hierarchical Evaluation Metric in Large-Scale Multi-Label Text Classification

Large-Scale Multi-Label Text Classification (LMTC) includes tasks with h...
research
03/19/2021

Sewer-ML: A Multi-Label Sewer Defect Classification Dataset and Benchmark

Perhaps surprisingly sewerage infrastructure is one of the most costly i...

Please sign up or login with your details

Forgot password? Click here to reset