Imbalanced Multi-label Classification for Business-related Text with Moderately Large Label Spaces

06/12/2023
by   Muhammad Arslan, et al.
0

In this study, we compared the performance of four different methods for multi label text classification using a specific imbalanced business dataset. The four methods we evaluated were fine tuned BERT, Binary Relevance, Classifier Chains, and Label Powerset. The results show that fine tuned BERT outperforms the other three methods by a significant margin, achieving high values of accuracy, F1 Score, Precision, and Recall. Binary Relevance also performs well on this dataset, while Classifier Chains and Label Powerset demonstrate relatively poor performance. These findings highlight the effectiveness of fine tuned BERT for multi label text classification tasks, and suggest that it may be a useful tool for businesses seeking to analyze complex and multifaceted texts.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/25/2023

MatchXML: An Efficient Text-label Matching Framework for Extreme Multi-label Text Classification

The eXtreme Multi-label text Classification(XMC) refers to training a cl...
research
03/11/2022

verBERT: Automating Brazilian Case Law Document Multi-label Categorization Using BERT

In this work, we carried out a study about the use of attention-based al...
research
03/16/2020

Cost-Sensitive BERT for Generalisable Sentence Classification with Imbalanced Data

The automatic identification of propaganda has gained significance in re...
research
04/12/2021

WHOSe Heritage: Classification of UNESCO World Heritage "Outstanding Universal Value" Documents with Smoothed Labels

The UNESCO World Heritage List (WHL) is to identify the exceptionally va...
research
07/21/2023

DEFTri: A Few-Shot Label Fused Contextual Representation Learning For Product Defect Triage in e-Commerce

Defect Triage is a time-sensitive and critical process in a large-scale ...
research
03/22/2021

Hybrid Model for Patent Classification using Augmented SBERT and KNN

Purpose: This study aims to provide a hybrid approach for patent claim c...
research
02/04/2022

Extracting Software Requirements from Unstructured Documents

Requirements identification in textual documents or extraction is a tedi...

Please sign up or login with your details

Forgot password? Click here to reset