Rank over Class: The Untapped Potential of Ranking in Natural Language Processing

09/10/2020
by   Amir Atapour Abarghouei, et al.
0

Text classification has long been a staple in natural language processing with applications spanning across sentiment analysis, online content tagging, recommender systems and spam detection. However, text classification, by nature, suffers from a variety of issues stemming from dataset imbalance, text ambiguity, subjectivity and the lack of linguistic context in the data. In this paper, we explore the use of text ranking, commonly used in information retrieval, to carry out challenging classification-based tasks. We propose a novel end-to-end ranking approach consisting of a Transformer network responsible for producing representations for a pair of text sequences, which are in turn passed into a context aggregating network outputting ranking scores used to determine an ordering to the sequences based on some notion of relevance. We perform numerous experiments on publicly-available datasets and investigate the possibility of applying our ranking approach to certain problems often addressed using classification. In an experiment on a heavily-skewed sentiment analysis dataset, converting ranking results to classification labels yields an approximately 22 state-of-the-art text classification, demonstrating the efficacy of text ranking over text classification in certain scenarios.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/07/2023

USA: Universal Sentiment Analysis Model Construction of Japanese Sentiment Text Classification and Part of Speech Dataset

Sentiment analysis is a pivotal task in the domain of natural language p...
research
02/05/2023

A Semantic Approach to Negation Detection and Word Disambiguation with Natural Language Processing

This study aims to demonstrate the methods for detecting negations in a ...
research
10/06/2022

Adaptive Ranking-based Sample Selection for Weakly Supervised Class-imbalanced Text Classification

To obtain a large amount of training labels inexpensively, researchers h...
research
07/18/2022

Deep Sequence Models for Text Classification Tasks

The exponential growth of data generated on the Internet in the current ...
research
04/04/2023

Polarity based Sarcasm Detection using Semigraph

Sarcasm is an advanced linguistic expression often found on various onli...
research
08/26/2020

SHAP values for Explaining CNN-based Text Classification Models

Deep neural networks are increasingly used in natural language processin...
research
06/10/2019

A cost-reducing partial labeling estimator in text classification problem

We propose a new approach to address the text classification problems wh...

Please sign up or login with your details

Forgot password? Click here to reset