LESA: Linguistic Encapsulation and Semantic Amalgamation Based Generalised Claim Detection from Online Content

by   Shreya Gupta, et al.

The conceptualization of a claim lies at the core of argument mining. The segregation of claims is complex, owing to the divergence in textual syntax and context across different distributions. Another pressing issue is the unavailability of labeled unstructured text for experimentation. In this paper, we propose LESA, a framework which aims at advancing headfirst into expunging the former issue by assembling a source-independent generalized model that captures syntactic features through part-of-speech and dependency embeddings, as well as contextual features through a fine-tuned language model. We resolve the latter issue by annotating a Twitter dataset which aims at providing a testing ground on a large unstructured dataset. Experimental results show that LESA improves upon the state-of-the-art performance across six benchmark claim datasets by an average of 3 claim-F1 points for in-domain experiments and by 2 claim-F1 points for general-domain experiments. On our dataset too, LESA outperforms existing baselines by 1 claim-F1 point on the in-domain experiments and 2 claim-F1 points on the general-domain experiments. We also release comprehensive data annotation guidelines compiled during the annotation phase (which was missing in the current literature).


page 1

page 2

page 3

page 4


IMHO Fine-Tuning Improves Claim Detection

Claims are the central component of an argument. Detecting claims across...

Stance Prediction and Claim Verification: An Arabic Perspective

This work explores the application of textual entailment in news claim v...

Claim Extraction in Biomedical Publications using Deep Discourse Model and Transfer Learning

Claims are a fundamental unit of scientific discourse. The exponential g...

Towards Automated Factchecking: Developing an Annotation Schema and Benchmark for Consistent Automated Claim Detection

In an effort to assist factcheckers in the process of factchecking, we t...

NewsClaims: A New Benchmark for Claim Detection from News with Background Knowledge

Claim detection and verification are crucial for news understanding and ...

FiNCAT: Financial Numeral Claim Analysis Tool

While making investment decisions by reading financial documents, invest...

Aggregating Pairwise Semantic Differences for Few-Shot Claim Veracity Classification

As part of an automated fact-checking pipeline, the claim veracity class...

Please sign up or login with your details

Forgot password? Click here to reset