An Entity-based Claim Extraction Pipeline for Real-world Biomedical Fact-checking

04/11/2023
by   Amelie Wührl, et al.
10

Existing fact-checking models for biomedical claims are typically trained on synthetic or well-worded data and hardly transfer to social media content. This mismatch can be mitigated by adapting the social media input to mimic the focused nature of common training claims. To do so, Wuehrl Klinger (2022) propose to extract concise claims based on medical entities in the text. However, their study has two limitations: First, it relies on gold-annotated entities. Therefore, its feasibility for a real-world application cannot be assessed since this requires detecting relevant entities automatically. Second, they represent claim entities with the original tokens. This constitutes a terminology mismatch which potentially limits the fact-checking performance. To understand both challenges, we propose a claim extraction pipeline for medical tweets that incorporates named entity recognition and terminology normalization via entity linking. We show that automatic NER does lead to a performance drop in comparison to using gold annotations but the fact-checking performance still improves considerably over inputting the unchanged tweets. Normalizing entities to their canonical forms does, however, not improve the performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/16/2022

Entity-based Claim Representation Improves Fact-Checking of Medical Content in Tweets

False medical information on social media poses harm to people's health....
research
04/23/2021

Claim Detection in Biomedical Twitter Posts

Social media contains unfiltered and unique information, which is potent...
research
04/26/2022

CoVERT: A Corpus of Fact-checked Biomedical COVID-19 Tweets

Over the course of the COVID-19 pandemic, large volumes of biomedical in...
research
09/14/2021

Tribrid: Stance Classification with Neural Inconsistency Detection

We study the problem of performing automatic stance classification on so...
research
06/05/2023

Rebooting Internet Immunity

We do everything online. We shop, travel, invest, socialize, and even ho...
research
04/21/2022

Recovering Patient Journeys: A Corpus of Biomedical Entities and Relations on Twitter (BEAR)

Text mining and information extraction for the medical domain has focuse...
research
06/09/2020

Extensive Error Analysis and a Learning-Based Evaluation of Medical Entity Recognition Systems to Approximate User Experience

When comparing entities extracted by a medical entity recognition system...

Please sign up or login with your details

Forgot password? Click here to reset