CLaCLab at SocialDisNER: Using Medical Gazetteers for Named-Entity Recognition of Disease Mentions in Spanish Tweets

09/08/2022
by   Harsh Verma, et al.
0

This paper summarizes the CLaC submission for SMM4H 2022 Task 10 which concerns the recognition of diseases mentioned in Spanish tweets. Before classifying each token, we encode each token with a transformer encoder using features from Multilingual RoBERTa Large, UMLS gazetteer, and DISTEMIST gazetteer, among others. We obtain a strict F1 score of 0.869, with competition mean of 0.675, standard deviation of 0.245, and median of 0.761.

READ FULL TEXT

page 1

page 2

page 3

research
11/30/2021

Automatic Extraction of Medication Names in Tweets as Named Entity Recognition

Social media posts contain potentially valuable information about medica...
research
02/19/2023

Exploring the Potential of Machine Translation for Generating Named Entity Datasets: A Case Study between Persian and English

This study focuses on the generation of Persian named entity datasets th...
research
05/30/2018

Bilingual Character Representation for Efficiently Addressing Out-of-Vocabulary Words in Code-Switching Named Entity Recognition

We propose an LSTM-based model with hierarchical architecture on named e...
research
04/15/2021

UIT-E10dot3 at SemEval-2021 Task 5: Toxic Spans Detection with Named Entity Recognition and Question-Answering Approaches

The increment of toxic comments on online space is causing tremendous ef...
research
09/19/2021

Unified and Multilingual Author Profiling for Detecting Haters

This paper presents a unified user profiling framework to identify hate ...
research
04/01/2019

Cyberthreat Detection from Twitter using Deep Neural Networks

To be prepared against cyberattacks, most organizations resort to securi...

Please sign up or login with your details

Forgot password? Click here to reset