EPICURE Ensemble Pretrained Models for Extracting Cancer Mutations from Literature

06/11/2021
by   Jiarun Cao, et al.
0

To interpret the genetic profile present in a patient sample, it is necessary to know which mutations have important roles in the development of the corresponding cancer type. Named entity recognition is a core step in the text mining pipeline which facilitates mining valuable cancer information from the scientific literature. However, due to the scarcity of related datasets, previous NER attempts in this domain either suffer from low performance when deep learning based models are deployed, or they apply feature based machine learning models or rule based models to tackle this problem, which requires intensive efforts from domain experts, and limit the model generalization capability. In this paper, we propose EPICURE, an ensemble pre trained model equipped with a conditional random field pattern layer and a span prediction pattern layer to extract cancer mutations from text. We also adopt a data augmentation strategy to expand our training set from multiple datasets. Experimental results on three benchmark datasets show competitive results compared to the baseline models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/29/2022

SFE-AI at SemEval-2022 Task 11: Low-Resource Named Entity Recognition using Large Pre-trained Language Models

Large scale pre-training models have been widely used in named entity re...
research
07/07/2022

Win-Win Cooperation: Bundling Sequence and Span Models for Named Entity Recognition

For Named Entity Recognition (NER), sequence labeling-based and span-bas...
research
10/09/2022

Deep Span Representations for Named Entity Recognition

Span-based models are one of the most straightforward methods for named ...
research
10/11/2022

SEE-Few: Seed, Expand and Entail for Few-shot Named Entity Recognition

Few-shot named entity recognition (NER) aims at identifying named entiti...
research
08/09/2022

An Embarrassingly Easy but Strong Baseline for Nested Named Entity Recognition

Named entity recognition (NER) is the task to detect and classify the en...
research
11/08/2019

SEPT: Improving Scientific Named Entity Recognition with Span Representation

We introduce a new scientific named entity recognizer called SEPT, which...
research
04/21/2022

TEAM-Atreides at SemEval-2022 Task 11: On leveraging data augmentation and ensemble to recognize complex Named Entities in Bangla

Many areas, such as the biological and healthcare domain, artistic works...

Please sign up or login with your details

Forgot password? Click here to reset