A Biomedical Entity Extraction Pipeline for Oncology Health Records in Portuguese

04/18/2023
by   Hugo Sousa, et al.
0

Textual health records of cancer patients are usually protracted and highly unstructured, making it very time-consuming for health professionals to get a complete overview of the patient's therapeutic course. As such limitations can lead to suboptimal and/or inefficient treatment procedures, healthcare providers would greatly benefit from a system that effectively summarizes the information of those records. With the advent of deep neural models, this objective has been partially attained for English clinical texts, however, the research community still lacks an effective solution for languages with limited resources. In this paper, we present the approach we developed to extract procedures, drugs, and diseases from oncology health records written in European Portuguese. This project was conducted in collaboration with the Portuguese Institute for Oncology which, besides holding over 10 years of duly protected medical records, also provided oncologist expertise throughout the development of the project. Since there is no annotated corpus for biomedical entity extraction in Portuguese, we also present the strategy we followed in annotating the corpus for the development of the models. The final models, which combined a neural architecture with entity linking, achieved F_1 scores of 88.6, 95.0, and 55.8 per cent in the mention extraction of procedures, drugs, and diseases, respectively.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/16/2020

Comparing Rule-based, Feature-based and Deep Neural Methods for De-identification of Dutch Medical Records

Unstructured information in electronic health records provide an invalua...
research
02/24/2022

An NLP Solution to Foster the Use of Information in Electronic Health Records for Efficiency in Decision-Making in Hospital Care

The project aimed to define the rules and develop a technological soluti...
research
07/17/2017

PDD Graph: Bridging Electronic Medical Records and Biomedical Knowledge Graphs via Entity Linking

Electronic medical records contain multi-format electronic medical data ...
research
09/05/2023

Inferring Actual Treatment Pathways from Patient Records

Treatment pathways are step-by-step plans outlining the recommended medi...
research
10/07/2020

COMETA: A Corpus for Medical Entity Linking in the Social Media

Whilst there has been growing progress in Entity Linking (EL) for genera...
research
01/05/2022

Mining Adverse Drug Reactions from Unstructured Mediums at Scale

Adverse drug reactions / events (ADR/ADE) have a major impact on patient...

Please sign up or login with your details

Forgot password? Click here to reset