From POS tagging to dependency parsing for biomedical event extraction

08/11/2018
by Dat Quoc Nguyen, et al.

Relation and event extraction from biomedical research publications supports knowledge capture and synthesis, and approaches to this information extraction task depend strongly on syntactic information. It is therefore valuable to understand which approaches to syntactic processing of biomedical text perform best. In this paper, we perform an empirical study comparing state-of-the-art traditional feature-based and neural network-based models for two core NLP tasks, POS tagging and dependency parsing, on two benchmark biomedical corpora, GENIA and CRAFT. To the best of our knowledge, no recent work makes such comparisons in the biomedical context; in particular, no detailed analysis of neural models on this data is available. We also perform a task-oriented evaluation to investigate the influence of these models on a downstream biomedical event extraction application.

research
08/17/2020

Comparison of Syntactic Parsers on Biomedical Texts

Syntactic parsing is an important step in the automated text analysis wh...
research
11/03/2016

An empirical study for Vietnamese dependency parsing

This paper presents an empirical comparison of different dependency pars...
research
04/25/2017

Joint POS Tagging and Dependency Parsing with Transition-based Neural Networks

While part-of-speech (POS) tagging and dependency parsing are observed t...
research
05/02/2019

Context awareness and embedding for biomedical event extraction

Motivation: Biomedical event detection is fundamental for information ex...
research
03/27/2023

An Information Extraction Study: Take In Mind the Tokenization!

Current research on the advantages and trade-offs of using characters, i...
research
10/22/2019

A Search-based Neural Model for Biomedical Nested and Overlapping Event Detection

We tackle the nested and overlapping event detection task and propose a ...
research
05/15/2023

Comparing Variation in Tokenizer Outputs Using a Series of Problematic and Challenging Biomedical Sentences

Background Objective: Biomedical text data are increasingly availabl...
