Discourse Tagging for Scientific Evidence Extraction

09/10/2019
by   Xiangci Li, et al.
0

The biomedical scientific literature comprises a crucial, sometimes life-saving, natural language resource whose size is accelerating over time. The information in this resource tends to follow a style of discourse that is intended to provide scientific explanations for various pieces of evidence derived from experimental findings. Studying the rhetorical structure of the narrative discourse could enable more powerful information extraction methods to automatically construct models of scientific argument from full-text papers. In this paper, we apply richly contextualized deep representation learning to the analysis of scientific discourse structures as a clause-tagging task. We improve the current state-of-the-art clause-level sequence tagging over text clauses for a set of discourse types (e.g. "hypothesis", "result", "implication", etc.) on scientific paragraphs. Our model uses contextualized embeddings, word-to-clause encoder, and clause-level sequence tagging models and achieves F1 performance of 0.784.

READ FULL TEXT
research
07/01/2019

Claim Extraction in Biomedical Publications using Deep Discourse Model and Transfer Learning

Claims are a fundamental unit of scientific discourse. The exponential g...
research
06/12/2017

Scientific document summarization via citation contextualization and scientific discourse

The rapid growth of scientific literature has made it difficult for the ...
research
04/20/2021

StateCensusLaws.org: A Web Application for Consuming and Annotating Legal Discourse Learning

In this work, we create a web application to highlight the output of NLP...
research
08/29/2019

Scientific Statement Classification over arXiv.org

We introduce a new classification task for scientific statements and rel...
research
09/07/2017

Leveraging Discourse Information Effectively for Authorship Attribution

We explore techniques to maximize the effectiveness of discourse informa...
research
11/21/2018

Resource Mention Extraction for MOOC Discussion Forums

In discussions hosted on discussion forums for MOOCs, references to onli...
research
05/11/2020

Segmenting Scientific Abstracts into Discourse Categories: A Deep Learning-Based Approach for Sparse Labeled Data

The abstract of a scientific paper distills the contents of the paper in...

Please sign up or login with your details

Forgot password? Click here to reset