TERMinator: A system for scientific texts processing

09/29/2022
by Elena Bruches, et al.

This paper addresses the extraction of entities and the semantic relations between them from scientific texts, where scientific terms are treated as entities. We present a dataset annotated for both tasks and develop a system called TERMinator to study the influence of language models on term recognition and to compare different approaches to relation extraction. Experiments show that language models pre-trained on the target language do not always show the best performance. In addition, adding some heuristic approaches may improve the overall quality on a particular task. The developed tool and the annotated corpus are publicly available at https://github.com/iis-research-team/terminator and may be useful for other researchers.
