Transfer Learning for Scientific Data Chain Extraction in Small Chemical Corpus with BERT-CRF Model

by Na Pang, et al.

Computational chemistry has developed rapidly in recent years thanks to the growth of and breakthroughs in AI. Progress in natural language processing lets researchers extract more fine-grained knowledge from publications, which in turn stimulates development in computational chemistry. Because existing work and corpora for chemical entity extraction have been restricted to the biomedical and life science fields rather than chemistry itself, we build a new corpus in the chemical bond field annotated with 7 types of entities: compound, solvent, method, bond, reaction, pKa and pKa value. This paper presents a novel BERT-CRF model that builds scientific chemical data chains by extracting these 7 chemical entities and their relations from publications. We also propose a joint model to extract the entities and relations simultaneously. Experimental results on our Chemical Special Corpus demonstrate that we achieve state-of-the-art and competitive NER performance.
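To make the tagging setup concrete, the following is a minimal sketch of how the seven entity types could be cast as a BIO label set and decoded with a Viterbi pass, which is the decoding step a CRF layer typically performs on top of per-token scores. It is not the authors' implementation. The label names, the `viterbi` and `transition_allowed` helpers, and the toy emission scores are all assumptions made for illustration. A trained CRF would use learned transition scores, whereas this sketch only enforces the hard BIO constraint that an `I-` tag must continue an entity of the same type.

```python
# Hypothetical BIO label set over the seven entity types named in the abstract.
ENTITY_TYPES = ["compound", "solvent", "method", "bond", "reaction", "pKa", "pKa_value"]
LABELS = ["O"] + [f"{p}-{t}" for t in ENTITY_TYPES for p in ("B", "I")]

NEG_INF = float("-inf")


def transition_allowed(prev, curr):
    """BIO constraint: I-X may only follow B-X or I-X."""
    if curr.startswith("I-"):
        return prev in (f"B-{curr[2:]}", f"I-{curr[2:]}")
    return True


def viterbi(emissions, labels=LABELS):
    """Return the highest-scoring valid label sequence.

    emissions: one dict per token mapping label -> score (missing labels score 0.0).
    In a BERT-CRF these scores would come from the encoder; here they are toy values.
    """
    # A sequence cannot start inside an entity, so I-* is forbidden at position 0.
    scores = {
        l: (NEG_INF if l.startswith("I-") else emissions[0].get(l, 0.0))
        for l in labels
    }
    backptrs = []
    for em in emissions[1:]:
        new_scores, ptr = {}, {}
        for curr in labels:
            best_prev, best = None, NEG_INF
            for prev in labels:
                if scores[prev] == NEG_INF or not transition_allowed(prev, curr):
                    continue
                if scores[prev] > best:
                    best_prev, best = prev, scores[prev]
            ptr[curr] = best_prev
            new_scores[curr] = NEG_INF if best_prev is None else best + em.get(curr, 0.0)
        backptrs.append(ptr)
        scores = new_scores
    # Follow back-pointers from the best final label.
    path = [max(scores, key=scores.get)]
    for ptr in reversed(backptrs):
        path.append(ptr[path[-1]])
    return path[::-1]


# Toy example: the second token scores highest as I-solvent in isolation,
# but the constrained decoder prefers the path that continues the compound entity.
toy_emissions = [
    {"B-compound": 2.0},
    {"I-solvent": 3.0, "I-compound": 1.5},
    {"O": 1.0},
]
decoded = viterbi(toy_emissions)
```

Decoding over the whole sequence, rather than picking the best label per token, is what lets a CRF layer reject label sequences that are locally attractive but globally invalid.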



AlbNER: A Corpus for Named Entity Recognition in Albanian

Scarcity of resources such as annotated text corpora for under-resourced...

Stress Testing BERT Anaphora Resolution Models for Reaction Extraction in Chemical Patents

The high volume of published chemical patents and the importance of a ti...

Fine-Grained Chemical Entity Typing with Multimodal Knowledge Representation

Automated knowledge discovery from trending chemical literature is essen...

Non-Uniform Gaussian Blur of Hexagonal Bins in Cartesian Coordinates

In a recent application of the Bokeh Python library for visualizing phys...

End-to-End Models for Chemical-Protein Interaction Extraction: Better Tokenization and Span-Based Pipeline Strategies

End-to-end relation extraction (E2ERE) is an important task in informati...

Building a Chatbot on a Closed Domain using RASA

In this study, we build a chatbot system in a closed domain with the RAS...
