BioNLI: Generating a Biomedical NLI Dataset Using Lexico-semantic Constraints for Adversarial Examples

10/26/2022
by   Mohaddeseh Bastan, et al.
0

Natural language inference (NLI) is critical for complex decision-making in biomedical domain. One key question, for example, is whether a given biomedical mechanism is supported by experimental evidence. This can be seen as an NLI problem but there are no directly usable datasets to address this. The main challenge is that manually creating informative negative examples for this task is difficult and expensive. We introduce a novel semi-supervised procedure that bootstraps an NLI dataset from existing biomedical dataset that pairs mechanisms with experimental evidence in abstracts. We generate a range of negative examples using nine strategies that manipulate the structure of the underlying mechanisms both with rules, e.g., flip the roles of the entities in the interaction, and, more importantly, as perturbations via logical constraints in a neuro-logical decoding system. We use this procedure to create a novel dataset for NLI in the biomedical domain, called BioNLI and benchmark two state-of-the-art biomedical classifiers. The best result we obtain is around mid 70s in F1, suggesting the difficulty of the task. Critically, the performance on the different classes of negative examples varies widely, from 97 chance on the negative examples generated using neuro-logic decoding.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/10/2022

SuMe: A Dataset Towards Summarizing Biomedical Mechanisms

Can language models read biomedical texts and explain the biomedical mec...
research
08/26/2018

Adversarially Regularising Neural NLI Models to Integrate Logical Background Knowledge

Adversarial examples are inputs to machine learning models designed to c...
research
05/21/2022

UVA Resources for the Biomedical Vocabulary Alignment at Scale in the UMLS Metathesaurus

The construction and maintenance process of the UMLS (Unified Medical La...
research
09/27/2021

Discovering Drug-Target Interaction Knowledge from Biomedical Literature

The Interaction between Drugs and Targets (DTI) in human body plays a cr...
research
05/01/2020

Biomedical Entity Representations with Synonym Marginalization

Biomedical named entities often play important roles in many biomedical ...
research
11/04/2019

On the Effectiveness of the Pooling Methods for Biomedical Relation Extraction with Deep Learning

Deep learning models have achieved state-of-the-art performances on many...
research
07/24/2017

Domain Recursion for Lifted Inference with Existential Quantifiers

In recent work, we proved that the domain recursion inference rule makes...

Please sign up or login with your details

Forgot password? Click here to reset