BBAEG: Towards BERT-based Biomedical Adversarial Example Generation for Text Classification

04/05/2021
by   Ishani Mondal, et al.
0

Healthcare predictive analytics aids medical decision-making, diagnosis prediction and drug review analysis. Therefore, prediction accuracy is an important criteria which also necessitates robust predictive language models. However, the models using deep learning have been proven vulnerable towards insignificantly perturbed input instances which are less likely to be misclassified by humans. Recent efforts of generating adversaries using rule-based synonyms and BERT-MLMs have been witnessed in general domain, but the ever increasing biomedical literature poses unique challenges. We propose BBAEG (Biomedical BERT-based Adversarial Example Generation), a black-box attack algorithm for biomedical text classification, leveraging the strengths of both domain-specific synonym replacement for biomedical named entities and BERTMLM predictions, spelling variation and number replacement. Through automatic and human evaluation on two datasets, we demonstrate that BBAEG performs stronger attack with better language fluency, semantic coherence as compared to prior work.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/04/2020

BAE: BERT-based Adversarial Examples for Text Classification

Modern text classification models are susceptible to adversarial example...
research
09/23/2021

Breaking BERT: Understanding its Vulnerabilities for Biomedical Named Entity Recognition through Adversarial Attack

Biomedical named entity recognition (NER) is a key task in the extractio...
research
12/20/2020

Explaining Black-box Models for Biomedical Text Classification

In this paper, we propose a novel method named Biomedical Confident Item...
research
09/25/2021

Coreference Resolution for the Biomedical Domain: A Survey

Issues with coreference resolution are one of the most frequently mentio...
research
06/17/2021

Scientific Language Models for Biomedical Knowledge Base Completion: An Empirical Study

Biomedical knowledge graphs (KGs) hold rich information on entities such...
research
06/12/2021

Explaining the Deep Natural Language Processing by Mining Textual Interpretable Features

Despite the high accuracy offered by state-of-the-art deep natural-langu...
research
03/11/2016

Sieve-based Coreference Resolution in the Biomedical Domain

We describe challenges and advantages unique to coreference resolution i...

Please sign up or login with your details

Forgot password? Click here to reset