Medical Coding with Biomedical Transformer Ensembles and Zero/Few-shot Learning

05/01/2022
by   Angelo Ziletti, et al.
0

Medical coding (MC) is an essential pre-requisite for reliable data retrieval and reporting. Given a free-text reported term (RT) such as "pain of right thigh to the knee", the task is to identify the matching lowest-level term (LLT) - in this case "unilateral leg pain" - from a very large and continuously growing repository of standardized medical terms. However, automating this task is challenging due to a large number of LLT codes (as of writing over 80,000), limited availability of training data for long tail/emerging classes, and the general high accuracy demands of the medical domain. With this paper, we introduce the MC task, discuss its challenges, and present a novel approach called xTARS that combines traditional BERT-based classification with a recent zero/few-shot learning approach (TARS). We present extensive experiments that show that our combined approach outperforms strong baselines, especially in the few-shot regime. The approach is developed and deployed at Bayer, live since November 2021. As we believe our approach potentially promising beyond MC, and to ensure reproducibility, we release the code to the research community.

READ FULL TEXT
research
05/03/2021

One Model to Rule them All: Towards Zero-Shot Learning for Databases

In this paper, we present our vision of so called zero-shot learning for...
research
08/13/2021

Generative Zero-Shot Learning for Semantic Segmentation of 3D Point Cloud

While there has been a number of studies on Zero-Shot Learning (ZSL) for...
research
12/16/2019

Transductive Zero-Shot Learning for 3D Point Cloud Classification

Zero-shot learning, the task of learning to recognize new classes not se...
research
02/01/2023

Learning Generalized Zero-Shot Learners for Open-Domain Image Geolocalization

Image geolocalization is the challenging task of predicting the geograph...
research
06/28/2023

Is ChatGPT a Biomedical Expert? – Exploring the Zero-Shot Performance of Current GPT Models in Biomedical Tasks

We assessed the performance of commercial Large Language Models (LLMs) G...
research
10/21/2022

Generalizing over Long Tail Concepts for Medical Term Normalization

Medical term normalization consists in mapping a piece of text to a larg...

Please sign up or login with your details

Forgot password? Click here to reset