Large-scale Generative Modeling to Improve Automated Veterinary Disease Coding

11/29/2018
by   Yuhui Zhang, et al.
0

Supervised learning is limited both by the quantity and quality of the labeled data. In the field of medical record tagging, writing styles between hospitals vary drastically. The knowledge learned from one hospital might not transfer well to another. This problem is amplified in veterinary medicine domain because veterinary clinics rarely apply medical codes to their records. We proposed and trained the first large-scale generative modeling algorithm in automated disease coding. We demonstrate that generative modeling can learn discriminative features when additionally trained with supervised fine-tuning. We systematically ablate and evaluate the effect of generative modeling on the final system's performance. We compare the performance of our model with several baselines in a challenging cross-hospital setting with substantial domain shift. We outperform competitive baselines by a large margin. In addition, we provide interpretation for what is learned by our model.

READ FULL TEXT

page 9

page 10

research
06/18/2021

Semi-supervised Optimal Transport with Self-paced Ensemble for Cross-hospital Sepsis Early Detection

The utilization of computer technology to solve problems in medical scen...
research
11/25/2021

Amortized Prompt: Lightweight Fine-Tuning for CLIP in Domain Generalization

Domain generalization (DG) is a difficult transfer learning problem aimi...
research
08/27/2023

Cheap Lunch for Medical Image Segmentation by Fine-tuning SAM on Few Exemplars

The Segment Anything Model (SAM) has demonstrated remarkable capabilitie...
research
06/28/2018

DeepTag: inferring all-cause diagnoses from clinical notes in under-resourced medical domain

In many under-resourced settings, clinicians lack time and expertise to ...
research
12/30/2020

Knowledge Distillation with Adaptive Asymmetric Label Sharpening for Semi-supervised Fracture Detection in Chest X-rays

Exploiting available medical records to train high performance computer-...
research
06/30/2022

Learning Underrepresented Classes from Decentralized Partially Labeled Medical Images

Using decentralized data for federated training is one promising emergin...

Please sign up or login with your details

Forgot password? Click here to reset