Adapting Pretrained Vision-Language Foundational Models to Medical Imaging Domains

10/09/2022
by   Pierre Chambon, et al.
0

Multi-modal foundation models are typically trained on millions of pairs of natural images and text captions, frequently obtained through web-crawling approaches. Although such models depict excellent generative capabilities, they do not typically generalize well to specific domains such as medical images that have fundamentally shifted distributions compared to natural images. Building generative models for medical images that faithfully depict clinical context may help alleviate the paucity of healthcare datasets. Thus, in this study, we seek to research and expand the representational capabilities of large pretrained foundation models to medical concepts, specifically for leveraging the Stable Diffusion model to generate domain specific images found in medical imaging. We explore the sub-components of the Stable Diffusion pipeline (the variational autoencoder, the U-Net and the text-encoder) to fine-tune the model to generate medical images. We benchmark the efficacy of these efforts using quantitative image quality metrics and qualitative radiologist-driven evaluations that accurately represent the clinical content of conditional text prompts. Our best-performing model improves upon the stable diffusion baseline and can be conditioned to insert a realistic-looking abnormality on a synthetic radiology image, while maintaining a 95 a classifier trained to detect the abnormality.

READ FULL TEXT

page 1

page 3

page 7

page 10

page 11

page 15

page 16

research
11/23/2022

RoentGen: Vision-Language Foundation Model for Chest X-ray Generation

Multimodal models trained on large natural image-text pair datasets have...
research
11/02/2022

Spot the fake lungs: Generating Synthetic Medical Images using Neural Diffusion Models

Generative models are becoming popular for the synthesis of medical imag...
research
06/16/2023

Aligning Synthetic Medical Images with Clinical Knowledge using Human Feedback

Generative models capable of capturing nuanced clinical features in medi...
research
08/19/2023

DUAW: Data-free Universal Adversarial Watermark against Stable Diffusion Customization

Stable Diffusion (SD) customization approaches enable users to personali...
research
05/12/2023

Beware of diffusion models for synthesizing medical images – A comparison with GANs in terms of memorizing brain tumor images

Diffusion models were initially developed for text-to-image generation a...
research
03/03/2023

Robust Detection Outcome: A Metric for Pathology Detection in Medical Images

Detection of pathologies is a fundamental task in medical imaging and th...
research
10/09/2021

Exploring constraints on CycleGAN-based CBCT enhancement for adaptive radiotherapy

Research exploring CycleGAN-based synthetic image generation has recentl...

Please sign up or login with your details

Forgot password? Click here to reset