Contrastive Learning of Medical Visual Representations from Paired Images and Text

10/02/2020
by Yuhao Zhang, et al.

Learning visual representations of medical images is core to medical image understanding, but progress has been held back by the small size of hand-labeled datasets. Existing work commonly relies either on transferring weights from ImageNet pretraining, which is suboptimal because the image characteristics differ drastically, or on rule-based label extraction from the textual reports paired with medical images, which is inaccurate and hard to generalize. We propose an alternative unsupervised strategy that learns medical visual representations directly from the naturally occurring pairing of images and textual data. Our method pretrains medical image encoders on the paired text data via a bidirectional contrastive objective between the two modalities; it is domain-agnostic and requires no additional expert input. We test our method by transferring our pretrained weights to 4 medical image classification tasks and 2 zero-shot retrieval tasks, and show that it leads to image representations that considerably outperform strong baselines in most settings. Notably, in all 4 classification tasks, our method requires only 10% as much labeled training data as an ImageNet-initialized counterpart to achieve better or comparable performance, demonstrating superior data efficiency.
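
To make the idea of a bidirectional contrastive objective more concrete, here is a minimal sketch in plain Python. The abstract does not specify the exact loss, so everything below is an assumption for illustration: the use of cosine similarity, the temperature `tau`, the mixing `weight` between the image-to-text and text-to-image terms, and the function names themselves are hypothetical and not taken from the paper. Each image embedding is treated as a query that should pick out its own paired text embedding among all texts in the batch, and vice versa.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def directional_loss(queries, keys, tau):
    """Mean InfoNCE-style loss in which queries[i] should match keys[i]."""
    total = 0.0
    for i, q in enumerate(queries):
        logits = [cosine(q, k) / tau for k in keys]
        # Numerically stable log-sum-exp over all candidate keys.
        m = max(logits)
        log_denom = m + math.log(sum(math.exp(l - m) for l in logits))
        # Negative log-probability of the true pair.
        total += log_denom - logits[i]
    return total / len(queries)

def bidirectional_contrastive_loss(image_emb, text_emb, tau=0.1, weight=0.5):
    """Weighted sum of image-to-text and text-to-image losses.

    tau and weight are assumed hyperparameters, not values from the source.
    """
    image_to_text = directional_loss(image_emb, text_emb, tau)
    text_to_image = directional_loss(text_emb, image_emb, tau)
    return weight * image_to_text + (1.0 - weight) * text_to_image

# Toy batch of two image/text pairs (made-up 2-D embeddings).
images = [[1.0, 0.0], [0.0, 1.0]]
texts_aligned = [[0.9, 0.1], [0.1, 0.9]]
texts_shuffled = [[0.1, 0.9], [0.9, 0.1]]

loss_aligned = bidirectional_contrastive_loss(images, texts_aligned)
loss_shuffled = bidirectional_contrastive_loss(images, texts_shuffled)
```

On this toy batch, `loss_aligned` comes out much smaller than `loss_shuffled`, which is the behavior such an objective is meant to reward: embeddings of true image/text pairs are pulled together relative to mismatched pairs in the same batch.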


research
06/07/2023

Generative Text-Guided 3D Vision-Language Pretraining for Unified Medical Image Segmentation

Vision-Language Pretraining (VLP) has demonstrated remarkable capabiliti...
research
07/15/2020

Comparing to Learn: Surpassing ImageNet Pretraining on Radiographs By Comparing Image Representations

In deep learning era, pretrained models play an important role in medica...
research
03/23/2023

Increasing Textual Context Size Boosts Medical Image-Text Matching

This short technical report demonstrates a simple technique that yields ...
research
07/13/2021

Cats, not CAT scans: a study of dataset similarity in transfer learning for 2D medical image classification

Transfer learning is a commonly used strategy for medical image classifi...
research
08/05/2022

RadTex: Learning Efficient Radiograph Representations from Text Reports

Automated analysis of chest radiography using deep learning has tremendo...
research
07/12/2023

Unified Medical Image-Text-Label Contrastive Learning With Continuous Prompt

Contrastive language-image Pre-training (CLIP) [13] can leverage large d...
research
04/17/2023

BenchMD: A Benchmark for Modality-Agnostic Learning on Medical Images and Sensors

Medical data poses a daunting challenge for AI algorithms: it exists in ...
