Two-Stream CNN with Loose Pair Training for Multi-modal AMD Categorization

07/28/2019
by Weisen Wang, et al.

This paper studies automated categorization of age-related macular degeneration (AMD) given a multi-modal input, which consists of a color fundus image and an optical coherence tomography (OCT) image of a specific eye. Previous work uses a traditional pipeline composed of feature extraction and classifier training, two stages that cannot be optimized jointly. By contrast, we propose a two-stream convolutional neural network (CNN) that is trained end-to-end. The CNN's fusion layer is tailored to fusing information from the fundus and OCT streams. To generate more multi-modal training instances, we introduce Loose Pair training, where a fundus image and an OCT image are paired based on class labels rather than eyes. Moreover, to visually interpret how the individual modalities contribute, we extend the class activation mapping technique to the multi-modal scenario. Experiments on a real-world dataset collected from an outpatient clinic support the viability of our proposal for multi-modal AMD categorization.
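The Loose Pair idea can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes each image is represented as a hypothetical `(image_id, label)` tuple, and the class names `"dry"` and `"wet"` and the function name `loose_pairs` are placeholders chosen for illustration. The sketch pairs every fundus image with every OCT image that carries the same class label, regardless of which eye either image came from.

```python
from collections import defaultdict


def loose_pairs(fundus, oct_scans):
    """Pair fundus and OCT images that share a class label.

    fundus, oct_scans: lists of (image_id, label) tuples (assumed format).
    Returns a list of (fundus_id, oct_id, label) triples.
    """
    # Index OCT images by label so each fundus image finds its partners quickly.
    oct_by_label = defaultdict(list)
    for oct_id, label in oct_scans:
        oct_by_label[label].append(oct_id)

    pairs = []
    for fundus_id, label in fundus:
        # Pair with every OCT image of the same class, ignoring eye identity.
        for oct_id in oct_by_label.get(label, []):
            pairs.append((fundus_id, oct_id, label))
    return pairs


# Hypothetical toy data: three fundus images and three OCT images.
fundus = [("f1", "dry"), ("f2", "wet"), ("f3", "dry")]
oct_scans = [("o1", "dry"), ("o2", "dry"), ("o3", "wet")]

pairs = loose_pairs(fundus, oct_scans)
for p in pairs:
    print(p)
```

With strict eye-based pairing, three fundus images and three OCT images yield at most three training instances. Under the label-based pairing sketched here, the same toy data yields five, which is the sense in which loose pairing enlarges the multi-modal training set.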

research
12/26/2016

Image-Text Multi-Modal Representation Learning by Adversarial Backpropagation

We present novel method for image-text multi-modal representation learni...
research
10/03/2018

Image and Encoded Text Fusion for Multi-Modal Classification

Multi-modal approaches employ data from multiple input streams such as t...
research
04/20/2019

Multi-modal gated recurrent units for image description

Using a natural language sentence to describe the content of an image is...
research
10/13/2020

A Multi-Modal Method for Satire Detection using Textual and Visual Cues

Satire is a form of humorous critique, but it is sometimes misinterprete...
research
12/20/2013

Learning Paired-associate Images with An Unsupervised Deep Learning Architecture

This paper presents an unsupervised multi-modal learning system that lea...
research
02/07/2020

Exploiting Temporal Coherence for Multi-modal Video Categorization

Multimodal ML models can process data in multiple modalities (e.g., vide...
research
03/20/2022

Multi-Modal Learning Using Physicians Diagnostics for Optical Coherence Tomography Classification

In this paper, we propose a framework that incorporates experts diagnost...
