Multi-modal data generation with a deep metric variational autoencoder

We present a deep metric variational autoencoder for multi-modal data generation. The variational autoencoder employs triplet loss in the latent space, which allows for conditional data generation by sampling in the latent space within each class cluster. The approach is evaluated on a multi-modal dataset consisting of otoscopy images of the tympanic membrane with corresponding wideband tympanometry measurements. The modalities in this dataset are correlated, as they represent different aspects of the state of the middle ear, but they do not present a direct pixel-to-pixel correlation. The approach shows promising results for the conditional generation of pairs of images and tympanograms, and will allow for efficient data augmentation of data from multi-modal sources.

READ FULL TEXT

page 2

page 5

research
11/01/2019

A Perceived Environment Design using a Multi-Modal Variational Autoencoder for learning Active-Sensing

This contribution comprises the interplay between a multi-modal variatio...
research
06/28/2021

Dizygotic Conditional Variational AutoEncoder for Multi-Modal and Partial Modality Absent Few-Shot Learning

Data augmentation is a powerful technique for improving the performance ...
research
07/13/2020

Drum Beats and Where To Find Them: Sampling Drum Patterns from a Latent Space

This paper presents a large dataset of drum patterns and compares two di...
research
03/18/2019

M^2VAE - Derivation of a Multi-Modal Variational Autoencoder Objective from the Marginal Joint Log-Likelihood

This work gives an in-depth derivation of the trainable evidence lower b...
research
12/06/2016

Learning Diverse Image Colorization

Colorization is an ambiguous problem, with multiple viable colorizations...
research
03/25/2019

Learning a Multi-Modal Policy via Imitating Demonstrations with Mixed Behaviors

We propose a novel approach to train a multi-modal policy from mixed dem...
research
05/17/2021

Synthesising Multi-Modal Minority Samples for Tabular Data

Real-world binary classification tasks are in many cases imbalanced, whe...

Please sign up or login with your details

Forgot password? Click here to reset