Deep Archetypal Analysis

01/30/2019
by Sebastian Mathias Keller, et al.
"Deep Archetypal Analysis" generates latent representations of high-dimensional datasets in terms of fractions of intuitively understandable basic entities called archetypes. The proposed method is an extension of linear "Archetypal Analysis" (AA), an unsupervised method that represents multivariate data points as sparse convex combinations of extremal elements of the dataset. Unlike the original formulation of AA, "Deep AA" can also handle side information and supports data-driven representation learning, which reduces the dependence on expert knowledge. Our method is motivated by studies of evolutionary trade-offs in biology, where archetypes are species highly adapted to a single task. Along these lines, we demonstrate that "Deep AA" also lends itself to the supervised exploration of chemical space, marking a distinct starting point for de novo molecular design. In the unsupervised setting, we show how "Deep AA" can be used on CelebA to identify archetypal faces. These can then be superimposed to generate new faces that inherit the dominant traits of the archetypes they are based on.
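
The convex-combination idea behind linear AA can be illustrated with a small sketch. It is not the paper's deep model. It assumes the archetypes are already known and only solves for the mixture weights of one data point, using projected gradient descent onto the probability simplex (a technique chosen here for illustration). The names `project_to_simplex`, `convex_weights`, `Z`, and `x` are hypothetical and do not come from the paper.

```python
import numpy as np

def project_to_simplex(v):
    """Euclidean projection of v onto {a : a >= 0, sum(a) = 1}."""
    n = v.size
    u = np.sort(v)[::-1]                      # sort in descending order
    css = np.cumsum(u) - 1.0
    idx = np.arange(1, n + 1)
    rho = np.nonzero(u - css / idx > 0)[0][-1]
    theta = css[rho] / (rho + 1)
    return np.maximum(v - theta, 0.0)

def convex_weights(x, Z, steps=500):
    """Minimize ||x - a @ Z||^2 over the simplex (assumed toy solver)."""
    k = Z.shape[0]
    a = np.full(k, 1.0 / k)                   # start at the uniform mixture
    # Step size 1/L, where L = 2 * ||Z Z^T||_2 bounds the gradient's Lipschitz constant.
    lr = 1.0 / (2.0 * np.linalg.norm(Z @ Z.T, 2))
    for _ in range(steps):
        grad = 2.0 * (a @ Z - x) @ Z.T
        a = project_to_simplex(a - lr * grad)
    return a

# Toy archetypes: the three corners of a triangle in 2-D (illustrative data).
Z = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]])
x = np.array([0.2, 0.3])                      # a point inside the triangle

a = convex_weights(x, Z)                      # fractions of each archetype
reconstruction = a @ Z                        # convex combination of archetypes
```

In this toy setting the weights are non-negative and sum to one, so each entry of `a` can be read as the fraction of the corresponding archetype. Superimposing archetypes to create a new sample corresponds to choosing a weight vector on the simplex and evaluating `a @ Z`; in the deep variant this mixing would take place in a learned latent space rather than in the input space.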
