Are VAEs Bad at Reconstructing Molecular Graphs?

05/04/2023
by   Hagen Muenkler, et al.
0

Many contemporary generative models of molecules are variational auto-encoders of molecular graphs. One term in their training loss pertains to reconstructing the input, yet reconstruction capabilities of state-of-the-art models have not yet been thoroughly compared on a large and chemically diverse dataset. In this work, we show that when several state-of-the-art generative models are evaluated under the same conditions, their reconstruction accuracy is surprisingly low, worse than what was previously reported on seemingly harder datasets. However, we show that improving reconstruction does not directly lead to better sampling or optimization performance. Failed reconstructions from the MoLeR model are usually similar to the inputs, assembling the same motifs in a different way, and possess similar chemical properties such as solubility. Finally, we show that the input molecule and its failed reconstruction are usually mapped by the different encoders to statistically distinguishable posterior distributions, hinting that posterior collapse may not fully explain why VAEs are bad at reconstructing molecular graphs.

READ FULL TEXT
research
06/17/2020

MoFlow: An Invertible Flow Model for Generating Molecular Graphs

Generating molecular graphs with desired chemical properties driven by d...
research
12/06/2021

Keeping it Simple: Language Models can learn Complex Molecular Distributions

Deep generative models of molecules have grown immensely in popularity, ...
research
06/08/2021

Augmenting Molecular Deep Generative Models with Topological Data Analysis Representations

Deep generative models have emerged as a powerful tool for learning info...
research
10/08/2017

Reconstruction of Hidden Representation for Robust Feature Extraction

This paper aims to develop a new and robust approach to feature represen...
research
07/02/2019

Generative Models for Automatic Chemical Design

Materials discovery is decisive for tackling urgent challenges related t...
research
05/28/2019

Anomaly scores for generative models

Reconstruction error is a prevalent score used to identify anomalous sam...

Please sign up or login with your details

Forgot password? Click here to reset