Topology-Driven Generative Completion of Lacunae in Molecular Data

07/29/2022
by   Dmitry Yu. Zubarev, et al.
0

We introduce an approach to the targeted completion of lacunae in molecular data sets which is driven by topological data analysis, such as Mapper algorithm. Lacunae are filled in using scaffold-constrained generative models trained with different scoring functions. The approach enables addition of links and vertices to the skeletonized representations of the data, such as Mapper graph, and falls in the broad category of network completion methods. We illustrate application of the topology-driven data completion strategy by creating a lacuna in the data set of onium cations extracted from USPTO patents, and repairing it.

READ FULL TEXT
research
06/08/2021

Augmenting Molecular Deep Generative Models with Topological Data Analysis Representations

Deep generative models have emerged as a powerful tool for learning info...
research
03/14/2023

Automated patent extraction powers generative modeling in focused chemical spaces

Deep generative models have emerged as an exciting avenue for inverse mo...
research
12/18/2022

Internal Diverse Image Completion

Image completion is widely used in photo restoration and editing applica...
research
02/07/2023

Graph Generation with Destination-Driven Diffusion Mixture

Generation of graphs is a major challenge for real-world tasks that requ...
research
06/22/2023

Molecular geometric deep learning

Geometric deep learning (GDL) has demonstrated huge power and enormous p...
research
01/30/2023

Can Persistent Homology provide an efficient alternative for Evaluation of Knowledge Graph Completion Methods?

In this paper we present a novel method, Knowledge Persistence (𝒦𝒫), for...
research
02/05/2020

Completing Simple Valuations in K-categories

We prove that Keimel and Lawson's K-completion Kc of the simple valuatio...

Please sign up or login with your details

Forgot password? Click here to reset