Self-Labeling of Fully Mediating Representations by Graph Alignment

03/25/2021
by   Martijn Oldenhof, et al.
0

To be able to predict a molecular graph structure (W) given a 2D image of a chemical compound (U) is a challenging problem in machine learning. We are interested to learn f: U → W where we have a fully mediating representation V such that f factors into U → V → W. However, observing V requires detailed and expensive labels. We propose graph aligning approach that generates rich or detailed labels given normal labels W. In this paper we investigate the scenario of domain adaptation from the source domain where we have access to the expensive labels V to the target domain where only normal labels W are available. Focusing on the problem of predicting chemical compound graphs from 2D images the fully mediating layer is represented using the planar embedding of the chemical graph structure we are predicting. The use of a fully mediating layer implies some assumptions on the mechanism of the underlying process. However if the assumptions are correct it should allow the machine learning model to be more interpretable, generalize better and be more data efficient at training time. The empirical results show that, using only 4000 data points, we obtain up to 4x improvement of performance after domain adaptation to target domain compared to pretrained model only on the source domain. After domain adaptation, the model is even able to detect atom types that were never seen in the original source domain. Finally, on the Maybridge data set the proposed self-labeling approach reached higher performance than the current state of the art.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/25/2021

Multi-source Few-shot Domain Adaptation

Multi-source Domain Adaptation (MDA) aims to transfer predictive models ...
research
12/02/2021

Source Free Unsupervised Graph Domain Adaptation

Graph Neural Networks (GNNs) have achieved great success on a variety of...
research
07/03/2023

Generating Reliable Pixel-Level Labels for Source Free Domain Adaptation

This work addresses the challenging domain adaptation setting in which k...
research
11/12/2020

Learning causal representations for robust domain adaptation

Domain adaptation solves the learning problem in a target domain by leve...
research
07/28/2020

Discrepancy Minimization in Domain Generalization with Generative Nearest Neighbors

Domain generalization (DG) deals with the problem of domain shift where ...
research
02/23/2020

ChemGrapher: Optical Graph Recognition of Chemical Compounds by Deep Learning

In drug discovery, knowledge of the graph structure of chemical compound...

Please sign up or login with your details

Forgot password? Click here to reset