"Found in Translation": Predicting Outcomes of Complex Organic Chemistry Reactions using Neural Sequence-to-Sequence Models

11/13/2017
by   Philippe Schwaller, et al.
0

There is an intuitive analogy of an organic chemist's understanding of a compound and a language speaker's understanding of a word. Consequently, it is possible to introduce the basic concepts and analyze potential impacts of linguistic analysis to the world of organic chemistry. In this work, we cast the reaction prediction task as a translation problem by introducing a template-free sequence-to-sequence model, trained end-to-end and fully data-driven. We propose a novel way of tokenization, which is arbitrarily extensible with reaction information. With this approach, we demonstrate results superior to the state-of-the-art solution by a significant margin on the top-1 accuracy. Specifically, our approach achieves an accuracy of 80.1 without relying on auxiliary knowledge such as reaction templates. Also, 66.4 accuracy is reached on a larger and noisier dataset.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/06/2017

Retrosynthetic reaction prediction using neural sequence-to-sequence models

We describe a fully data driven model that learns to perform a retrosynt...
research
09/13/2017

Predicting Organic Reaction Outcomes with Weisfeiler-Lehman Network

The prediction of organic reaction outcomes is a fundamental problem in ...
research
01/21/2019

Chemical Names Standardization using Neural Sequence to Sequence Model

Chemical information extraction is to convert chemical knowledge in text...
research
08/09/2021

ChemiRise: a data-driven retrosynthesis engine

We have developed an end-to-end, retrosynthesis system, named ChemiRise,...
research
11/29/2019

Neural Chinese Word Segmentation as Sequence to Sequence Translation

Recently, Chinese word segmentation (CWS) methods using neural networks ...
research
06/27/2020

Molecule Edit Graph Attention Network: Modeling Chemical Reactions as Sequences of Graph Edits

One of the key challenges in automated synthesis planning is to generate...
research
01/29/2022

Retroformer: Pushing the Limits of Interpretable End-to-end Retrosynthesis Transformer

Retrosynthesis prediction is one of the fundamental challenges in organi...

Please sign up or login with your details

Forgot password? Click here to reset