Lingo3DMol: Generation of a Pocket-based 3D Molecule using a Language Model

05/17/2023
by   Lvwei Wang, et al.
0

Structure-based drug design powered by deep generative models have attracted increasing research interest in recent years. Language models have demonstrated a robust capacity for generating valid molecules in 2D structures, while methods based on geometric deep learning can directly produce molecules with accurate 3D coordinates. Inspired by both methods, this article proposes a pocket-based 3D molecule generation method that leverages the language model with the ability to generate 3D coordinates. High quality protein-ligand complex data are insufficient; hence, a perturbation and restoration pre-training task is designed that can utilize vast amounts of small-molecule data. A new molecular representation, a fragment-based SMILES with local and global coordinates, is also presented, enabling the language model to learn molecular topological structures and spatial position information effectively. Ultimately, CrossDocked and DUD-E dataset is employed for evaluation and additional metrics are introduced. This method achieves state-of-the-art performance in nearly all metrics, notably in terms of binding patterns, drug-like properties, rational conformations, and inference speed. Our model is available as an online service to academic users via sw3dmg.stonewise.cn

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/17/2021

Learning to design drug-like molecules in three-dimensional space using deep generative models

Recently, deep generative models for molecular graphs are gaining more a...
research
02/28/2020

A Deep Generative Model for Fragment-Based Molecule Generation

Molecule generation is a challenging open problem in cheminformatics. Cu...
research
12/06/2021

Keeping it Simple: Language Models can learn Complex Molecular Distributions

Deep generative models of molecules have grown immensely in popularity, ...
research
02/03/2022

Direct Molecular Conformation Generation

Molecular conformation generation aims to generate three-dimensional coo...
research
06/26/2023

CoarsenConf: Equivariant Coarsening with Aggregated Attention for Molecular Conformer Generation

Molecular conformer generation (MCG) is an important task in cheminforma...
research
09/02/2022

Exploiting Pretrained Biochemical Language Models for Targeted Drug Design

Motivation: The development of novel compounds targeting proteins of int...

Please sign up or login with your details

Forgot password? Click here to reset