Anonymous Pattern Molecular Fingerprint and its Applications on Property Identification

01/04/2023
by   Xue Liu, et al.
0

Molecular fingerprints are significant cheminformatics tools to map molecules into vectorial space according to their characteristics in diverse functional groups, atom sequences, and other topological structures. In this paper, we set out to investigate a novel molecular fingerprint Anonymous-FP that possesses abundant perception about the underlying interactions shaped in small, medium, and large molecular scale links. In detail, the possible inherent atom chains are sampled from each molecule and are extended in a certain anonymous pattern. After that, the molecular fingerprint Anonymous-FP is encoded in virtue of the Natural Language Processing technique PV-DBOW. Anonymous-FP is studied on molecular property identification and has shown valuable advantages such as rich information content, high experimental performance, and full structural significance. During the experimental verification, the scale of the atom chain or its anonymous manner matters significantly to the overall representation ability of Anonymous-FP. Generally, the typical scale r = 8 enhances the performance on a series of real-world molecules, and specifically, the accuracy could level up to above 93% on all NCI datasets.

READ FULL TEXT

page 2

page 3

page 4

page 6

page 7

page 8

page 10

page 12

research
11/16/2022

Molecular Fingerprints for Robust and Efficient ML-Driven Molecular Generation

We propose a novel molecular fingerprint-based variational autoencoder a...
research
06/17/2021

Do Large Scale Molecular Language Representations Capture Important Structural Information?

Predicting chemical properties from the structure of a molecule is of gr...
research
06/21/2023

Interactive Molecular Discovery with Natural Language

Natural language is expected to be a key medium for various human-machin...
research
03/02/2016

Molecular Graph Convolutions: Moving Beyond Fingerprints

Molecular "fingerprints" encoding structural information are the workhor...
research
12/24/2019

TF3P: Three-dimensional Force Fields Fingerprint Learned by Deep Capsular Network

Molecular fingerprints are the workhorse in ligand-based drug discovery....
research
07/07/2021

Information-theoretic characterization of the complete genotype-phenotype map of a complex pre-biotic world

How information is encoded in bio-molecular sequences is difficult to qu...
research
03/17/2023

QUBO-inspired Molecular Fingerprint for Chemical Property Prediction

Molecular fingerprints are widely used for predicting chemical propertie...

Please sign up or login with your details

Forgot password? Click here to reset