FPScreen: A Rapid Similarity Search Tool for Massive Molecular Library Based on Molecular Fingerprint Comparison

06/13/2019
by   Lijun Wang, et al.
0

We designed a fast similarity search engine for large molecular libraries: FPScreen. We downloaded 100 million molecules' structure files in PubChem with SDF extension, then applied a computational chemistry tool RDKit to convert each structure file into one line of text in MACCS format and stored them in a text file as our molecule library. The similarity search engine compares the similarity while traversing the 166-bit strings in the library file line by line. FPScreen can complete similarity search through 100 million entries in our molecule library within one hour. That is very fast as a biology computation tool. Additionally, we divided our library into several strides for parallel processing. FPScreen was developed in WEB mode.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/01/2020

Pabulib: A Participatory Budgeting Library

We describe the PArticipatory BUdgeting LIBrary website (in short, Pabul...
research
08/18/2021

Generation of TypeScript Declaration Files from JavaScript Code

Developers are starting to write large and complex applications in TypeS...
research
09/13/2021

Optimizing FPGA-based Accelerator Design for Large-Scale Molecular Similarity Search

Molecular similarity search has been widely used in drug discovery to id...
research
11/14/2021

Unicode at Gigabytes per Second

We often represent text using Unicode formats (UTF-8 and UTF-16). The UT...
research
09/03/2021

IMG2SMI: Translating Molecular Structure Images to Simplified Molecular-input Line-entry System

Like many scientific fields, new chemistry literature has grown at a sta...
research
10/19/2022

An efficient graph generative model for navigating ultra-large combinatorial synthesis libraries

Virtual, make-on-demand chemical libraries have transformed early-stage ...
research
10/17/2017

DASHMM Accelerated Adaptive Fast Multipole Poisson-Boltzmann Solver on Distributed Memory Architecture

We present an updated version of the AFMPB package for fast calculation ...

Please sign up or login with your details

Forgot password? Click here to reset