ChemVA: Interactive Visual Analysis of Chemical Compound Similarity in Virtual Screening

08/30/2020
by   María Virginia Sabando, et al.
0

In the modern drug discovery process, medicinal chemists deal with the complexity of analysis of large ensembles of candidate molecules. Computational tools, such as dimensionality reduction (DR) and classification, are commonly used to efficiently process the multidimensional space of features. These underlying calculations often hinder interpretability of results and prevent experts from assessing the impact of individual molecular features on the resulting representations. To provide a solution for scrutinizing such complex data, we introduce ChemVA, an interactive application for the visual exploration of large molecular ensembles and their features. Our tool consists of multiple coordinated views: Hexagonal view, Detail view, 3D view, Table view, and a newly proposed Difference view designed for the comparison of DR projections. These views display DR projections combined with biological activity, selected molecular features, and confidence scores for each of these projections. This conjunction of views allows the user to drill down through the dataset and to efficiently select candidate compounds. Our approach was evaluated on two case studies of finding structurally similar ligands with similar binding affinity to a target protein, as well as on an external qualitative evaluation. The results suggest that our system allows effective visual inspection and comparison of different high-dimensional molecular representations. Furthermore, ChemVA assists in the identification of candidate compounds while providing information on the certainty behind different molecular representations.

READ FULL TEXT

page 5

page 6

page 8

page 9

research
06/28/2022

Feature Learning for Dimensionality Reduction toward Maximal Extraction of Hidden Patterns

Dimensionality reduction (DR) plays a vital role in the visual analysis ...
research
07/09/2019

Multiscale Visual Drilldown for the Analysis of Large Ensembles of Multi-Body Protein Complexes

When studying multi-body protein complexes, biochemists use computationa...
research
10/10/2019

A Multi-view Dimensionality Reduction Algorithm Based on Smooth Representation Model

Over the past few decades, we have witnessed a large family of algorithm...
research
06/29/2021

Interactive Dimensionality Reduction for Comparative Analysis

Finding the similarities and differences between groups of datasets is a...
research
01/18/2019

Tunable Approximations to Control Time-to-Solution in an HPC Molecular Docking Mini-App

The drug discovery process involves several tasks to be performed in viv...
research
08/26/2023

Class-constrained t-SNE: Combining Data Features and Class Probabilities

Data features and class probabilities are two main perspectives when, e....
research
05/14/2004

Interactive visualization of higher dimensional data in a multiview environment

We develop multiple view visualization of higher dimensional data. Our w...

Please sign up or login with your details

Forgot password? Click here to reset