Space-Filling Curves as a Novel Crystal Structure Representation for Machine Learning Models

08/19/2016
by   Dipti Jasrasaria, et al.
0

A fundamental problem in applying machine learning techniques for chemical problems is to find suitable representations for molecular and crystal structures. While the structure representations based on atom connectivities are prevalent for molecules, two-dimensional descriptors are not suitable for describing molecular crystals. In this work, we introduce the SFC-M family of feature representations, which are based on Morton space-filling curves, as an alternative means of representing crystal structures. Latent Semantic Indexing (LSI) was employed in a novel setting to reduce sparsity of feature representations. The quality of the SFC-M representations were assessed by using them in combination with artificial neural networks to predict Density Functional Theory (DFT) single point, Ewald summed, lattice, and many-body dispersion energies of 839 organic molecular crystal unit cells from the Cambridge Structural Database that consist of the elements C, H, N, and O. Promising initial results suggest that the SFC-M representations merit further exploration to improve its ability to predict solid-state properties of organic crystal structures

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/07/2019

Transfer Learning Using Ensemble Neural Nets for Organic Solar Cell Screening

Organic Solar Cells are a promising technology for solving the clean ene...
research
11/14/2018

CheMixNet: Mixed DNN Architectures for Predicting Chemical Properties using Multiple Molecular Representations

SMILES is a linear representation of chemical structures which encodes t...
research
12/17/2020

Deep Molecular Dreaming: Inverse machine learning for de-novo molecular design and interpretability with surjective representations

Computer-based de-novo design of functional molecules is one of the most...
research
07/11/2020

Crystal Structure Representations for Machine Learning Models of Formation Energies

We introduce and evaluate a set of feature vector representations of cry...
research
12/15/2017

WACSF - Weighted Atom-Centered Symmetry Functions as Descriptors in Machine Learning Potentials

We introduce weighted atom-centered symmetry functions (wACSFs) as descr...
research
09/21/2022

A data-driven interpretation of the stability of molecular crystals

Due to the subtle balance of intermolecular interactions that govern str...
research
12/07/2022

Designing Feature Vector Representations: A case study from Chemistry

We present a case study investigating feature descriptors in the context...

Please sign up or login with your details

Forgot password? Click here to reset