SidechainNet: An All-Atom Protein Structure Dataset for Machine Learning

10/16/2020
by   Jonathan E. King, et al.

Despite recent advancements in deep learning methods for protein structure prediction and representation, little focus has been directed at the simultaneous inclusion and prediction of protein backbone and sidechain structure information. We present SidechainNet, a new dataset that directly extends the ProteinNet dataset. SidechainNet includes angle and atomic coordinate information capable of describing all heavy atoms of each protein structure. In this paper, we first provide background information on the availability of protein structure data and the significance of ProteinNet. Thereafter, we argue for the potentially beneficial inclusion of sidechain information through SidechainNet, describe the process by which we organize SidechainNet, and provide a software package (https://github.com/jonathanking/sidechainnet) for data manipulation and training with machine learning models.
