A Heterogeneous Parallel Non-von Neumann Architecture System for Accurate and Efficient Machine Learning Molecular Dynamics

03/26/2023
by   Zhuoying Zhao, et al.
0

This paper proposes a special-purpose system to achieve high-accuracy and high-efficiency machine learning (ML) molecular dynamics (MD) calculations. The system consists of field programmable gate array (FPGA) and application specific integrated circuit (ASIC) working in heterogeneous parallelization. To be specific, a multiplication-less neural network (NN) is deployed on the non-von Neumann (NvN)-based ASIC (SilTerra 180 nm process) to evaluate atomic forces, which is the most computationally expensive part of MD. All other calculations of MD are done using FPGA (Xilinx XC7Z100). It is shown that, to achieve similar-level accuracy, the proposed NvN-based system based on low-end fabrication technologies (180 nm) is 1.6x faster and 10^2-10^3x more energy efficiency than state-of-the-art vN based MLMD using graphics processing units (GPUs) based on much more advanced technologies (12 nm), indicating superiority of the proposed NvN-based heterogeneous parallel architecture.

READ FULL TEXT

page 1

page 5

page 7

research
12/13/2017

Reconfigurable Hardware Accelerators: Opportunities, Trends, and Challenges

With the emerging big data applications of Machine Learning, Speech Reco...
research
05/18/2023

Multi-Fidelity Machine Learning for Excited State Energies of Molecules

The accurate but fast calculation of molecular excited states is still a...
research
12/19/2022

PEZY-SC3: A MIMD Many-core Processor for Energy-efficient Computing

PEZY-SC3 is a highly energy- and area-efficient processor for supercompu...
research
03/25/2020

Overview of the IBM Neural Computer Architecture

The IBM Neural Computer (INC) is a highly flexible, re-configurable para...
research
06/15/2020

Efficient Ab-Initio Molecular Dynamic Simulations by Offloading Fast Fourier Transformations to FPGAs

A large share of today's HPC workloads is used for Ab-Initio Molecular D...
research
02/18/2018

Towards Ultra-High Performance and Energy Efficiency of Deep Learning Systems: An Algorithm-Hardware Co-Optimization Framework

Hardware accelerations of deep learning systems have been extensively in...
research
11/07/2020

Strawberry Detection Using a Heterogeneous Multi-Processor Platform

Over the last few years, the number of precision farming projects has in...

Please sign up or login with your details

Forgot password? Click here to reset