GPU-based Parallel Computation Support for Stan

07/01/2019
by   Rok Češnovar, et al.
0

This paper details an extensible OpenCL framework that allows Stan to utilize heterogeneous compute devices. It includes GPU-optimized routines for the Cholesky decomposition, its derivative, other matrix algebra primitives and some commonly used likelihoods, with more additions planned for the near future. Stan users can now benefit from large speedups offered by GPUs with little effort and without changes to their existing Stan code. We demonstrate the practical utility of our work with two examples - logistic regression and Gaussian Process regression.

READ FULL TEXT
research
05/17/2023

Optimization and Portability of a Fusion OpenACC-based FORTRAN HPC Code from NVIDIA to AMD GPUs

NVIDIA has been the main provider of GPU hardware in HPC systems for ove...
research
11/22/2022

Improved Multi-GPU parallelization of a Lagrangian Transport Model

This report highlights our work on improving GPU parallelization by supp...
research
08/19/2023

High Performance Computing Applied to Logistic Regression: A CPU and GPU Implementation Comparison

We present a versatile GPU-based parallel version of Logistic Regression...
research
08/06/2023

Bandicoot: C++ Library for GPU Linear Algebra and Scientific Computing

This report provides an introduction to the Bandicoot C++ library for li...
research
10/24/2017

Auto-Differentiating Linear Algebra

Development systems for deep learning, such as Theano, Torch, TensorFlow...
research
06/16/2023

Runtime Construction of Large-Scale Spiking Neuronal Network Models on GPU Devices

Simulation speed matters for neuroscientific research: this includes not...
research
09/17/2018

AlSub: Fully Parallel Subdivision for Modeling and Rendering

Mesh subdivision is a key geometric modeling task which forges smooth, s...

Please sign up or login with your details

Forgot password? Click here to reset