The Megopolis Resampler: Memory Coalesced Resampling on GPUs

09/28/2021
by   Joshua A. Chesser, et al.
0

The resampling process employed in widely used methods such as Importance Sampling (IS), with its adaptive extension (AIS), are used to solve challenging problems requiring approximate inference; for example, non-linear, non-Gaussian state estimation problems. However, the re-sampling process can be computationally prohibitive for practical problems with real-time requirements. We consider the problem of developing highly parallelisable resampling algorithms for massively parallel hardware architectures of modern graphics processing units (GPUs) to accomplish real-time performance. We develop a new variant of the Metropolis algorithm – Megopolis – that improves performance without requiring a tuning parameter or reducing resampling quality. The Megopolis algorithm is built upon exploiting the memory access patterns of modern GPU units to reduce the number of memory transactions without the need for tuning parameters. Extensive numerical experiments on GPU hardware demonstrate that the proposed Megopolis algorithm is numerically stable and outperforms the original Metropolis algorithm and its variants – Metropolis-C1 and Metropolis-C2 – in speed and quality metrics. Further, given the absence of open tools in this domain and facilitating fair comparisons in the future and supporting the signal processing community, we also open source the complete project, including a repository of source code with Megopolis and all other comparison methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/12/2018

Effective Implementation of GPU-based Revised Simplex algorithm applying new memory management and cycle avoidance strategies

Graphics Processing Units (GPUs) with high computational capabilities us...
research
01/21/2019

Efficient Tsunami Modeling on Adaptive Grids with Graphics Processing Units (GPUs)

Solving the shallow water equations efficiently is critical to the study...
research
10/21/2021

Vortex: Extending the RISC-V ISA for GPGPU and 3D-GraphicsResearch

The importance of open-source hardware and software has been increasing....
research
06/20/2018

A scalable H-matrix approach for the solution of boundary integral equations on multi-GPU clusters

In this work, we consider the solution of boundary integral equations by...
research
02/10/2021

Using hardware performance counters to speed up autotuning convergence on GPUs

Nowadays, GPU accelerators are commonly used to speed up general-purpose...
research
06/09/2021

Benchmarking the Nvidia GPU Lineage: From Early K80 to Modern A100 with Asynchronous Memory Transfers

For many, Graphics Processing Units (GPUs) provides a source of reliable...
research
09/30/2016

GPU Acceleration of Hermite Methods for the Simulation of Wave Propagation

The Hermite methods of Goodrich, Hagstrom, and Lorenz (2006) use Hermite...

Please sign up or login with your details

Forgot password? Click here to reset