Boosting the Performance of Object Tracking with a Half-Precision Particle Filter on GPU

08/01/2023
by Gabin Schieffer, et al.

High-performance GPU-accelerated particle filter methods are critical for object detection applications ranging from autonomous driving and robot localization to time-series prediction. In this work, we investigate the design, development, and optimization of particle filters using half precision on CUDA cores, and compare their performance and accuracy with single- and double-precision baselines on Nvidia V100, A100, A40, and T4 GPUs. To mitigate numerical instability and precision loss, we introduce algorithmic changes in the particle filters. Using half precision yields a performance improvement of 1.5-2x over the single-precision baseline and 2.5-4.6x over the double-precision baseline, at the cost of a relatively small loss of accuracy.
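
The abstract does not say which algorithmic changes the authors made, so the following is only an illustrative sketch of why half precision can destabilize a particle filter, not the paper's method. It emulates IEEE 754 binary16 rounding on the CPU with Python's `struct` module and shows one common generic remedy: subtracting the largest log-likelihood before exponentiating, so that particle weights do not all underflow to zero. The helper names (`to_half`, `naive_weights`, `shifted_weights`) and the example log-likelihood values are assumptions made for illustration.

```python
import math
import struct


def to_half(x: float) -> float:
    """Round a Python float to the nearest IEEE 754 binary16 value (emulation)."""
    try:
        return struct.unpack("<e", struct.pack("<e", x))[0]
    except OverflowError:
        # Magnitudes beyond the half-precision range become infinity.
        return math.copysign(math.inf, x)


def naive_weights(log_liks):
    """Exponentiate log-likelihoods directly, rounding each result to half precision.

    Returns the unnormalized weights and their half-precision sum.
    """
    weights = [to_half(math.exp(ll)) for ll in log_liks]
    total = 0.0
    for w in weights:
        total = to_half(total + w)
    return weights, total


def shifted_weights(log_liks):
    """Subtract the maximum log-likelihood first, then normalize in half precision.

    The largest weight becomes exactly 1.0, so the sum cannot underflow to zero.
    """
    m = max(log_liks)
    weights = [to_half(math.exp(to_half(ll - m))) for ll in log_liks]
    total = 0.0
    for w in weights:
        total = to_half(total + w)
    return [to_half(w / total) for w in weights]


if __name__ == "__main__":
    # Hypothetical log-likelihoods for three particles (illustrative values only).
    log_liks = [-20.0, -21.0, -22.0]
    print("naive:", naive_weights(log_liks))
    print("shifted:", shifted_weights(log_liks))
```

With these example inputs, every directly exponentiated weight is smaller than the smallest positive half-precision value and rounds to zero, so normalization would divide by zero. The shifted version keeps the relative weights representable. A GPU implementation would use native half-precision types rather than this CPU emulation.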


research
03/11/2018

NVIDIA Tensor Core Programmability, Performance & Precision

The NVIDIA Volta GPU microarchitecture introduces a specialized unit, ca...
research
08/10/2020

sputniPIC: an Implicit Particle-in-Cell Code for Multi-GPU Systems

Large-scale simulations of plasmas are essential for advancing our under...
research
04/23/2021

tcFFT: Accelerating Half-Precision FFT through Tensor Cores

Fast Fourier Transform (FFT) is an essential tool in scientific and engi...
research
07/18/2021

Feedback Particle Filter With Stochastically Perturbed Innovation And Its Application to Dual Estimation

In this paper, we introduce a stochastically perturbed feedback particle...
research
12/11/2019

High Accuracy Low Precision QR Factorization and Least Square Solver on GPU with TensorCore

Driven by the insatiable needs to process ever larger amount of data wit...
research
12/18/2022

High-Performance Filters For GPUs

Filters approximately store a set of items while trading off accuracy fo...
research
09/15/2023

Speeding up the GENGA N-body integrator on consumer-grade graphics cards

GPU computing is popular due to the calculation potential of a single ca...
