Acceleration of tensor-product operations for high-order finite element methods

11/02/2017
by   Kasia Świrydowicz, et al.
0

This paper is devoted to GPU kernel optimization and performance analysis of three tensor-product operators arising in finite element methods. We provide a mathematical background to these operations and implementation details. Achieving close-to-the-peak performance for these operators requires extensive optimization because of the operators' properties: low arithmetic intensity, tiered structure, and the need to store intermediate results inside the kernel. We give a guided overview of optimization strategies and we present a performance model that allows us to compare the efficacy of these optimizations against an empirically calibrated roofline.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset