Evaluation of the Intel Xeon Phi and NVIDIA K80 as accelerators for two-dimensional panel codes

11/06/2015
by   Lukas Einkemmer, et al.
0

To predict the properties of fluid flow over a solid geometry is an important engineering problem. In many applications so-called panel methods (or boundary element methods) have become the standard approach to solve the corresponding partial differential equation. Since panel methods in two dimensions are computationally cheap, they are well suited as the inner solver in an optimization algorithm. In this paper we evaluate the performance of the Intel Xeon Phi 7120 and the NVIDIA K80 to accelerate such an optimization algorithm. For that purpose, we have implemented an optimized version of the algorithm on the CPU and Xeon Phi (based on OpenMP, vectorization, and the Intel MKL library) and on the GPU (based on CUDA and the MAGMA library). We present timing results for all codes and discuss the similarities and differences between the three implementations. Overall, we observe a speedup of approximately 2.5 for adding a Intel Xeon Phi 7120 to a dual socket workstation and a speedup between 3 and 3.5 for adding a NVIDIA K80 to a dual socket workstation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/30/2021

Intel HEXL: Accelerating Homomorphic Encryption with Intel AVX512-IFMA52

Modern implementations of homomorphic encryption (HE) rely heavily on po...
research
04/24/2017

A Novel Hybrid Quicksort Algorithm Vectorized using AVX-512 on Intel Skylake

The modern CPU's design, which is composed of hierarchical memory and SI...
research
09/23/2020

Applying the Roofline model for Deep Learning performance optimizations

In this paper We present a methodology for creating Roofline models auto...
research
03/13/2021

An Application of the Virus Optimization Algorithm to the Problem of Finding Extremal Binary Self-Dual Codes

In this paper, a virus optimization algorithm, which is one of the metah...
research
07/01/2016

Using the pyMIC Offload Module in PyFR

PyFR is an open-source high-order accurate computational fluid dynamics ...
research
09/13/2017

OpenMP GNU and Intel Fortran programs for solving the time-dependent Gross-Pitaevskii equation

We present Open Multi-Processing (OpenMP) version of Fortran 90 programs...
research
10/15/2021

On Extending Amdahl's law to Learn Computer Performance

The problem of learning parallel computer performance is investigated in...

Please sign up or login with your details

Forgot password? Click here to reset