Porting an Explicit Time-Domain Volume Integral Equation Solver onto Multiple GPUs Using MPI and OpenACC

Abstract
A scalable parallelization algorithm for porting an explicit marching-on-in-time (MOT)-based time-domain volume integral equation (TDVIE) solver onto multiple GPUs is described. The algorithm uses MPI and OpenACC for efficient implementation. The MPI processes are responsible for synchronizing and communicating the distributed compute kernels of the MOT-TDVIE solver between the GPUs, with one MPI task assigned to each GPU. The OpenACC compiler directives are responsible for the data transfer and kernel offloading from the CPU to the GPU, and for kernel execution on the GPU. The speedups achieved relative to MPI/OpenMP code execution on multiple CPUs, together with the parallel efficiencies, are presented.

Index Terms: Explicit marching-on-in-time scheme, GPU, MPI, OpenACC, time-domain volume integral equation.

Journal
APPLIED COMPUTATIONAL ELECTROMAGNETICS SOCIETY JOURNAL

Additional Links
https://web.a.ebscohost.com/abstract?site=ehost&scope=site&jrnl=10544887&AN=129213316&h=JsJYyIajUivoDzVe6BObEQowNIcGKDhor8s91pC%2Besl1NQeUY87zMfZUBgi%2FQTWVkg8J2uaRAL2ok4ePagFiaQ%3D%3D&crl=c&resultLocal=ErrCrlNoResults&resultNs=Ehost&crlhashurl=login.aspx%3Fdirect%3Dtrue%26profile%3Dehost%26scope%3Dsite%26authtype%3Dcrawler%26jrnl%3D10544887%26AN%3D129213316
