High performance shallow water kernels for parallel overland flow simulations based on FullSWOF2D
Permanent link to this recordhttp://hdl.handle.net/10754/624957
MetadataShow full item record
AbstractWe describe code optimization and parallelization procedures applied to the sequential overland flow solver FullSWOF2D. Major difficulties when simulating overland flows comprise dealing with high resolution datasets of large scale areas which either cannot be computed on a single node either due to limited amount of memory or due to too many (time step) iterations resulting from the CFL condition. We address these issues in terms of two major contributions. First, we demonstrate a generic step-by-step transformation of the second order finite volume scheme in FullSWOF2D towards MPI parallelization. Second, the computational kernels are optimized by the use of templates and a portable vectorization approach. We discuss the load imbalance of the flux computation due to dry and wet cells and propose a solution using an efficient cell counting approach. Finally, scalability results are shown for different test scenarios along with a flood simulation benchmark using the Shaheen II supercomputer.
CitationWittmann R, Bungartz H-J, Neumann P (2017) High performance shallow water kernels for parallel overland flow simulations based on FullSWOF2D. Computers & Mathematics with Applications 74: 110–125. Available: http://dx.doi.org/10.1016/j.camwa.2017.01.005.
SponsorsWe thank for financial support by the Kompetenznetzwerk für Wissenschaftliches Höchstleistungsrechnen in Bayern (KONWIHR) for the Multicore-Software-Initiative with the project “Optimization of a multi-functional shallow water solver for complex overland flows” (KONWIHR-IV) and KAUST Supercomputing Laboratory for providing the access to supercomputer Shaheen 2 (project k1050). We thank Ralf-Peter Mundani and Florian Mintgen from the Department of Civil, Geo and Environmental Engineering of the Technical University of Munich for access to the Glasgow scenario datasets as well as for the fruitful discussions on this scenario.