GPU-acceleration of the Discontinuous Galerkin Shallow Water Equations Model (DG-SWEM) with OpenACC
Abstract
This paper presents a porting of DG-SWEM, a first-order discontinuous Galerkin solver for storm surge based on the Advanced Circulation Model (ADCIRC), to NVIDIA GPUs. Time-explicit discontinuous Galerkin methods contain a large number of degrees of freedom but have been shown to exhibit a large amount of data parallelism due to the loose coupling between elements, and thus are naturally mapped to the GPU architecture. A previous framework in porting DG-SWEM to GPUs required converting subroutines from Fortran to C++ to be used with CUDA C++. By using OpenACC and Unified Memory, we simplify the porting process and maintain a single codebase for both CPU and GPU versions. We test the code using a large Hurricane Harvey scenario on NVIDIA's Grace Hopper chip, and compare the GPU code's performance on multiple H200 nodes to the CPU version on the same amount of Grace CPU nodes.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.