A Directive-Based Heterogeneous SPH Solver for Portable Performance on CPUs and GPUs
ORAL
Abstract
Smoothed Particle Hydrodynamics (SPH) is powerful but computationally intensive. While GPU acceleration via low-level APIs like CUDA offers immense speedups, it often leads to high maintenance costs, reduced portability, and limitations in numerical precision (FP32). We present a high-performance SPH framework that achieves performance portability across multi-core CPUs and GPUs using directive-based programming (OpenACC and OpenMP). Our approach implements key algorithmic optimizations for neighbor searching and memory access, enabling directives to effectively parallelize the code with minimal refactoring. We demonstrate two orders of magnitude of speedups for double-precision (FP64) multiphase simulations on GPUs compared to optimized multi-core CPU runs. Furthermore, we showcase the framework's portability by successfully migrating the solver from OpenACC to OpenMP. This work provides a practical pathway for developing efficient, maintainable, and portable SPH codes for modern heterogeneous HPC architectures.
–
Presenters
-
Yongsuk Cho
Texas Tech University
Authors
-
Yongsuk Cho
Texas Tech University
-
Song-Charng Kong
Texas Tech University, Texas tech university