Particle-in-cell Simulations with Charge-Conserving Current Deposition on Graphic Processing Units
POSTER
Abstract
We present an implementation of a fully relativistic, electromagnetic PIC code, with charge-conserving current deposition, on graphics processing units (GPUs) with NVIDIA's massively multithreaded computing architecture CUDA. A particle-based computation thread assignment was used in the current deposition scheme and write conflicts among the threads were resolved by a thread racing technique. A parallel particle sorting scheme was also developed and used. The implementation took advantage of fast on-chip shared memory. The 2D implementation achieved a one particle-step process time of 2.28 ns for cold plasma runs and 8.53 ns for extremely relativistic plasma runs on a GTX 280 graphic card, which were respectively 90 and 29 times faster than a single threaded state-of-art CPU code. A comparable speedup was also achieved for the 3D implementation.
Authors
-
Xianglong Kong
University of Rochester
-
Michael Huang
University of Rochester
-
Chuang Ren
University of Rochester
-
Viktor Decyk
UCLA Plasma Simulation Group, University of California Los Angeles, UCLA, University of California, Los Angeles