Performance Optimization of the XGC code on KNL architecture
POSTER
Abstract
Gyrokinetic particle-in-cell (PIC) simulations have played an important role to the fundamental understanding of plasma edge turbulence, which is one of the barriers to a successful tokamak operation. In order to fully resolve the Vlasov transport equation the ion and electron species must be treated separately due to their different velocities. The electron push for kinetic electrons occurs many times per ion time step, becoming a dominant consumer of computer time. This makes the electron push a prime candidate for vectorization using modern computer architectures, such as Intel Knights Landing architecture used in many high performance computing settings. Such improvements include the proper selection of Array of Structures or Structure of Arrays datatypes, the optimal combination of nested openMP and vector processing, and the optimization of the memory access pattern within the electron push kernel. Results demonstrating the performance improvement of the electron push kernel on such architectures will be presented.
Presenters
-
Brian MacKie-Mason
Argonne Natl Lab
Authors
-
Brian MacKie-Mason
Argonne Natl Lab
-
Paulius Velesko
Intel Corp.
-
Robert Hager
Princeton Plasma Physics Laboratory, Princeton Plasma Physics Lab, Princeton Plasma Physics Laboratory, Princeton, NJ 08543-451, USA, Princeton Plasma Phys Lab
-
Choong Seock Chang
Princeton Plasma Physics Laboratory, Princeton Plasma Physics Lab, Princeton Plasma Phys Laboratory, Princeton Plasma Phys Lab, Princeton Plasma Physics Laboratory, Princeton, NJ 08543-451, USA
-
Timothy J. Williams
Argonne Natl Lab