APS Logo

Algorithm for the replica redistribution for population annealing method

ORAL

Abstract

The population annealing method is one of the promising approaches for large scale simulations as potentially scalable on any parallel architecture. We present an implementation of the algorithm on the hybrid program architecture combining CUDA and MPI. The problem is to keep all general-purpose graphics processing unit devices as busy as possible, redistributing replicas efficiently. We provide details of the testing on Intel Skylake/Nvidia V100 based hardware running in parallel more than two million replicas of the Ising model sample. The results are quite optimistic because the acceleration grows toward the perfect line with the simulated system's growing complexity.

Presenters

  • Lev Shchur

    Landau ITP - Chernogolovka, Landau Institute for Theoretical Physics

Authors

  • Alexander Russkov

    Keldysh IAM

  • Roman Chulkevich

    HSE University

  • Lev Shchur

    Landau ITP - Chernogolovka, Landau Institute for Theoretical Physics