Genarris 2.0: A Random Structure Generator for Molecular Crystals
ORAL
Abstract
Genarris 2.0 is an open-source Python code, parallilized with mpi4py, that performs configuration space screening of molecular crystals by random structure generation. It may be used for generating initial populations to seed other structure search algorithms (such as genetic algorithms) or for generating datasets to train machine learning models. The target unit cell volume is estimated from the single molecule structure by a machine-learned model trained on data from the Cambridge Structural Database (CSD). Crystal structures are then generated in all space groups compatible with the requested number of molecules per cell (Z) with one molecule in the asymmetric unit (Zā=1), including those with special Wyckoff positions. To avoid unphysically close intermolecular distances, structures undergo a cascade of three increasingly rigorous checks. Special settings are applied for strong hydrogen bonds, which are automatically detected. Once an initial dataset of several thousand structures is generated, a smaller dataset may be selected based on quality and diversity criteria via user-defined workflows. For clustering Genarris uses the affinity propagation machine learning algorithm with a relative coordinate descriptor (RCD) or a radial symmetry function (RSF) representation.
ā
Presenters
-
Rithwik Tom
Carnegie Mellon University
Authors
-
Rithwik Tom
Carnegie Mellon University
-
Tim C Rose
Carnegie Mellon University
-
Imanuel Bier
Carnegie Mellon University
-
Harriet O'Brien
Carnegie Mellon University
-
Alvaro Vazquez-Mayagoitia
Argonne Leadership Computing Facility, Argonne National Laboratory, Argonne National Lab, Computational Science Division, Argonne National Laboratory
-
Noa Marom
Carnegie Mellon University, Carnegie Mellon Univ