APS Logo

EFIT-AI Database Creation and Storage for Machine Learning

POSTER

Abstract

The desired application of Machine Learning (ML) and Data Science techniques to advance fusion research has motivated the need for high quality and open access databases. A EFIT-AI database has been assembled and curated: a collection of multiple equilibrium reconstructions for a variety of tokamak discharges. The database features the entirety of the 2019 DIII-D campaign (approx. 2500 discharges) and contains 3 different types of EFIT reconstructions: 1. with magnetics-only constraints; 2. with magnetics + MSE(Motional Stark Effect) constraints; and 3. with OMFIT-automated kinetic constraints (for a subset of shots). The database is currently being used for training of equilibrium reconstruction neural networks aimed at improving real time reconstructions. To incorporate Findable, Accessible, Interoperable, and Reusable (FAIR) data principles, the data was organized according to the ITER IMAS (Integrated Modeling and Analysis Suite) data schema (ontology), and then stored as self-descriptive HDF5 binary files that will be made publicly available. This mapping to IMAS was carried out within the OMFIT framework [https://omfit.io], leveraging the functionalities of the OMAS library [https://gafusion.github.io/omas]. A collection of validation scripts ensures the quality of the training dataset by verifying that all necessary data is available, the equilibria have low Grad-Shafranov equilibrium error, and the equilibrium reconstructions match experimental diagnostic signals. To track different versions of EFIT data generated for this database construction, this work utilized Git and the Open-source Data Version Control (DVC) system, which was developed intentionally for Data Science and Machine Learning projects. Work supported by the US Department of Energy DE-SC0021203, and DE-FC02-04ER54698.

Presenters

  • David Orozco

    General Atomics - San Diego, General Atomics

Authors

  • David Orozco

    General Atomics - San Diego, General Atomics

  • Scott E Kruger

    Tech-X Corp

  • Sterling P Smith

    General Atomics, General Atomics - San Diego

  • Lang L Lao

    General Atomics

  • Cihan Akcay

    General Atomics

  • Alexei Pankin

    Princeton Plasma Physics Laboratory

  • Torrin A Bechtel

    Oakridge Associate Universities