Reinforcement Learning for ITER Burn Control*

Binnuo Liu; Vincent R Graber; Sai Tej Paruchuri; Eugenio Schuster

Reinforcement Learning for ITER Burn Control*

POSTER

Abstract

A reinforcement learning (RL)-based control algorithm is proposed for burn control in ITER tokamak plasmas. The RL agent, trained using the Deep Deterministic Policy Gradient (DDPG) algorithm, regulates ion and electron auxiliary heating alongside deuterium and tritium fueling. The primary objective is to accurately track dynamic target plasma states: ion energy density, electron energy density, total particle density, and tritium fraction. The controller's performance is rigorously assessed within a closed-loop simulation environment, which employs a nonlinear zero-dimensional two-temperature plasma model. This model meticulously separates ion and electron energy dynamics and incorporates key physical processes such as fusion heating, radiation losses, and ohmic heating, all parameterized with ITER-relevant conditions to accurately capture plasma behavior. A sophisticated tracking error-based reward function guides the RL agent's learning, enabling it to effectively follow both static and dynamic reference trajectories. The results provide strong evidence of the DDPG agent's capability to achieve precise burn regulation, further highlighting the significant potential of neural network-based controllers for advanced operational control in next-generation fusion reactors.

Presenters

Binnuo Liu

Lehigh University

Authors

Binnuo Liu

Lehigh University
Vincent R Graber

Lehigh University
Sai Tej Paruchuri

Lehigh University
Eugenio Schuster

Lehigh University