Zermelo’s problem: Optimal point-to-point navigation in 2D turbulent flows using Reinforcement Learning

Michele Buzzicotti; Luca Biferale; Fabio Bonaccorso; Patricio Clark Di Leoni; Kristian Gustavsson

Zermelo’s problem: Optimal point-to-point navigation in 2D turbulent flows using Reinforcement Learning

ORAL

Abstract

To find the path that minimizes the time to navigate between two given points in a fluid flow is known as the Zermelo's problem. Here, we investigate it by using a Reinforcement Learning (RL) approach for the case of a vessel which has a slip velocity with fixed intensity, $V_{\rm s}$, but variable direction and navigating in a 2D turbulent sea. We use an Actor-Critic RL algorithm, and compare the results with strategies obtained analytically from continuous Optimal Navigation (ON) protocols. We show that for our application, ON solutions are unstable for the typical duration of the navigation process, and are therefore not useful in practice. On the other hand, RL solutions are much more robust with respect to small changes in the initial conditions and to external noise, and are able to find optimal trajectories even when $V_{\rm s}$ is much smaller than the maximum flow velocity. Furthermore, we show how the RL approach is able to take advantage of the flow properties in order to reach the target, especially when the steering speed is small.

Nov. 25, 2019, 8:16 PM – Nov. 25, 2019, 8:29 PM

Authors

Michele Buzzicotti

University of Rome Tor Vergata and INFN
Luca Biferale

University of Rome Tor Vergata and INFN
Fabio Bonaccorso

Center for Life Nano Science@La Sapienza, Istituto Italiano di Tecnologia
Patricio Clark Di Leoni

Dept. of Mechanical Engineering, Johns Hopkins University, Johns Hopkins University
Kristian Gustavsson

Dept. of Physics, University of Gothenburg, Department of Physics, Gothenburg University, Sweden