Towards neural reinforcement learning for large deviations in nonequilibrium systems with memory

Abstract

We introduce a reinforcement learning method for a class of non-Markov systems; our approach extends the actor-critic framework given by Rose et al. [New J. Phys. 23 013013 (2021)] for obtaining scaled cumulant generating functions characterizing the fluctuations. The actor-critic is implemented using neural networks; a particular innovation in our method is the use of an additional neural policy for processing memory variables. We demonstrate results for current fluctuations in various memory-dependent models with special focus on semi-Markov systems where the dynamics is controlled by nonexponential interevent waiting time distributions.

0

Turn this paper into a full lesson

ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…