Statistical Mechanics of Reinforcement Learning

ORAL  · Invited

Abstract

Reinforcement learning, the study of optimal decision-making over long timescales in stochastic systems, has recently seen remarkable advances due in large part to the efforts of the deep learning community. However, a theoretical framework to understand and develop the corresponding algorithms is lacking. We show that the reinforcement learning problem can be formulated and solved using statistical mechanics. Drawing on principles of invariance and duality, and using non-equilibrium tools such as large deviation theory, important problems in the field, such as inverse RL, reward shaping, and bounding the value function, can be addressed. In this talk, we describe this framework, showcasing its ability to solve open problems in average-reward reinforcement learning. Based on its broad applicability, we also discuss implications for RL applications in complex high-dimensional physical systems.

*This work is supported by the National Science Foundation under Cooperative Agreement PHY-2019786 (The NSF AI Institute for Artificial Intelligence and Fundamental Interactions, http://iaifi.org/).

Publication: Adamczyk, J., Kumaraswamy, R., MacGlashan, J., Wurman, P., Stone, P. "Td-based Centering for Continuing Tasks", In Preparation.
Adamczyk, J., Luo, D. "Inverse Reinforcement Learning via Free Energy-Entropy Duality", In Preparation.
Adamczyk, J., Kulkarni, R. "Statistical Mechanics of Generative Flow Networks", In Preparation.
Adamczyk, J., Makarenko, V., Tiomkin, S., & Kulkarni, R. V. (2025, May). Average-Reward Soft Actor-Critic. In Reinforcement Learning Conference.

Presenters

  • Jacob Adamczyk

    • University of Massachusetts Boston

Authors

  • Jacob Adamczyk

    • University of Massachusetts Boston
  • Rahul V Kulkarni

    • Department of Physics, University of Massachusetts Boston
    • University of Massachusetts Boston