Dynamical Implications of Hyperparameters in Reinforcement Learning

Hyun Jin Kim; Daniel Shams; David Schwab

Dynamical Implications of Hyperparameters in Reinforcement Learning

ORAL

Abstract

Reinforcement learning has been gaining popularity within the physics research community over past several years. Many of the algorithms require selection of hyperparameters, and it is unclear how these choices affect the learned policy. Often in practice, researchers choose hyperparameters for their simulations using brute force methods such as exhaustive grid search or heuristics without fully understanding how they may alter the effective reward structure and value function landscape. Here, we investigate the possibility of using a nonlinear dynamics approach to guide our selection and adaptive optimization of appropriate hyperparameters that are better aligned to achieve particular aims of the agent. We then extend our investigation to the setting of multi-agent reinforcement learning and game theory to see how the choice of agents hyperparameters affect the interplay between each other in competitive, cooperative, and mixed settings.

March 4, 2019, 1:39 PM – March 4, 2019, 1:51 PM

Presenters

Hyun Jin Kim

Northwestern University

Authors

Hyun Jin Kim

Northwestern University
Daniel Shams

Northwestern University
David Schwab

The Graduate Center, City University of New York, City University of New York, Institute for Theoretical Science, CUNY Graduate Center