Dynamical Implications of Hyperparameters in Reinforcement Learning
ORAL
Abstract
Reinforcement learning has been gaining popularity within the physics research community over past several years. Many of the algorithms require selection of hyperparameters, and it is unclear how these choices affect the learned policy. Often in practice, researchers choose hyperparameters for their simulations using brute force methods such as exhaustive grid search or heuristics without fully understanding how they may alter the effective reward structure and value function landscape. Here, we investigate the possibility of using a nonlinear dynamics approach to guide our selection and adaptive optimization of appropriate hyperparameters that are better aligned to achieve particular aims of the agent. We then extend our investigation to the setting of multi-agent reinforcement learning and game theory to see how the choice of agents hyperparameters affect the interplay between each other in competitive, cooperative, and mixed settings.
–
Presenters
-
Hyun Jin Kim
Northwestern University
Authors
-
Hyun Jin Kim
Northwestern University
-
Daniel Shams
Northwestern University
-
David Schwab
The Graduate Center, City University of New York, City University of New York, Institute for Theoretical Science, CUNY Graduate Center