Free Energy–Based Reinforcement Learning Using Quantum Monte Carlo and Quantum Annealing

ORAL

Abstract

We investigate whether quantum or numerical sampling from select many-body systems can be used to improve upon classical methods in reinforcement learning. We introduce free energy–based reinforcement learning (FERL) as an application of such sampling methods. In our experiments, we focus on transverse field Ising spin Hamiltonians with layouts of qubits similar to that of deep Boltzmann machines (DBM), and use simulated quantum annealing (SQA) as a subroutine of a reinforcement learning framework. In the absence of a transverse field, the DBMs train more effectively than restricted Boltzmann machines with the same number of weights. We then develop a framework for training the network as a quantum Boltzmann machine (QBM) in the presence of a significant transverse field. This further improves the DBM-based reinforcement learning method. To perform physical experiments, we propose a method for processing a quantum annealer’s measured qubit spin configurations in approximating the free energy of a QBM. We then apply this method to perform reinforcement learning on the grid-world problem using the D-Wave 2000Q quantum annealer. The experimental results show that our technique is a promising method for harnessing the power of quantum sampling in reinforcement learning tasks.

Presenters

  • Pooya Ronagh

    Institute for Quantum Computing, University of Waterloo

Authors

  • Pooya Ronagh

    Institute for Quantum Computing, University of Waterloo

  • Anna Levit

    1QBit

  • Daniel Crawford

    1QBit

  • Navid Ghadermarzy

    Mathematics Department, University of British Columbia

  • Jaspreet Oberoi

    1QBit

  • Ehsan Zahedinejad

    1QBit