Finding the ground state of spin Hamiltonians with reinforcement learning

被引:21
|
作者
Mills, Kyle [1 ,2 ,3 ]
Ronagh, Pooya [1 ,4 ,5 ]
Tamblyn, Isaac [2 ,3 ,6 ]
机构
[1] 1QB Informat Technol 1QBit, Vancouver, BC, Canada
[2] Univ Ontario Inst Technol, Oshawa, ON, Canada
[3] Vector Inst Artificial Intelligence, Toronto, ON, Canada
[4] Inst Quantum Comp IQC, Waterloo, ON, Canada
[5] Univ Waterloo, Dept Phys & Astron, Waterloo, ON, Canada
[6] Natl Res Council Canada, Ottawa, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
QUANTUM; OPTIMIZATION; MODEL; ALGORITHM; GO; EFFICIENT; GAME;
D O I
10.1038/s42256-020-0226-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning has become a popular method in various domains, for problems where an agent must learn what actions must be taken to reach a particular goal. An interesting example where the technique can be applied is simulated annealing in condensed matter physics, where a procedure is determined for slowly cooling a complex system to its ground state. A reinforcement learning approach has been developed that can learn a temperature scheduling protocol to find the ground state of spin glasses, magnetic systems with strong spin-spin interactions between neighbouring atoms. Reinforcement learning (RL) has become a proven method for optimizing a procedure for which success has been defined, but the specific actions needed to achieve it have not. Using a method we call 'controlled online optimization learning' (COOL), we apply the so-called 'black box' method of RL to simulated annealing (SA), demonstrating that an RL agent based on proximal policy optimization can, through experience alone, arrive at a temperature schedule that surpasses the performance of standard heuristic temperature schedules for two classes of Hamiltonians. When the system is initialized at a cool temperature, the RL agent learns to heat the system to 'melt' it and then slowly cool it in an effort to anneal to the ground state; if the system is initialized at a high temperature, the algorithm immediately cools the system. We investigate the performance of our RL-driven SA agent in generalizing to all Hamiltonians of a specific class. When trained on random Hamiltonians of nearest-neighbour spin glasses, the RL agent is able to control the SA process for other Hamiltonians, reaching the ground state with a higher probability than a simple linear annealing schedule. Furthermore, the scaling performance (with respect to system size) of the RL approach is far more favourable, achieving a performance improvement of almost two orders of magnitude onL= 14(2)systems. We demonstrate the robustness of the RL approach when the system operates in a 'destructive observation' mode, an allusion to a quantum system where measurements destroy the state of the system. The success of the RL agent could have far-reaching impacts, from classical optimization, to quantum annealing and to the simulation of physical systems.
引用
收藏
页码:509 / 517
页数:9
相关论文
共 50 条
  • [31] Constructing realistic effective spin Hamiltonians with machine learning approaches
    Li, Xue-Yang
    Lou, Feng
    Gong, Xin-Gao
    Xiang, Hongjun
    NEW JOURNAL OF PHYSICS, 2020, 22 (05):
  • [32] Learning a spin glass: Determining Hamiltonians from metastable states
    Kuva, SM
    Kinouchi, O
    Caticha, N
    PHYSICA A, 1998, 257 (1-4): : 28 - 35
  • [33] Reinforcement learning state estimator
    Morimoto, Jun
    Doya, Kenji
    NEURAL COMPUTATION, 2007, 19 (03) : 730 - 756
  • [34] NOTE ON SPIN HAMILTONIANS
    STEVENS, KWH
    JOURNAL OF PHYSICS PART C SOLID STATE PHYSICS, 1970, 3 (12): : 2387 - &
  • [35] GROUND-STATE OF A SPIN GLASS
    EDWARDS, SF
    JOURNAL OF PHYSICS F-METAL PHYSICS, 1976, 6 (10): : 1923 - 1926
  • [36] Ground state structure of spin glasses
    Palassini, M
    Young, AP
    JOURNAL OF THE PHYSICAL SOCIETY OF JAPAN, 2000, 69 : 165 - 168
  • [37] Ground state entanglement in spin systems
    Yano, H
    Nishimori, H
    PROGRESS OF THEORETICAL PHYSICS SUPPLEMENT, 2005, (157): : 164 - 167
  • [38] Spin spiral ground state of γ-iron
    Knöpfle, K
    Sandratskii, LM
    Kübler, J
    PHYSICAL REVIEW B, 2000, 62 (09) : 5564 - 5569
  • [39] Ground-state spin logic
    Whitfield, J. D.
    Faccin, M.
    Biamonte, J. D.
    EPL, 2012, 99 (05)
  • [40] Spin-liquid ground state
    David Ciudad
    Nature Materials, 2016, 15 (1) : 3 - 3