Model-Free Non-Stationarity Detection and Adaptation in Reinforcement Learning

被引:7
|
作者
Canonaco, Giuseppe [1 ]
Restelli, Marcello [1 ]
Roveri, Manuel [1 ]
机构
[1] Politecn Milan, Milan, Italy
关键词
D O I
10.3233/FAIA200200
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In most Reinforcement Learning (RL) studies, the considered task is assumed to be stationary, i.e., it does not change its behavior or its characteristics over time, as this allows to generate all the convergence properties of RL techniques. Unfortunately, this assumption does not hold in real-world scenarios where systems and environments typically evolve over time. For instance, in robotic applications, sensor or actuator faults would induce a sudden change in the RL settings, while in financial applications the evolution of the market can cause a more gradual variation over time. In this paper, we present an adaptive RL algorithm able to detect changes in the environment or in the reward function and react to these changes by adapting to the new conditions of the task. At first, we develop a figure of merit onto which a hypothesis test can be applied to detect changes between two different learning iterations. Then, we extended this test to sequentially operate over time by means of the CUmulative SUM (CUSUM) approach. Finally, the proposed changedetection mechanism is combined (following an adaptive-active approach) with a well known RL algorithm to make it able to deal with non-stationary tasks. We tested the proposed algorithm on two well-known continuous-control tasks to check its effectiveness in terms of non-stationarity detection and adaptation over a vanilla RL algorithm.
引用
收藏
页码:1047 / 1054
页数:8
相关论文
共 50 条
  • [31] Robotic Table Tennis with Model-Free Reinforcement Learning
    Gao, Wenbo
    Graesser, Laura
    Choromanski, Krzysztof
    Song, Xingyou
    Lazic, Nevena
    Sanketi, Pannag
    Sindhwani, Vikas
    Jaitly, Navdeep
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 5556 - 5563
  • [32] MODEL-FREE ONLINE REINFORCEMENT LEARNING OF A ROBOTIC MANIPULATOR
    Sweafford, Jerry, Jr.
    Fahimi, Farbod
    MECHATRONIC SYSTEMS AND CONTROL, 2019, 47 (03): : 136 - 143
  • [33] Dealing with multiple experts and non-stationarity in inverse reinforcement learning: an application to real-life problems
    Amarildo Likmeta
    Alberto Maria Metelli
    Giorgia Ramponi
    Andrea Tirinzoni
    Matteo Giuliani
    Marcello Restelli
    Machine Learning, 2021, 110 : 2541 - 2576
  • [34] Combined model-free and model-sensitive reinforcement learning in non-human primates
    Miranda, Bruno
    Malalasekera, W. M. Nishantha
    Behrens, Timothy E.
    Dayan, Peter
    Kennerley, Steven W.
    PLOS COMPUTATIONAL BIOLOGY, 2020, 16 (06)
  • [35] Dealing with multiple experts and non-stationarity in inverse reinforcement learning: an application to real-life problems
    Likmeta, Amarildo
    Metelli, Alberto Maria
    Ramponi, Giorgia
    Tirinzoni, Andrea
    Giuliani, Matteo
    Restelli, Marcello
    MACHINE LEARNING, 2021, 110 (09) : 2541 - 2576
  • [36] SIMPLE-MODEL OF NON-STATIONARITY OF SYSTEMATIC RISK
    BRENNER, M
    SMIDT, S
    JOURNAL OF FINANCE, 1977, 32 (04): : 1081 - 1092
  • [37] Detection of non-stationarity in precipitation extremes using a max-stable process model
    Westra, Seth
    Sisson, Scott A.
    JOURNAL OF HYDROLOGY, 2011, 406 (1-2) : 119 - 128
  • [38] Model-free learning control of neutralization processes using reinforcement learning
    Syafiie, S.
    Tadeo, F.
    Martinez, E.
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2007, 20 (06) : 767 - 782
  • [39] Beyond Cumulative Sum Charting in Non-Stationarity Detection and Estimation
    Zhan, Felix
    Martinez, Anthony
    Rai, Nilab
    Mcconnell, Richard
    Swan, Matthew
    Bhaduri, Moinak
    Zhan, Justin
    Gewali, Laxmi
    Oh, Paul
    IEEE ACCESS, 2019, 7 : 140860 - 140874
  • [40] Model-based and Model-free Reinforcement Learning for Visual Servoing
    Farahmand, Amir Massoud
    Shademan, Azad
    Jagersand, Martin
    Szepesvari, Csaba
    ICRA: 2009 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-7, 2009, : 4135 - 4142