META-GRADIENTS IN NON-STATIONARY ENVIRONMENTS

被引:0
|
作者
Luketina, Jelena [1 ,2 ]
Flennerhag, Sebastian [2 ]
Schroecker, Yannick [2 ]
Abel, David [2 ]
Zahavy, Tom [2 ]
Singh, Satinder [2 ]
机构
[1] Univ Oxford, Oxford, England
[2] DeepMind, London, England
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Meta-gradient methods (Xu et al., 2018; Zahavy et al., 2020) offer a promising solution to the problem of hyperparameter selection and adaptation in non-stationary reinforcement learning problems. However, the properties of meta-gradients in such environments have not been systematically studied. In this work, we bring new clarity to meta-gradients in non-stationary environments. Concretely, we ask: (i) how much information should be given to the learned optimizers, so as to enable faster adaptation and generalization over a lifetime, (ii) what meta-optimizer functions are learned in this process, and (iii) whether meta-gradient methods provide a bigger advantage in highly non-stationary environments. To study the effect of information provided to the meta-optimizer, as in recent works (Flennerhag et al., 2022; Almeida et al., 2021), we replace the tuned meta-parameters of fixed update rules with learned meta-parameter functions of selected context features. The context features carry information about agent performance and changes in the environment and hence can inform learned meta-parameter schedules. We find that adding more contextual information is generally beneficial, leading to faster adaptation of meta-parameter values and increased performance. We support these results with a qualitative analysis of resulting meta-parameter schedules and learned functions of context features. Lastly, we find that without context, meta-gradients do not provide a consistent advantage over the baseline in highly non-stationary environments. Our findings suggest that contextualising meta-gradients can play a pivotal role in extracting high performance from meta-gradients in non-stationary settings.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] An Online Algorithm for Computation Offloading in Non-Stationary Environments
    Rahman, Aniq Ur
    Ghatak, Gourab
    De Domenico, Antonio
    IEEE COMMUNICATIONS LETTERS, 2020, 24 (10) : 2167 - 2171
  • [42] Recursive Adaptation of Stepsize Parameter for Non-stationary Environments
    Noda, Itsuki
    PRINCIPLES OF PRACTICE IN MULTI-AGENT SYSTEMS, 2009, 5925 : 525 - 533
  • [43] Weighted Gaussian Process Bandits for Non-stationary Environments
    Deng, Yuntian
    Zhou, Xingyu
    Kim, Baekjin
    Tewari, Ambuj
    Gupta, Abhishek
    Shroff, Ness
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
  • [44] Stochastic Bandits with Graph Feedback in Non-Stationary Environments
    National Key Laboratory for Novel Software Technology, Nanjing University, Nanjing
    210023, China
    不详
    100102, China
    AAAI Conf. Artif. Intell., AAAI, 1600, (8758-8766): : 8758 - 8766
  • [45] Forecasting in non-stationary environments with fuzzy time series
    de Lima e Silva, Petronio Candido
    Severiano Junior, Carlos Alberto
    Alves, Marcos Antonio
    Silva, Rodrigo
    Cohen, Miri Weiss
    Guimaraes, Frederico Gadelha
    APPLIED SOFT COMPUTING, 2020, 97
  • [46] Adaptive Beamforming with Augmentable Arrays in Non-Stationary Environments
    Odom, Jonathan L.
    Krolik, Jeffrey L.
    2013 IEEE 5TH INTERNATIONAL WORKSHOP ON COMPUTATIONAL ADVANCES IN MULTI-SENSOR ADAPTIVE PROCESSING (CAMSAP 2013), 2013, : 324 - 327
  • [47] Adaptive and on-line learning in non-stationary environments
    Lughofer, Edwin
    Sayed-Mouchaweh, Moamar
    EVOLVING SYSTEMS, 2015, 6 (02) : 75 - 77
  • [48] Recursive Adaptation of Stepsize Parameter for Non-stationary Environments
    Noda, Itsuki
    ADAPTIVE AND LEARNING AGENTS, 2010, 5924 : 74 - 90
  • [49] Global localization with detection of changes in non-stationary environments
    Tanaka, K
    Kimuro, Y
    Okada, N
    Kondo, E
    2004 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1- 5, PROCEEDINGS, 2004, : 1487 - 1492
  • [50] Multi-Agent Combat in Non-Stationary Environments
    Li, Shengang
    Chi, Haoang
    Xie, Tao
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,