Robust Black-Box Optimization for Stochastic Search and Episodic Reinforcement Learning

被引:0
|
作者
Huttenrauch, Maximilian [1 ]
Neumann, Gerhard [1 ]
机构
[1] Karlsruhe Inst Technol, Dept Comp Sci, Karlsruhe, Germany
关键词
black-box optimization; stochastic search; derivative-free optimization; evolution strategies; episodic reinforcement learning; EVOLUTIONARY;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Black -box optimization is a versatile approach to solve complex problems where the objective function is not explicitly known and no higher order information is available. Due to its general nature, it finds widespread applications in function optimization as well as machine learning, especially episodic reinforcement learning tasks. While traditional black -box optimizers like CMA-ES may falter in noisy scenarios due to their reliance on ranking -based transformations, a promising alternative emerges in the form of the Model -based Relative Entropy Stochastic Search (MORE) algorithm. MORE can be derived from natural policy gradients and compatible function approximation and directly optimizes the expected fitness without resorting to rankings. However, in its original formulation, MORE often cannot achieve state of the art performance. In this paper, we improve MORE by decoupling the update of the search distribution's mean and covariance and an improved entropy scheduling technique based on an evolution path resulting in faster convergence, and a simplified model learning approach in comparison to the original paper. We show that our algorithm performs comparable to state-of-the-art black -box optimizers on standard benchmark functions. Further, it clearly outperforms ranking -based methods and other policy -gradient based black -box algorithms as well as state of the art deep reinforcement learning algorithms when used for episodic reinforcement learning tasks.
引用
收藏
页码:1 / 44
页数:44
相关论文
共 50 条
  • [21] Extensive antibody search with whole spectrum black-box optimization
    Tucs, Andrejs
    Ito, Tomoyuki
    Kurumida, Yoichi
    Kawada, Sakiya
    Nakazawa, Hikaru
    Saito, Yutaka
    Umetsu, Mitsuo
    Tsuda, Koji
    SCIENTIFIC REPORTS, 2024, 14 (01)
  • [22] ROCK☆ - Efficient Black-box Optimization for Policy Learning
    Hwangbo, Jemin
    Gehring, Christian
    Sommer, Hannes
    Siegwart, Roland
    Buchli, Jonas
    2014 14TH IEEE-RAS INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS (HUMANOIDS), 2014, : 535 - 540
  • [23] Policy Learning with an Effcient Black-Box Optimization Algorithm
    Hwangbo, Jemin
    Gehring, Christian
    Sommer, Hannes
    Siegwart, Roland
    Buchli, Jonas
    INTERNATIONAL JOURNAL OF HUMANOID ROBOTICS, 2015, 12 (03)
  • [24] A black-box scatter search for optimization problems with integer variables
    Manuel Laguna
    Francisco Gortázar
    Micael Gallego
    Abraham Duarte
    Rafael Martí
    Journal of Global Optimization, 2014, 58 : 497 - 516
  • [25] Accelerated Random Search for Black-Box Constraint Satisfaction and Optimization
    Iorio, Jenna N.
    Regis, Rommel G.
    2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021), 2021,
  • [26] Black-Box Optimization in an Extended Search Space for SAT Solving
    Zaikin, Oleg
    Kochemazov, Stepan
    MATHEMATICAL OPTIMIZATION THEORY AND OPERATIONS RESEARCH, 2019, 11548 : 402 - 417
  • [27] Solving Black-Box Optimization Challenge via Learning Search Space Partition for Local Bayesian Optimization
    Sazanovich, Mikita
    Nikolskaya, Anastasiya
    Belousov, Yury
    Shpilman, Aleksei
    NEURIPS 2020 COMPETITION AND DEMONSTRATION TRACK, VOL 133, 2020, 133 : 77 - 85
  • [28] BlackVIP: Black-Box Visual Prompting for Robust Transfer Learning
    Oh, Changdae
    Hwang, Hyeji
    Lee, Hee-young
    Lim, YongTaek
    Jung, Geunyoung
    Jung, Jiyoung
    Choi, Hosik
    Song, Kyungwoo
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 24224 - 24235
  • [29] Versatile Black-Box Optimization
    Liu, Jialin
    Moreau, Antoine
    Preuss, Mike
    Rapin, Jeremy
    Roziere, Baptiste
    Teytaud, Fabien
    Teytaud, Olivier
    GECCO'20: PROCEEDINGS OF THE 2020 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2020, : 620 - 628
  • [30] Black-box Optimization with a Politician
    Bubeck, Sebastien
    Lee, Yin-Tat
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48