Noisy Derivative-Free Optimization with Value Suppression

被引:0
|
作者
Wang, Hong [1 ]
Qian, Hong [1 ]
Yu, Yang [1 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210023, Jiangsu, Peoples R China
关键词
EVOLUTIONARY; ENVIRONMENTS; STRATEGY; TIME;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Derivative-free optimization has shown advantage in solving sophisticated problems such as policy search, when the environment is noise-free. Many real-world environments are noisy, where solution evaluations are inaccurate due to the noise. Noisy evaluation can badly injure derivative-free optimization, as it may make a worse solution looks better. Sampling is a straightforward way to reduce noise, while previous studies have shown that delay the noise handling to the comparison time point (i.e., threshold selection) can be helpful for derivative-free optimization. This work further delays the noise handling, and proposes a simple noise handling mechanism, i.e., value suppression. By value suppression, we do nothing about noise until the best-so-far solution has not been improved for a period, and then suppress the value of the best-so-far solution and continue the optimization. On synthetic problems as well as reinforcement learning tasks, experiments verify that value suppression can be significantly more effective than the previous methods.
引用
收藏
页码:1447 / 1454
页数:8
相关论文
共 50 条
  • [1] Effective matrix adaptation strategy for noisy derivative-free optimization
    Kimiaei, Morteza
    Neumaier, Arnold
    MATHEMATICAL PROGRAMMING COMPUTATION, 2024, 16 (03) : 459 - 501
  • [2] DERIVATIVE-FREE OPTIMIZATION OF NOISY FUNCTIONS VIA QUASI-NEWTON METHODS
    Berahas, Albert S.
    Byrd, Richard H.
    Nocedal, Jorge
    SIAM JOURNAL ON OPTIMIZATION, 2019, 29 (02) : 965 - 993
  • [3] ADAPTIVE FINITE-DIFFERENCE INTERVAL ESTIMATION FOR NOISY DERIVATIVE-FREE OPTIMIZATION
    Shi, Hao-Jun Michael
    Xie, Yuchen
    Xuan, Melody Qiming
    Nocedal, Jorge
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2022, 44 (04): : A2302 - A2321
  • [4] Decomposition in derivative-free optimization
    Kaiwen Ma
    Nikolaos V. Sahinidis
    Sreekanth Rajagopalan
    Satyajith Amaran
    Scott J Bury
    Journal of Global Optimization, 2021, 81 : 269 - 292
  • [5] Efficient derivative-free optimization
    Belitz, Paul
    Bewley, Thomas
    PROCEEDINGS OF THE 46TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-14, 2007, : 5607 - 5612
  • [6] Decomposition in derivative-free optimization
    Ma, Kaiwen
    Sahinidis, Nikolaos V.
    Rajagopalan, Sreekanth
    Amaran, Satyajith
    Bury, Scott J.
    JOURNAL OF GLOBAL OPTIMIZATION, 2021, 81 (02) : 269 - 292
  • [7] SURVEY OF DERIVATIVE-FREE OPTIMIZATION
    Xi, Min
    Sun, Wenyu
    Chen, Jun
    NUMERICAL ALGEBRA CONTROL AND OPTIMIZATION, 2020, 10 (04): : 537 - 555
  • [8] Derivative-free optimization methods
    Larson, Jeffrey
    Menickelly, Matt
    Wild, Stefan M.
    ACTA NUMERICA, 2019, 28 : 287 - 404
  • [9] Derivative-Free and Blackbox Optimization
    Huyer, W.
    MONATSHEFTE FUR MATHEMATIK, 2020, 192 (02): : 480 - 480
  • [10] ZOOpt: a toolbox for derivative-free optimization
    Liu, Yu-Ren
    Hu, Yi-Qi
    Qian, Hong
    Qian, Chao
    Yu, Yang
    SCIENCE CHINA-INFORMATION SCIENCES, 2022, 65 (10)