Noisy Derivative-Free Optimization with Value Suppression

被引:0
|
作者
Wang, Hong [1 ]
Qian, Hong [1 ]
Yu, Yang [1 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210023, Jiangsu, Peoples R China
关键词
EVOLUTIONARY; ENVIRONMENTS; STRATEGY; TIME;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Derivative-free optimization has shown advantage in solving sophisticated problems such as policy search, when the environment is noise-free. Many real-world environments are noisy, where solution evaluations are inaccurate due to the noise. Noisy evaluation can badly injure derivative-free optimization, as it may make a worse solution looks better. Sampling is a straightforward way to reduce noise, while previous studies have shown that delay the noise handling to the comparison time point (i.e., threshold selection) can be helpful for derivative-free optimization. This work further delays the noise handling, and proposes a simple noise handling mechanism, i.e., value suppression. By value suppression, we do nothing about noise until the best-so-far solution has not been improved for a period, and then suppress the value of the best-so-far solution and continue the optimization. On synthetic problems as well as reinforcement learning tasks, experiments verify that value suppression can be significantly more effective than the previous methods.
引用
收藏
页码:1447 / 1454
页数:8
相关论文
共 50 条
  • [41] A derivative-free descent method in set optimization
    Jahn, Johannes
    COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2015, 60 (02) : 393 - 411
  • [42] A DERIVATIVE-FREE METHOD FOR STRUCTURED OPTIMIZATION PROBLEMS
    Cristofari, Andrea
    Rinaldi, Francesco
    SIAM JOURNAL ON OPTIMIZATION, 2021, 31 (02) : 1079 - 1107
  • [43] Penalty Fuzzy Function for Derivative-Free Optimization
    Matias, J.
    Mestre, P.
    Correia, A.
    Couto, P.
    Serodio, C.
    Melo-Pinto, P.
    EUROFUSE 2011: WORKSHOP ON FUZZY METHODS FOR KNOWLEDGE-BASED SYSTEMS, 2011, 107 : 293 - +
  • [44] Derivative-Free Optimization for Population Dynamic Models
    Schaarschmidt, Ute
    Steihaug, Trond
    Subbey, Sam
    MODELLING, COMPUTATION AND OPTIMIZATION IN INFORMATION SYSTEMS AND MANAGEMENT SCIENCES - MCO 2015, PT 1, 2015, 359 : 391 - 402
  • [45] Penalty fuzzy function for derivative-free optimization
    Matias, J.
    Mestre, P.
    Correia, A.
    Couto, P.
    Serodio, C.
    Melo-Pinto, P.
    Advances in Intelligent and Soft Computing, 2011, 107 : 293 - 301
  • [46] A derivative-free descent method in set optimization
    Johannes Jahn
    Computational Optimization and Applications, 2015, 60 : 393 - 411
  • [47] Bilevel derivative-free optimization and its application to robust optimization
    Conn, A. R.
    Vicente, L. N.
    OPTIMIZATION METHODS & SOFTWARE, 2012, 27 (03): : 561 - 577
  • [48] Global Optimization with Derivative-free, Derivative-based and Evolutionary Algorithms
    Bashir, Hassan A.
    Neville, Richard S.
    2014 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2014, : 100 - 105
  • [49] A derivative-free algorithm for sparse unconstrained optimization problems
    Colson, B
    Toint, PL
    TRENDS IN INDUSTRIAL AND APPLIED MATHEMATICS, PROCEEDINGS, 2002, 72 : 131 - 147
  • [50] Extremum Seeking Tracking for Derivative-Free Distributed Optimization
    Mimmo, Nicola
    Carnevale, Guido
    Testa, Andrea
    Notarstefano, Giuseppe
    IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2025, 12 (01): : 584 - 595