On the Trend-corrected Variant of Adaptive Stochastic Optimization Methods

被引:0
|
作者
Zhou, Bingxin [1 ]
Zheng, Xuebin [1 ]
Gao, Junbin [1 ]
机构
[1] Univ Sydney, Business Sch, Sydney, NSW, Australia
关键词
Stochastic Gradient Descent; ADAM; Deep Learning; Optimization;
D O I
10.1109/ijcnn48605.2020.9207166
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Adam-type optimizers, as a class of adaptive moment estimation methods with the exponential moving average scheme, have been successfully used in many applications of deep learning. Such methods are appealing due to the capability on large-scale sparse datasets with high computational efficiency. In this paper, we present a new framework for Adam-type methods with the trend information when updating the parameters with the adaptive step size and gradients. The additional terms in the algorithm promise an efficient movement on the complex cost surface, and thus the loss would converge more rapidly. We show empirically the importance of adding the trend component, where our framework outperforms the conventional Adam and AMSGrad methods constantly on the classical models with several real-world datasets.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] Dualityfree Methods for Stochastic Composition Optimization
    Liu, Liu
    Liu, Ji
    Tao, Dacheng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (04) : 1205 - 1217
  • [32] Stochastic Methods for Practical Global Optimization
    Zelda B. Zabinsky
    Journal of Global Optimization, 1998, 13 : 433 - 444
  • [33] CONCURRENT STOCHASTIC METHODS FOR GLOBAL OPTIMIZATION
    BYRD, RH
    DERT, CL
    KAN, AHGR
    SCHNABEL, RB
    MATHEMATICAL PROGRAMMING, 1990, 46 (01) : 1 - 29
  • [34] Stochastic optimization methods for protein folding
    Schug, Alexander
    Herges, Thomas
    Verma, Abhinav
    Wenzel, Wolfgang
    RECENT ADVANCES IN THE THEORY OF CHEMICAL AND PHYSICAL SYSTEMS, 2006, 15 : 557 - +
  • [35] Comparing stochastic methods on SMES optimization
    Hajji, O
    Brisset, S
    Brochet, P
    OPTIMIZATION AND INVERSE PROBLEMS IN ELECTROMAGNETISM, 2003, : 21 - 32
  • [36] Stochastic methods for practical global optimization
    Zabinsky, ZB
    JOURNAL OF GLOBAL OPTIMIZATION, 1998, 13 (04) : 433 - 444
  • [37] MINORANT METHODS OF STOCHASTIC GLOBAL OPTIMIZATION
    Norkin, V. I.
    Onishchenko, B. O.
    CYBERNETICS AND SYSTEMS ANALYSIS, 2005, 41 (02) : 203 - 214
  • [38] A novel term structure stochastic model with adaptive correlation for trend analysis
    Du, Jiangze
    Lai, Shaojie
    Lai, Kin Keung
    Zhou, Shifei
    INTERNATIONAL JOURNAL OF FINANCE & ECONOMICS, 2021, 26 (04) : 5485 - 5498
  • [39] The linear quadratic problem for the Ito stochastic equation: An adaptive variant
    Sragovich, VG
    AUTOMATION AND REMOTE CONTROL, 1999, 60 (07) : 983 - 985
  • [40] Adaptive Stochastic Optimization: From Sets to Paths
    Lim, Zhan Wei
    Hsu, David
    Lee, Wee Sun
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28