On the Trend-corrected Variant of Adaptive Stochastic Optimization Methods

被引:0
|
作者
Zhou, Bingxin [1 ]
Zheng, Xuebin [1 ]
Gao, Junbin [1 ]
机构
[1] Univ Sydney, Business Sch, Sydney, NSW, Australia
关键词
Stochastic Gradient Descent; ADAM; Deep Learning; Optimization;
D O I
10.1109/ijcnn48605.2020.9207166
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Adam-type optimizers, as a class of adaptive moment estimation methods with the exponential moving average scheme, have been successfully used in many applications of deep learning. Such methods are appealing due to the capability on large-scale sparse datasets with high computational efficiency. In this paper, we present a new framework for Adam-type methods with the trend information when updating the parameters with the adaptive step size and gradients. The additional terms in the algorithm promise an efficient movement on the complex cost surface, and thus the loss would converge more rapidly. We show empirically the importance of adding the trend component, where our framework outperforms the conventional Adam and AMSGrad methods constantly on the classical models with several real-world datasets.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] ADAPTIVE STOCHASTIC OPTIMIZATION USING MULTIPROCESSORS
    MASRI, SF
    BEKEY, GA
    CAUGHEY, TK
    VANDEVELDE, E
    APPLIED MATHEMATICS AND COMPUTATION, 1995, 72 (2-3) : 225 - 257
  • [22] Adaptive stochastic optimization using multiprocessors
    Appl Math Comput (New York), 2-3 (225):
  • [23] ADAPTIVE SAMPLING STRATEGIES FOR STOCHASTIC OPTIMIZATION
    Bollapragada, Raghu
    Byrd, Richard
    Nocedal, Jorge
    SIAM JOURNAL ON OPTIMIZATION, 2018, 28 (04) : 3312 - 3343
  • [24] Adaptive Sampling Stochastic Multigradient Algorithm for Stochastic Multiobjective Optimization
    Zhao, Yong
    Chen, Wang
    Yang, Xinmin
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2024, 200 (01) : 215 - 241
  • [25] Adaptive Sampling Stochastic Multigradient Algorithm for Stochastic Multiobjective Optimization
    Yong Zhao
    Wang Chen
    Xinmin Yang
    Journal of Optimization Theory and Applications, 2024, 200 (1) : 215 - 241
  • [26] Adaptive Methods for Nonconvex Optimization
    Zaheer, Manzil
    Reddi, Sashank J.
    Sachan, Devendra
    Kale, Satyen
    Kumar, Sanjiv
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [27] ADAPTIVE METHODS OF MULTICRITERIAL OPTIMIZATION
    RASTRIGIN, LA
    EIDUK, YY
    AUTOMATION AND REMOTE CONTROL, 1985, 46 (01) : 1 - 21
  • [28] ADAPTIVE METHODS OF TREND DETECTION AND THEIR APPLICATION IN ANALYZING BIOSIGNALS
    SCHACK, B
    GRIESZBACH, G
    BIOMETRICAL JOURNAL, 1994, 36 (04) : 429 - 452
  • [29] Stochastic methods for practical global optimization
    J of Global Optim, 4 (433-444):
  • [30] STOCHASTIC OPTIMIZATION METHODS IN STRUCTURAL MECHANICS
    MARTI, K
    ZEITSCHRIFT FUR ANGEWANDTE MATHEMATIK UND MECHANIK, 1990, 70 (06): : T742 - T745