First-Order Optimization (Training) Algorithms in Deep Learning

被引:0
|
作者
Rudenko, Oleg [1 ]
Bezsonov, Oleksandr [1 ]
Oliinyk, Kyrylo [1 ]
机构
[1] Kharkiv Natl Univ Radio Elect, Nauky Ave 14, UA-61166 Kharkiv, Ukraine
关键词
Convolution; Optimization; Neural Network; Algorithm; Gradient; Training; Image Recognition;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The use of artificial neural networks (ANN) requires solving structural and parametric identification problems corresponding to the choice of the optimal network topology and its training (parameter settings). In contrast to the problem of determining the structure, which is a discrete optimization (combinatorial), the search for optimal parameters is carried out in continuous space using some optimization methods. The most widely used optimization method in deep learning is the first-order algorithm that based on gradient descent (GD). In the given paper a comparative analysis of convolutional neural networks training algorithms that are used in tasks of image recognition is provided. Comparison of training algorithms was carried out on the Oxford17 category flower dataset with TensorFlow framework usage. Studies show that for this task a simple gradient descent algorithm is quite effective. At the same time, however, the problem of selecting the optimal values of the algorithms parameters that provide top speed of learning still remains open.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] NEW FIRST-ORDER ALGORITHMS FOR STOCHASTIC VARIATIONAL INEQUALITIES
    Huang, K. E. V. I. N.
    Zhang, S. H. U. Z. H. O. N. G.
    SIAM JOURNAL ON OPTIMIZATION, 2022, 32 (04) : 2745 - 2772
  • [42] LINEARLY CONVERGENT FIRST-ORDER ALGORITHMS FOR SEMIDEFINITE PROGRAMMING
    Dang, Cong D.
    Lan, Guanghui
    Wen, Zaiwen
    JOURNAL OF COMPUTATIONAL MATHEMATICS, 2017, 35 (04) : 452 - 468
  • [43] Designing Universally-Approximating Deep Neural Networks: A First-Order Optimization Approach
    Wu, Zhoutong
    Xiao, Mingqing
    Fang, Cong
    Lin, Zhouchen
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (09) : 6231 - 6246
  • [44] First-order rules for nonsmooth constrained optimization
    Lassonde, M
    NONLINEAR ANALYSIS-THEORY METHODS & APPLICATIONS, 2001, 44 (08) : 1031 - 1056
  • [45] A Study of Condition Numbers for First-Order Optimization
    Guille-Escuret, Charles
    Goujaud, Baptiste
    Girotti, Manuela
    Mitliagkas, Ioannis
    24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
  • [46] On Fundamental Proof Structures in First-Order Optimization
    Goujaud, Baptiste
    Dieuleveut, Aymeric
    Taylor, Adrien
    2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 3023 - 3030
  • [47] First-order approximation and model management in optimization
    Alexandrov, NM
    Lewis, RM
    LARGE-SCALE PDE-CONSTRAINED OPTIMIZATION, 2003, 30 : 63 - 79
  • [48] FIRST-ORDER PENALTY METHODS FOR BILEVEL OPTIMIZATION
    Lu, Zhaosong
    Mei, Sanyou
    SIAM JOURNAL ON OPTIMIZATION, 2024, 34 (02) : 1937 - 1969
  • [49] Control Interpretations for First-Order Optimization Methods
    Hu, Bin
    Lessard, Laurent
    2017 AMERICAN CONTROL CONFERENCE (ACC), 2017, : 3114 - 3119
  • [50] Analysis and Design of First-Order Distributed Optimization Algorithms Over Time-Varying Graphs
    Sundararajan, Akhil
    Van Scoy, Bryan
    Lessard, Laurent
    IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2020, 7 (04): : 1597 - 1608