First-Order Optimization (Training) Algorithms in Deep Learning

Cited by: 0

Authors:

Rudenko, Oleg [1]

Bezsonov, Oleksandr [1]

Oliinyk, Kyrylo [1]

Affiliations:

[1] Kharkiv Natl Univ Radio Elect, Nauky Ave 14, UA-61166 Kharkiv, Ukraine

Source:

COMPUTATIONAL LINGUISTICS AND INTELLIGENT SYSTEMS (COLINS 2020), VOL I: MAIN CONFERENCE | 2020 / Vol. 2604

Keywords:

Convolution; Optimization; Neural Network; Algorithm; Gradient; Training; Image Recognition

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

The use of artificial neural networks (ANNs) requires solving structural and parametric identification problems, which correspond to choosing the optimal network topology and training the network (setting its parameters). In contrast to determining the structure, which is a discrete (combinatorial) optimization problem, the search for optimal parameters is carried out in a continuous space using optimization methods. The most widely used optimization methods in deep learning are first-order algorithms based on gradient descent (GD). This paper provides a comparative analysis of training algorithms for convolutional neural networks used in image recognition tasks. The training algorithms were compared on the Oxford 17-category flower dataset using the TensorFlow framework. The studies show that for this task a simple gradient descent algorithm is quite effective. At the same time, the problem of selecting optimal values of the algorithms' parameters that provide the highest learning speed remains open.


Pages: 15

