Inner Loop-Based Modified Differentiable Architecture Search

被引:1
|
作者
Jin, Cong [1 ]
Huang, Jinjie [1 ,2 ]
机构
[1] Harbin Univ Sci & Technol, Sch Comp Sci & Technol, Harbin 150006, Peoples R China
[2] Harbin Univ Sci & Technol, Sch Automat, Harbin 150006, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
基金
中国国家自然科学基金;
关键词
Neural network; differentiable architecture search; deep learning; implicit regularization;
D O I
10.1109/ACCESS.2024.3377888
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Differentiable neural architecture search, which significantly reduces the computational cost of architecture search by several orders of magnitude, has become a popular research issue in recent years. Architecture search can fundamentally be described as an optimization problem. The differentiable architecture search updates the search process based on gradients, then derives the final sub-network architecture from the super network of the search space. However, the gap between the super network and its sub-networks together with the inaccuracy of the gradient approximation during architecture optimization bring performance collapse problems in the architecture search, making the search process extremely unstable. To this end, we propose an inner loop-based modified differentiable neural architecture search method (InLM-NAS). Firstly, we redefine the objective function of the architecture optimization process in the search process by introducing an inner-loop mechanism to prevent overfitting problems of architecture parameters and avoid convergence of the architecture search to suboptimal architectures. Secondly, a novel approximation calculation is introduced in the architecture optimization process, which reduces the error caused by the gradient approximation. It alleviates the sensitivity to the hyper-parameters setting during the architecture search and enhances the stability of the architecture search. Finally, extensive validation experiments on public datasets demonstrate that our proposed method has a more robust search process, and the searched neural network architecture has a superior network performance.
引用
收藏
页码:41918 / 41933
页数:16
相关论文
共 50 条
  • [31] Layered feature representation for differentiable architecture search
    Jie Hao
    William Zhu
    Soft Computing, 2022, 26 : 4741 - 4753
  • [32] Delve into the Performance Degradation of Differentiable Architecture Search
    Zhang, Jiuling
    Ding, Zhiming
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 2547 - 2556
  • [33] AUTOKWS: KEYWORD SPOTTING WITH DIFFERENTIABLE ARCHITECTURE SEARCH
    Zhang, Bo
    Li, Wenfeng
    Li, Qingyuan
    Zhuang, Weiji
    Chu, Xiangxiang
    Wang, Yujun
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2830 - 2834
  • [34] Image Understanding by Captioning with Differentiable Architecture Search
    Hosseini, Ramtin
    Xie, Pengtao
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4665 - 4673
  • [35] Incremental Learning with Differentiable Architecture and Forgetting Search
    Smith, James Seale
    Seymour, Zachary
    Chiu, Han-Pang
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [36] Decoupled differentiable graph neural architecture search
    Chen, Jiamin
    Gao, Jianliang
    Wu, Zhenpeng
    Al-Sabri, Raeed
    Oloulade, Babatounde Moctard
    INFORMATION SCIENCES, 2024, 673
  • [37] Graph Differentiable Architecture Search with Structure Learning
    Qin, Yijian
    Wang, Xin
    Zhang, Zeyang
    Zhu, Wenwu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [38] Layered feature representation for differentiable architecture search
    Hao, Jie
    Zhu, William
    SOFT COMPUTING, 2022, 26 (10) : 4741 - 4753
  • [39] Approach to behavioral synthesis for loop-based BIST
    Peking Univ, Beijing, China
    Proc IEEE Int Symp Circuits Syst, (VI-374-VI-377):
  • [40] Scalable and Universal Quantum Computing with Continuous-Variable Gate Sequence in a Loop-Based Architecture
    Takeda, Shuntaro
    Furusawa, Akira
    2018 CONFERENCE ON LASERS AND ELECTRO-OPTICS (CLEO), 2018,