Differentiable Architecture Search with Random Features

被引:7
|
作者
Zhang, Xuanyang [1 ]
Li, Yonggang [2 ]
Zhang, Xiangyu [1 ]
Wang, Yongtao [2 ]
Sun, Jian [1 ]
机构
[1] IMEGVII Technol, Beijing, Peoples R China
[2] Peking Univ, Beijing, Peoples R China
关键词
D O I
10.1109/CVPR52729.2023.01541
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Differentiable architecture search (DARTS) has significantly promoted the development of NAS techniques because of its high search efficiency and effectiveness but suffers from performance collapse. In this paper, we make efforts to alleviate the performance collapse problem for DARTS from two aspects. First, we investigate the expressive power of the supernet in DARTS and then derive a new setup of DARTS paradigm with only training Batch-Norm. Second, we theoretically find that random features dilute the auxiliary connection role of skip-connection in supernet optimization and enable search algorithm focus on fairer operation selection, thereby solving the performance collapse problem. We instantiate DARTS and PC-DARTS with random features to build an improved version for each named RF-DARTS and RF-PCDARTS respectively. Experimental results show that RF-DARTS obtains 94.36% test accuracy on CIFAR-10 (which is the nearest optimal result in NAS-Bench-201), and achieves the newest state-of-the-art top-1 test error of 24.0% on ImageNet when transferring from CIFAR-10. Moreover, RF-DARTS performs robustly across three datasets (CIFAR-10, CIFAR-100, and SVHN) and four search spaces (S1-S4). Besides, RF-PCDARTS achieves even better results on ImageNet, that is, 23.9% top-1 and 7.1% top-5 test error, surpassing representative methods like single-path, training-free, and partial-channel paradigms directly searched on ImageNet.
引用
收藏
页码:16060 / 16069
页数:10
相关论文
共 50 条
  • [1] Cyclic Differentiable Architecture Search
    Yu, Hongyuan
    Peng, Houwen
    Huang, Yan
    Fu, Jianlong
    Du, Hao
    Wang, Liang
    Ling, Haibin
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (01) : 211 - 228
  • [2] Differentiable quantum architecture search
    Zhang, Shi-Xin
    Hsieh, Chang-Yu
    Zhang, Shengyu
    Yao, Hong
    QUANTUM SCIENCE AND TECHNOLOGY, 2022, 7 (04)
  • [3] Regularized Differentiable Architecture Search
    Wang, Lanfei
    Xie, Lingxi
    Zhao, Kaili
    Guo, Jun
    Tian, Qi
    IEEE EMBEDDED SYSTEMS LETTERS, 2023, 15 (03) : 129 - 132
  • [4] The limitations of differentiable architecture search
    Guillaume, Lacharme
    Hubert, Cardot
    Christophe, Lente
    Nicolas, Monmarche
    PATTERN ANALYSIS AND APPLICATIONS, 2024, 27 (02)
  • [5] Group Differentiable Architecture Search
    Shen, Chaoyuan
    Xu, Jinhua
    IEEE ACCESS, 2021, 9 : 76585 - 76591
  • [6] Enhanced Gradient for Differentiable Architecture Search
    Zhang, Haichao
    Hao, Kuangrong
    Gao, Lei
    Tang, Xuesong
    Wei, Bing
    Wei, Bing
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (07) : 9606 - 9620
  • [7] Sparse Gate for Differentiable Architecture Search
    Fan, Liang
    Wang, Handing
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [8] Differentiable Architecture Search for Reinforcement Learning
    Miao, Yingjie
    Song, Xingyou
    Co-Reyes, John D.
    Peng, Daiyi
    Yue, Summer
    Brevdo, Eugene
    Faust, Aleksandra
    INTERNATIONAL CONFERENCE ON AUTOMATED MACHINE LEARNING, VOL 188, 2022, 188
  • [9] IDARTS: Interactive Differentiable Architecture Search
    Xue, Song
    Wang, Runqi
    Zhang, Baochang
    Wang, Tian
    Guo, Guodong
    Doermann, David
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 1143 - 1152
  • [10] An architecture entropy regularizer for differentiable neural architecture search
    Jing, Kun
    Chen, Luoyu
    Xu, Jungang
    NEURAL NETWORKS, 2023, 158 : 111 - 120