Differentiable Architecture Search with Random Features

被引:7
|
作者
Zhang, Xuanyang [1 ]
Li, Yonggang [2 ]
Zhang, Xiangyu [1 ]
Wang, Yongtao [2 ]
Sun, Jian [1 ]
机构
[1] IMEGVII Technol, Beijing, Peoples R China
[2] Peking Univ, Beijing, Peoples R China
关键词
D O I
10.1109/CVPR52729.2023.01541
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Differentiable architecture search (DARTS) has significantly promoted the development of NAS techniques because of its high search efficiency and effectiveness but suffers from performance collapse. In this paper, we make efforts to alleviate the performance collapse problem for DARTS from two aspects. First, we investigate the expressive power of the supernet in DARTS and then derive a new setup of DARTS paradigm with only training Batch-Norm. Second, we theoretically find that random features dilute the auxiliary connection role of skip-connection in supernet optimization and enable search algorithm focus on fairer operation selection, thereby solving the performance collapse problem. We instantiate DARTS and PC-DARTS with random features to build an improved version for each named RF-DARTS and RF-PCDARTS respectively. Experimental results show that RF-DARTS obtains 94.36% test accuracy on CIFAR-10 (which is the nearest optimal result in NAS-Bench-201), and achieves the newest state-of-the-art top-1 test error of 24.0% on ImageNet when transferring from CIFAR-10. Moreover, RF-DARTS performs robustly across three datasets (CIFAR-10, CIFAR-100, and SVHN) and four search spaces (S1-S4). Besides, RF-PCDARTS achieves even better results on ImageNet, that is, 23.9% top-1 and 7.1% top-5 test error, surpassing representative methods like single-path, training-free, and partial-channel paradigms directly searched on ImageNet.
引用
收藏
页码:16060 / 16069
页数:10
相关论文
共 50 条
  • [31] Operation-level Progressive Differentiable Architecture Search
    Zhu, Xunyu
    Li, Jian
    Liu, Yong
    Liao, Jun
    Wang, Weiping
    2021 21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2021), 2021, : 1559 - 1564
  • [32] DropNAS: Grouped Operation Dropout for Differentiable Architecture Search
    Hong, Weijun
    Li, Guilin
    Zhang, Weinan
    Tang, Ruiming
    Wang, Yunhe
    Li, Zhenguo
    Yu, Yong
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 2326 - 2332
  • [33] DOTS: Decoupling Operation and Topology in Differentiable Architecture Search
    Gu, Yu-Chao
    Wang, Li-Juan
    Liu, Yun
    Yang, Yi
    Wu, Yu-Huan
    Lu, Shao-Ping
    Cheng, Ming-Ming
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 12306 - 12315
  • [34] Operation and Topology Aware Fast Differentiable Architecture Search
    Siddiqui, Shahid
    Kyrkou, Christos
    Theocharides, Theocharis
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 9666 - 9673
  • [35] iDARTS: Differentiable Architecture Search with Stochastic Implicit Gradients
    Zhang, Miao
    Su, Steven
    Pan, Shirui
    Chang, Xiaojun
    Abbasnejad, Ehsan
    Haffari, Reza
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [36] Understanding the wiring evolution in differentiable neural architecture search
    Xie, Sirui
    Hu, Shoukang
    Wang, Xinjiang
    Liu, Chunxiao
    Shi, Jianping
    Liu, Xunying
    Lin, Dahua
    24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
  • [37] Differentiable Architecture Search Algorithm Based on Global Comparison
    Zeng, Xianglun
    Xiao, Hongxiang
    IEEE ACCESS, 2023, 11 : 82674 - 82684
  • [38] Memory-Efficient Differentiable Transformer Architecture Search
    Zhao, Yuekai
    Dong, Li
    Shen, Yelong
    Zhang, Zhihua
    Wei, Furu
    Chen, Weizhu
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 4254 - 4264
  • [39] Exploiting Operation Importance for Differentiable Neural Architecture Search
    Zhou, Yuan
    Xie, Xukai
    Kung, Sun-Yuan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (11) : 6235 - 6248
  • [40] Adaptive Channel Allocation for Robust Differentiable Architecture Search
    Li, Chao
    Ning, Jia
    Hu, Han
    He, Kun
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,