Differentiable Architecture Search with Random Features

被引：7

作者：

Zhang, Xuanyang ^{[1
]}

Li, Yonggang ^{[2
]}

Zhang, Xiangyu ^{[1
]}

Wang, Yongtao ^{[2
]}

Sun, Jian ^{[1
]}

机构：

[1] IMEGVII Technol, Beijing, Peoples R China

[2] Peking Univ, Beijing, Peoples R China

来源：

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2023年

关键词：

D O I：

10.1109/CVPR52729.2023.01541

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Differentiable architecture search (DARTS) has significantly promoted the development of NAS techniques because of its high search efficiency and effectiveness but suffers from performance collapse. In this paper, we make efforts to alleviate the performance collapse problem for DARTS from two aspects. First, we investigate the expressive power of the supernet in DARTS and then derive a new setup of DARTS paradigm with only training Batch-Norm. Second, we theoretically find that random features dilute the auxiliary connection role of skip-connection in supernet optimization and enable search algorithm focus on fairer operation selection, thereby solving the performance collapse problem. We instantiate DARTS and PC-DARTS with random features to build an improved version for each named RF-DARTS and RF-PCDARTS respectively. Experimental results show that RF-DARTS obtains 94.36% test accuracy on CIFAR-10 (which is the nearest optimal result in NAS-Bench-201), and achieves the newest state-of-the-art top-1 test error of 24.0% on ImageNet when transferring from CIFAR-10. Moreover, RF-DARTS performs robustly across three datasets (CIFAR-10, CIFAR-100, and SVHN) and four search spaces (S1-S4). Besides, RF-PCDARTS achieves even better results on ImageNet, that is, 23.9% top-1 and 7.1% top-5 test error, surpassing representative methods like single-path, training-free, and partial-channel paradigms directly searched on ImageNet.

引用

页码：16060 / 16069

页数：10

共 50 条

[31] Operation-level Progressive Differentiable Architecture Search
Zhu, Xunyu
Li, Jian
Liu, Yong
Liao, Jun
Wang, Weiping
2021 21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2021), 2021, : 1559 - 1564
[32] DropNAS: Grouped Operation Dropout for Differentiable Architecture Search
Hong, Weijun
Li, Guilin
Zhang, Weinan
Tang, Ruiming
Wang, Yunhe
Li, Zhenguo
Yu, Yong
PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 2326 - 2332
[33] DOTS: Decoupling Operation and Topology in Differentiable Architecture Search
Gu, Yu-Chao
Wang, Li-Juan
Liu, Yun
Yang, Yi
Wu, Yu-Huan
Lu, Shao-Ping
Cheng, Ming-Ming
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 12306 - 12315
[34] Operation and Topology Aware Fast Differentiable Architecture Search
Siddiqui, Shahid
Kyrkou, Christos
Theocharides, Theocharis
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 9666 - 9673
[35] iDARTS: Differentiable Architecture Search with Stochastic Implicit Gradients
Zhang, Miao
Su, Steven
Pan, Shirui
Chang, Xiaojun
Abbasnejad, Ehsan
Haffari, Reza
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
[36] Understanding the wiring evolution in differentiable neural architecture search
Xie, Sirui
Hu, Shoukang
Wang, Xinjiang
Liu, Chunxiao
Shi, Jianping
Liu, Xunying
Lin, Dahua
24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
[37] Differentiable Architecture Search Algorithm Based on Global Comparison
Zeng, Xianglun
Xiao, Hongxiang
IEEE ACCESS, 2023, 11 : 82674 - 82684
[38] Memory-Efficient Differentiable Transformer Architecture Search
Zhao, Yuekai
Dong, Li
Shen, Yelong
Zhang, Zhihua
Wei, Furu
Chen, Weizhu
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 4254 - 4264
[39] Exploiting Operation Importance for Differentiable Neural Architecture Search
Zhou, Yuan
Xie, Xukai
Kung, Sun-Yuan
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (11) : 6235 - 6248
[40] Adaptive Channel Allocation for Robust Differentiable Architecture Search
Li, Chao
Ning, Jia
Hu, Han
He, Kun
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,

← 1 2 3 4 5 →