RelativeNAS: Relative Neural Architecture Search via Slow-Fast Learning

被引:29
|
作者
Tan, Hao [1 ]
Cheng, Ran [1 ]
Huang, Shihua [1 ]
He, Cheng [1 ]
Qiu, Changxiao [2 ]
Yang, Fan [2 ]
Luo, Ping [3 ]
机构
[1] Southern Univ Sci & Technol, Univ Key Lab Evolving Intelligent Syst Guangdong, Dept Comp Sci & Engn, Shenzhen 518055, Peoples R China
[2] Huawei Technol Co Ltd, Hisilicon Res Dept, Shenzhen 518055, Peoples R China
[3] Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Computer architecture; Statistics; Sociology; Search problems; Optimization; Neural networks; Estimation; AutoML; convolutional neural network (CNN); neural architecture search (NAS); population-based search; slow-fast learning; NETWORKS;
D O I
10.1109/TNNLS.2021.3096658
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite the remarkable successes of convolutional neural networks (CNNs) in computer vision, it is time-consuming and error-prone to manually design a CNN. Among various neural architecture search (NAS) methods that are motivated to automate designs of high-performance CNNs, the differentiable NAS and population-based NAS are attracting increasing interests due to their unique characters. To benefit from the merits while overcoming the deficiencies of both, this work proposes a novel NAS method, RelativeNAS. As the key to efficient search, RelativeNAS performs joint learning between fast learners (i.e., decoded networks with relatively lower loss value) and slow learners in a pairwise manner. Moreover, since RelativeNAS only requires low-fidelity performance estimation to distinguish each pair of fast learner and slow learner, it saves certain computation costs for training the candidate architectures. The proposed RelativeNAS brings several unique advantages: 1) it achieves state-of-the-art performances on ImageNet with top-1 error rate of 24.88%, that is, outperforming DARTS and AmoebaNet-B by 1.82% and 1.12%, respectively; 2) it spends only 9 h with a single 1080Ti GPU to obtain the discovered cells, that is, 3.75x and 7875x faster than DARTS and AmoebaNet, respectively; and 3) it provides that the discovered cells obtained on CIFAR-10 can be directly transferred to object detection, semantic segmentation, and keypoint detection, yielding competitive results of 73.1% mAP on PASCAL VOC, 78.7% mIoU on Cityscapes, and 68.5% AP on MSCOCO, respectively. The implementation of RelativeNAS is available at https://github.com/EMI-Group/RelativeNAS.
引用
收藏
页码:475 / 489
页数:15
相关论文
共 50 条
  • [21] Thinking Fast and Slow with Deep Learning and Tree Search
    Anthony, Thomas
    Tian, Zheng
    Barber, David
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [22] Generic Neural Architecture Search via Regression
    Li, Yuhong
    Hao, Cong
    Li, Pan
    Xiong, Jinjun
    Chen, Deming
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [23] On analysis of inputs triggering large nonlinear neural responses Slow-fast dynamics in the Wendling neural mass model
    Hebbink, Jurgen
    van Gils, Stephan A.
    Meijer, Hil G. E.
    COMMUNICATIONS IN NONLINEAR SCIENCE AND NUMERICAL SIMULATION, 2020, 83
  • [24] Neural Architecture Search via Proxy Validation
    Li, Yanxi
    Dong, Minjing
    Wang, Yunhe
    Xu, Chang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (06) : 7595 - 7610
  • [25] Fast Hardware-Aware Neural Architecture Search
    Zhang, Li Lyna
    Yang, Yuqing
    Jiang, Yuhang
    Zhu, Wenwu
    Liu, Yunxin
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 2959 - 2967
  • [26] FLASH: Fast Neural Architecture Search with Hardware Optimization
    Li, Guihong
    Mandal, Sumit K.
    Ogras, Umit Y.
    Marculescu, Radu
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2021, 20 (05)
  • [27] Deep Active Learning with a Neural Architecture Search
    Geifman, Yonatan
    El-Yaniv, Ran
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [28] NASIL: NEURAL ARCHITECTURE SEARCH WITH IMITATION LEARNING
    Fard, Farzaneh S.
    Rad, Arash
    Tomar, Vikrant Singh
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 3972 - 3976
  • [29] Reinforcement learning for neural architecture search: A review
    Jaafra, Yesmina
    Laurent, Jean Luc
    Deruyver, Aline
    Naceur, Mohamed Saber
    IMAGE AND VISION COMPUTING, 2019, 89 : 57 - 66
  • [30] Improving Deep Learning based Optical Character Recognition via Neural Architecture Search
    Zhao, Zhenyao
    Jiang, Min
    Guo, Shihui
    Wang, Zhenzhong
    Chao, Fei
    Tan, Kay Chen
    2020 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2020,