One Proxy Device Is Enough for Hardware-Aware Neural Architecture Search

被引:9
|
作者
Lu, Bingqian [1 ]
Yang, Jianyi [1 ]
Jiang, Weiwen [2 ]
Shi, Yiyu [3 ]
Ren, Shaolei [1 ]
机构
[1] Univ Calif Riverside, 900 Univ Ave, Riverside, CA 92521 USA
[2] George Mason Univ, 4400 Univ Dr, Fairfax, VA 22030 USA
[3] Univ Notre Dame, 257 Fitzpatrick Hall, Notre Dame, IN 46556 USA
关键词
Neural Architecture Search; Hardware-Aware; Scalability; AutoML;
D O I
10.1145/3491046
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Convolutional neural networks (CNNs) are used in numerous real-world applications such as vision-based autonomous driving and video content analysis. To run CNN inference on various target devices, hardwareaware neural architecture search (NAS) is crucial. A key requirement of efficient hardware-aware NAS is the fast evaluation of inference latencies in order to rank different architectures. While building a latency predictor for each target device has been commonly used in state of the art, this is a very time-consuming process, lacking scalability in the presence of extremely diverse devices. In this work, we address the scalability challenge by exploiting latency monotonicity - the architecture latency rankings on different devices are often correlated. When strong latency monotonicity exists, we can re-use architectures searched for one proxy device on new target devices, without losing optimality. In the absence of strong latency monotonicity, we propose an efficient proxy adaptation technique to significantly boost the latency monotonicity. Finally, we validate our approach and conduct experiments with devices of different platforms on multiple mainstream search spaces, including MobileNet-V2, MobileNet-V3, NAS-Bench-201, ProxylessNAS and FBNet. Our results highlight that, by using just one proxy device, we can find almost the same Pareto-optimal architectures as the existing per-device NAS, while avoiding the prohibitive cost of building a latency predictor for each device.
引用
收藏
页数:34
相关论文
共 50 条
  • [41] A Hardware-Aware Sampling Parameter Search for Efficient Probabilistic Object Detection
    Hoefer, Julian
    Hotfilter, Tim
    Kress, Fabian
    Qiu, Chen
    Harbaum, Tanja
    Becker, Juergen
    COMPUTER VISION SYSTEMS, ICVS 2023, 2023, 14253 : 299 - 309
  • [42] APNAS: Accuracy-and-Performance-Aware Neural Architecture Search for Neural Hardware Accelerators
    Achararit, Paniti
    Hanif, Muhammad Abdullah
    Putra, Rachmad Vidya Wicaksana
    Shafique, Muhammad
    Hara-Azumi, Yuko
    IEEE ACCESS, 2020, 8 : 165319 - 165334
  • [43] Noise-Tolerant Hardware-Aware Pruning for Deep Neural Networks
    Lu, Shun
    Chen, Cheng
    Zhang, Kunlong
    Zheng, Yang
    Hu, Zheng
    Hong, Wenjing
    Li, Guiying
    Yao, Xin
    ADVANCES IN SWARM INTELLIGENCE, ICSI 2023, PT II, 2023, 13969 : 127 - 138
  • [44] Hardware-Aware Model of Sigma-Delta Cellular Neural Network
    Aomori, Hisashi
    Naito, Yuki
    Otake, Tsuyoshi
    Takahashi, Nobuaki
    Matsuda, Ichiro
    Itoh, Susumu
    Tanaka, Mamoru
    2009 EUROPEAN CONFERENCE ON CIRCUIT THEORY AND DESIGN, VOLS 1 AND 2, 2009, : 311 - +
  • [45] Multi-Objective Hardware Aware Neural Architecture Search using Hardware Cost Diversity
    Sinha, Nilotpal
    Rostami, Peyman
    Shabayek, Abd El Rahman
    Kacem, Anis
    Aouada, Djamila
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW, 2024, : 8032 - 8039
  • [46] Hardware-Aware Evolutionary Explainable Filter Pruning for Convolutional Neural Networks
    Heidorn, Christian
    Sabih, Muhammad
    Meyerhoefer, Nicolai
    Schinabeck, Christian
    Teich, Juergen
    Hannig, Frank
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2024, 52 (1-2) : 40 - 58
  • [47] A Study on Hardware-Aware Training Techniques for Feedforward Artificial Neural Networks
    Parvin, Sajjad
    Altun, Mustafa
    2021 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2021), 2021, : 406 - 411
  • [48] Hardware-Aware Multi-Objective Neural Architecture Search Approach; [基于硬件感知的多目标神经结构搜索方法]
    Xu K.
    Meng Y.
    Yang S.-S.
    Tian Y.
    Zhang X.-Y.
    Jisuanji Xuebao/Chinese Journal of Computers, 2023, 46 (12): : 2652 - 2669
  • [49] Quantized rewiring: hardware-aware training of sparse deep neural networks
    Petschenig, Horst
    Legenstein, Robert
    NEUROMORPHIC COMPUTING AND ENGINEERING, 2023, 3 (02):
  • [50] Hardware-Aware Evolutionary Explainable Filter Pruning for Convolutional Neural Networks
    Christian Heidorn
    Muhammad Sabih
    Nicolai Meyerhöfer
    Christian Schinabeck
    Jürgen Teich
    Frank Hannig
    International Journal of Parallel Programming, 2024, 52 : 40 - 58