One Proxy Device Is Enough for Hardware-Aware Neural Architecture Search

被引：9

作者：

Lu, Bingqian ^{[1
]}

Yang, Jianyi ^{[1
]}

Jiang, Weiwen ^{[2
]}

Shi, Yiyu ^{[3
]}

Ren, Shaolei ^{[1
]}

机构：

[1] Univ Calif Riverside, 900 Univ Ave, Riverside, CA 92521 USA

[2] George Mason Univ, 4400 Univ Dr, Fairfax, VA 22030 USA

[3] Univ Notre Dame, 257 Fitzpatrick Hall, Notre Dame, IN 46556 USA

来源：

PROCEEDINGS OF THE ACM ON MEASUREMENT AND ANALYSIS OF COMPUTING SYSTEMS | 2021年 / 5卷 / 03期

关键词：

Neural Architecture Search; Hardware-Aware; Scalability; AutoML;

D O I：

10.1145/3491046

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Convolutional neural networks (CNNs) are used in numerous real-world applications such as vision-based autonomous driving and video content analysis. To run CNN inference on various target devices, hardwareaware neural architecture search (NAS) is crucial. A key requirement of efficient hardware-aware NAS is the fast evaluation of inference latencies in order to rank different architectures. While building a latency predictor for each target device has been commonly used in state of the art, this is a very time-consuming process, lacking scalability in the presence of extremely diverse devices. In this work, we address the scalability challenge by exploiting latency monotonicity - the architecture latency rankings on different devices are often correlated. When strong latency monotonicity exists, we can re-use architectures searched for one proxy device on new target devices, without losing optimality. In the absence of strong latency monotonicity, we propose an efficient proxy adaptation technique to significantly boost the latency monotonicity. Finally, we validate our approach and conduct experiments with devices of different platforms on multiple mainstream search spaces, including MobileNet-V2, MobileNet-V3, NAS-Bench-201, ProxylessNAS and FBNet. Our results highlight that, by using just one proxy device, we can find almost the same Pareto-optimal architectures as the existing per-device NAS, while avoiding the prohibitive cost of building a latency predictor for each device.

引用

页数：34

共 50 条

[41] A Hardware-Aware Sampling Parameter Search for Efficient Probabilistic Object Detection
Hoefer, Julian
Hotfilter, Tim
Kress, Fabian
Qiu, Chen
Harbaum, Tanja
Becker, Juergen
COMPUTER VISION SYSTEMS, ICVS 2023, 2023, 14253 : 299 - 309
[42] APNAS: Accuracy-and-Performance-Aware Neural Architecture Search for Neural Hardware Accelerators
Achararit, Paniti
Hanif, Muhammad Abdullah
Putra, Rachmad Vidya Wicaksana
Shafique, Muhammad
Hara-Azumi, Yuko
IEEE ACCESS, 2020, 8 : 165319 - 165334
[43] Noise-Tolerant Hardware-Aware Pruning for Deep Neural Networks
Lu, Shun
Chen, Cheng
Zhang, Kunlong
Zheng, Yang
Hu, Zheng
Hong, Wenjing
Li, Guiying
Yao, Xin
ADVANCES IN SWARM INTELLIGENCE, ICSI 2023, PT II, 2023, 13969 : 127 - 138
[44] Hardware-Aware Model of Sigma-Delta Cellular Neural Network
Aomori, Hisashi
Naito, Yuki
Otake, Tsuyoshi
Takahashi, Nobuaki
Matsuda, Ichiro
Itoh, Susumu
Tanaka, Mamoru
2009 EUROPEAN CONFERENCE ON CIRCUIT THEORY AND DESIGN, VOLS 1 AND 2, 2009, : 311 - +
[45] Multi-Objective Hardware Aware Neural Architecture Search using Hardware Cost Diversity
Sinha, Nilotpal
Rostami, Peyman
Shabayek, Abd El Rahman
Kacem, Anis
Aouada, Djamila
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW, 2024, : 8032 - 8039
[46] Hardware-Aware Evolutionary Explainable Filter Pruning for Convolutional Neural Networks
Heidorn, Christian
Sabih, Muhammad
Meyerhoefer, Nicolai
Schinabeck, Christian
Teich, Juergen
Hannig, Frank
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2024, 52 (1-2) : 40 - 58
[47] A Study on Hardware-Aware Training Techniques for Feedforward Artificial Neural Networks
Parvin, Sajjad
Altun, Mustafa
2021 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2021), 2021, : 406 - 411
[48] Hardware-Aware Multi-Objective Neural Architecture Search Approach; [基于硬件感知的多目标神经结构搜索方法]
Xu K.
Meng Y.
Yang S.-S.
Tian Y.
Zhang X.-Y.
Jisuanji Xuebao/Chinese Journal of Computers, 2023, 46 (12): : 2652 - 2669
[49] Quantized rewiring: hardware-aware training of sparse deep neural networks
Petschenig, Horst
Legenstein, Robert
NEUROMORPHIC COMPUTING AND ENGINEERING, 2023, 3 (02):
[50] Hardware-Aware Evolutionary Explainable Filter Pruning for Convolutional Neural Networks
Christian Heidorn
Muhammad Sabih
Nicolai Meyerhöfer
Christian Schinabeck
Jürgen Teich
Frank Hannig
International Journal of Parallel Programming, 2024, 52 : 40 - 58

← 1 2 3 4 5 →