A Comprehensive Benchmark of Deep Learning Libraries on Mobile Devices

Cited by: 26

Authors:

Zhang, Qiyang [1]

Li, Xiang [2]

Che, Xiangying [1]

Ma, Xiao [1]

Zhou, Ao [1]

Xu, Mengwei [1]

Wang, Shangguang [1]

Ma, Yun [3]

Liu, Xuanzhe [3]

Affiliations:

[1] Beijing Univ Posts & Telecommun, Beijing, Peoples R China

[2] China Univ Petr, Beijing, Peoples R China

[3] Peking Univ, Beijing, Peoples R China

Source:

PROCEEDINGS OF THE ACM WEB CONFERENCE 2022 (WWW'22) | 2022

Funding:

National Natural Science Foundation of China; National Key Research and Development Program of China

Keywords:

Benchmark; Deep Learning; Mobile Devices

DOI:

10.1145/3485447.3512148

Chinese Library Classification (CLC) number:

TP3 [Computing Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

Deploying deep learning (DL) on mobile devices has been a notable trend in recent years. To support fast inference of on-device DL, DL libraries play a critical role as algorithms and hardware do. Unfortunately, no prior work ever dives deep into the ecosystem of modern DL libs and provides quantitative results on their performance. In this paper, we first build a comprehensive benchmark that includes 6 representative DL libs and 15 diversified DL models. We then perform extensive experiments on 10 mobile devices, which help reveal a complete landscape of the current mobile DL libs ecosystem. For example, we find that the best-performing DL lib is severely fragmented across different models and hardware, and the gap between those DL libs can be rather huge. In fact, the impacts of DL libs can overwhelm the optimizations from algorithms or hardware, e.g., model quantization and GPU/DSP-based heterogeneous computing. Finally, atop the observations, we summarize practical implications to different roles in the DL lib ecosystem.

Pages: 3298-3307

Page count: 10
