A Comprehensive Benchmark of Deep Learning Libraries on Mobile Devices

被引:26
|
作者
Zhang, Qiyang [1 ]
Li, Xiang [2 ]
Che, Xiangying [1 ]
Ma, Xiao [1 ]
Zhou, Ao [1 ]
Xu, Mengwei [1 ]
Wang, Shangguang [1 ]
Ma, Yun [3 ]
Liu, Xuanzhe [3 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
[2] China Univ Petr, Beijing, Peoples R China
[3] Peking Univ, Beijing, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Benchmark; Deep Learning; Mobile Devices;
D O I
10.1145/3485447.3512148
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Deploying deep learning (DL) on mobile devices has been a notable trend in recent years. To support fast inference of on-device DL, DL libraries play a critical role as algorithms and hardware do. Unfortunately, no prior work ever dives deep into the ecosystem of modern DL libs and provides quantitative results on their performance. In this paper, we first build a comprehensive benchmark that includes 6 representative DL libs and 15 diversified DL models. We then perform extensive experiments on 10 mobile devices, which help reveal a complete landscape of the current mobile DL libs ecosystem. For example, we find that the best-performing DL lib is severely fragmented across different models and hardware, and the gap between those DL libs can be rather huge. In fact, the impacts of DL libs can overwhelm the optimizations from algorithms or hardware, e.g., model quantization and GPU/DSP-based heterogeneous computing. Finally, atop the observations, we summarize practical implications to different roles in the DL lib ecosystem.
引用
收藏
页码:3298 / 3307
页数:10
相关论文
共 50 条
  • [11] Deep learning for source camera identification on mobile devices
    Freire-Obregon, David
    Narducci, Fabio
    Barra, Silvio
    Castrillon-Santana, Modesto
    PATTERN RECOGNITION LETTERS, 2019, 126 : 86 - 91
  • [12] Delivering Deep Learning to Mobile Devices via Offloading
    Ran, Xukan
    Chen, Haoliang
    Liu, Zhenming
    Chen, Jiasi
    VR/AR NETWORK '17: PROCEEDINGS OF THE 2017 WORKSHOP ON VIRTUAL REALITY AND AUGMENTED REALITY NETWORK, 2017, : 42 - 47
  • [13] Exploring the Capabilities of Mobile Devices Supporting Deep Learning
    Chen, Yitao
    Biookaghazadeh, Saman
    Zhao, Ming
    HPDC '18: PROCEEDINGS OF THE 27TH INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE PARALLEL AND DISTRIBUTED COMPUTING: POSTERS/DOCTORAL CONSORTIUM, 2018, : 17 - 18
  • [14] Exploring the Capabilities of Mobile Devices in Supporting Deep Learning
    Chen, Yitao
    Biookaghazadeh, Saman
    Zhao, Ming
    SEC'19: PROCEEDINGS OF THE 4TH ACM/IEEE SYMPOSIUM ON EDGE COMPUTING, 2019, : 127 - 138
  • [15] Comprehensive benchmark and architectural analysis of deep learning models for nanopore sequencing basecalling
    Marc Pagès-Gallego
    Jeroen de Ridder
    Genome Biology, 24
  • [16] Comprehensive benchmark and architectural analysis of deep learning models for nanopore sequencing basecalling
    Pages-Gallego, Marc
    de Ridder, Jeroen
    GENOME BIOLOGY, 2023, 24 (01)
  • [17] A Benchmark of Deep Learning Models for Multi-leaf Diseases for Edge Devices
    Pham Tuan Anh
    Hoang Trong Minh Duc
    2021 INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR COMMUNICATIONS (ATC 2021), 2021, : 318 - 323
  • [18] A Power Consumption Benchmark for Reasoners on Mobile Devices
    Patton, Evan W.
    McGuinness, Deborah L.
    SEMANTIC WEB - ISWC 2014, PT I, 2014, 8796 : 409 - 424
  • [19] A Benchmark for ML Inference Latency on Mobile Devices
    Li, Zhuojin
    Paolieri, Marco
    Golubchik, Leana
    7TH INTERNATIONAL WORKSHOP ON EDGE SYSTEMS, ANALYTICS AND NETWORKING, EDGESYS 2024, 2024, : 31 - 36
  • [20] ARBench: Augmented Reality Benchmark For Mobile Devices
    Chetoui, Sofiane
    Shahi, Rahul
    Abdelaziz, Seif
    Golas, Abhinav
    Hijaz, Farrukh
    Reda, Sherief
    2022 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE (ISPASS 2022), 2022, : 242 - 244