A Low-Cost Neural ODE with Depthwise Separable Convolution for Edge Domain Adaptation on FPGAs

被引:1
|
作者
Kawakami, Hiroki [1 ]
Watanabe, Hirohisa [1 ]
Sugiura, Keisuke [1 ]
Matsutani, Hiroki [1 ]
机构
[1] Keio Univ, Grad Sch Sci & Technol, Yokohama 2238522, Japan
关键词
domain adaptation; neural ODE; distillation; FPGA; edge device;
D O I
10.1587/transinf.2022EDP7149
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
High-performance deep neural network (DNN)-based systems are in high demand in edge environments. Due to its high com-putational complexity, it is challenging to deploy DNNs on edge devices with strict limitations on computational resources. In this paper, we derive a compact while highly-accurate DNN model, termed dsODENet, by com-bining recently-proposed parameter reduction techniques: Neural ODE (Ordinary Differential Equation) and DSC (Depthwise Separable Convo-lution). Neural ODE exploits a similarity between ResNet and ODE, and shares most of weight parameters among multiple layers, which greatly reduces the memory consumption. We apply dsODENet to a domain adap-tation as a practical use case with image classification datasets. We also propose a resource-efficient FPGA-based design for dsODENet, where all the parameters and feature maps except for pre-and post-processing lay-ers can be mapped onto on-chip memories. It is implemented on Xilinx ZCU104 board and evaluated in terms of domain adaptation accuracy, in-ference speed, FPGA resource utilization, and speedup rate compared to a software counterpart. The results demonstrate that dsODENet achieves comparable or slightly better domain adaptation accuracy compared to our baseline Neural ODE implementation, while the total parameter size with-out pre-and post-processing layers is reduced by 54.2% to 79.8%. Our FPGA implementation accelerates the inference speed by 23.8 times.
引用
收藏
页码:1186 / 1197
页数:12
相关论文
共 50 条
  • [21] A Depthwise Separable Convolution Neural Network for Small-footprint Keyword Spotting Using Approximate MAC Unit and Streaming Convolution Reuse
    Lu, Yicheng
    Shan, Weiwei
    Xu, Jiaming
    2019 IEEE ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS (APCCAS 2019), 2019, : 309 - 312
  • [22] Parallel Computing with Low-Cost FPGAs: A Framework for COPACOBANA
    Gueneysu, Tim
    Paar, Christof
    Pelzl, Jan
    Pfeiffer, Gerd
    Schimmler, Manfred
    Schleiffer, Christian
    PARALLEL COMPUTING: ARCHITECTURES, ALGORITHMS AND APPLICATIONS, 2008, 15 : 741 - +
  • [23] Energy Efficient Loop Unrolling for Low-Cost FPGAs
    Dumpala, Naveen Kumar
    Patil, Shivukumar B.
    Holcomb, Daniel
    Tessier, Russell
    2017 IEEE 25TH ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM 2017), 2017, : 117 - 120
  • [24] Low-Cost Online Convolution Checksum Checker
    Filippas, Dionysios
    Margomenos, Nikolaos
    Mitianoudis, Nikolaos
    Nicopoulos, Chrysostomos
    Dimitrakopoulos, Giorgos
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2022, 30 (02) : 201 - 212
  • [25] DeepFuseOSV: online signature verification using hybrid feature fusion and depthwise separable convolution neural network architecture
    Vorugunti, Chandra Sekhar
    Pulabaigari, Viswanath
    Mukherjee, Prerana
    Sharma, Abhishek
    IET BIOMETRICS, 2020, 9 (06) : 259 - 268
  • [26] A Low-Cost Laser Welding Monitoring Framework Based on Depth-Wise Separable Convolution with Photoelectric Signals
    Cheng, Wenhao
    Zhang, Yanxi
    Gao, Xiangdong
    Pocorni, Jetro Kenneth
    Jiang, Xiaoming
    INTERNATIONAL JOURNAL OF PRECISION ENGINEERING AND MANUFACTURING, 2025, 26 (01) : 1 - 15
  • [27] High-speed and High-accurate SAR Ship Detection Based on a Depthwise Separable Convolution Neural Network
    Zhang X.
    Zhang T.
    Shi J.
    Wei S.
    Journal of Radars, 2019, 8 (06) : 841 - 851
  • [28] A Low-Cost Video Analytics System with Velocity Based Configuration Adaptation in Edge Computing
    Kim, Woo-Joong
    Youn, Chan-Hyun
    ADVANCES IN ARTIFICIAL INTELLIGENCE AND APPLIED COGNITIVE COMPUTING, 2021, : 667 - 677
  • [29] Computer aided decision support system for mitral valve diagnosis and classification using depthwise separable convolution neural network
    A. Anbarasi
    S. Ravi
    J. Vaishnavi
    S. V. Suresh Babu Matla
    Multimedia Tools and Applications, 2021, 80 : 21409 - 21424
  • [30] Low-cost Technique for Measuring Clock Duty Cycle on FPGAs
    Lee, Seongkwan
    Kim, Taehwan
    2018 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2018,