Accurate, Low-latency, Efficient SAR Automatic Target Recognition on FPGA

被引:7
|
作者
Zhang, Bingyi [1 ]
Kannan, Rajgopal [2 ]
Prasanna, Viktor [1 ]
Busart, Carl [2 ]
机构
[1] Univ Southern Calif, Los Angeles, CA 90007 USA
[2] DEVCOM US Army Res Lab, Washington, DC USA
来源
2022 32ND INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, FPL | 2022年
基金
美国国家科学基金会;
关键词
SAR ATR; graph neural network (GNN); hardware architecture; NETWORK;
D O I
10.1109/FPL57034.2022.00013
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Synthetic aperture radar (SAR) automatic target recognition (ATR) is the key technique for remote-sensing image recognition. The state-of-the-art convolutional neural networks (CNNs) for SAR ATR suffer from high computation cost and large memory footprint, making them unsuitable to be deployed on resource-limited platforms, such as small/micro satellites. In this paper, we propose a comprehensive GNN-based model-architecture co-design on FPGA to address the above issues. Model design: we design a novel graph neural network (GNN) for SAR ATR. The proposed GNN model incorporates GraphSAGE layer operators and attention mechanism, achieving comparable accuracy as the state-of-the-art work with near 1/100 computation cost. Then, we propose a pruning approach including weight pruning and input pruning. While weight pruning through lasso regression reduces most parameters without accuracy drop, input pruning eliminates most input pixels with negligible accuracy drop. Architecture design: to fully unleash the computation parallelism within the proposed model, we develop a novel unified hardware architecture that can execute various computation kernels (feature aggregation, feature transformation, graph pooling). The proposed hardware design adopts the Scatter-Gather paradigm to efficiently handle the irregular computation patterns of various computation kernels. We deploy the proposed design on an embedded FPGA (AMD Xilinx ZCU104) and evaluate the performance using MSTAR dataset. Compared with the state-of-the-art CNNs, the proposed GNN achieves comparable accuracy with 1/3258 computation cost and 1/83 model size. Compared with the state-of-the-art CPU/GPU, our FPGA accelerator achieves 14.8x/2.5x speedup (latency) and is 62x/39x more energy efficient.
引用
收藏
页码:1 / 8
页数:8
相关论文
共 50 条
  • [41] Edge Coordinated Query Configuration for Low-Latency and Accurate Video Analytics
    Yang, Peng
    Lyu, Feng
    Wu, Wen
    Zhang, Ning
    Yu, Li
    Shen, Xuemin
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (07) : 4855 - 4864
  • [42] The Moving Pose: An Efficient 3D Kinematics Descriptor for Low-Latency Action Recognition and Detection
    Zanfir, Mihai
    Leordeanu, Marius
    Sminchisescu, Cristian
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 2752 - 2759
  • [43] Low-Latency Neural Network for Efficient Hyperspectral Image Classification
    Li, Chunchao
    Li, Jun
    Peng, Mingrui
    Rasti, Behnood
    Duan, Puhong
    Tang, Xuebin
    Ma, Xiaoguang
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2025, 18 : 7374 - 7390
  • [44] An efficient and low-latency MAC protocol for wireless sensor network
    Gu, Zhichao
    Sun, Jifeng
    MOBILE AD-HOC AND SENSOR NETWORKS, PROCEEDINGS, 2007, 4864 : 209 - +
  • [45] An Efficient and Low-Latency Deep Inertial Odometer for Smartphone Positioning
    Soyer, M. Serhat
    Abdel-Qader, A.
    Onbasli, Mehmet Cengiz
    IEEE SENSORS JOURNAL, 2021, 21 (24) : 27676 - 27685
  • [46] A Low-Latency Algorithm and FPGA Design for the Min-Search of LDPC Decoders
    Tzimpragos, Georgios
    Kachris, Christoforos
    Soudris, Dimitrios
    Tomkos, Ioannis
    PROCEEDINGS OF 2014 IEEE INTERNATIONAL PARALLEL & DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2014, : 269 - 274
  • [47] FPGA-based Low-Latency Audio Coprocessor for Networked Music Performance
    Bert, Diego
    Domini, Nicola
    Peloso, Riccardo
    Severi, Leonardo
    Sacchetto, Matteo
    Bianco, Andrea
    Rottondi, Cristina
    2023 4TH INTERNATIONAL SYMPOSIUM ON THE INTERNET OF SOUNDS, 2023, : 128 - 135
  • [48] GraphAGILE: An FPGA-based Overlay Accelerator for Low-latency GNN Inference
    Zhang, Bingyi
    Zeng, Hanqing
    Prasanna, Viktor
    arXiv, 2023,
  • [49] An FPGA-Based Low-Latency Accelerator for Randomly Wired Neural Networks
    Kuramochi, Ryosuke
    Nakahara, Hiroki
    2020 30TH INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2020, : 298 - 303
  • [50] State-of-the-art of SAR automatic target recognition
    Novak, LM
    RECORD OF THE IEEE 2000 INTERNATIONAL RADAR CONFERENCE, 2000, : 836 - 843