Accurate, Low-latency, Efficient SAR Automatic Target Recognition on FPGA

被引:7
|
作者
Zhang, Bingyi [1 ]
Kannan, Rajgopal [2 ]
Prasanna, Viktor [1 ]
Busart, Carl [2 ]
机构
[1] Univ Southern Calif, Los Angeles, CA 90007 USA
[2] DEVCOM US Army Res Lab, Washington, DC USA
基金
美国国家科学基金会;
关键词
SAR ATR; graph neural network (GNN); hardware architecture; NETWORK;
D O I
10.1109/FPL57034.2022.00013
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Synthetic aperture radar (SAR) automatic target recognition (ATR) is the key technique for remote-sensing image recognition. The state-of-the-art convolutional neural networks (CNNs) for SAR ATR suffer from high computation cost and large memory footprint, making them unsuitable to be deployed on resource-limited platforms, such as small/micro satellites. In this paper, we propose a comprehensive GNN-based model-architecture co-design on FPGA to address the above issues. Model design: we design a novel graph neural network (GNN) for SAR ATR. The proposed GNN model incorporates GraphSAGE layer operators and attention mechanism, achieving comparable accuracy as the state-of-the-art work with near 1/100 computation cost. Then, we propose a pruning approach including weight pruning and input pruning. While weight pruning through lasso regression reduces most parameters without accuracy drop, input pruning eliminates most input pixels with negligible accuracy drop. Architecture design: to fully unleash the computation parallelism within the proposed model, we develop a novel unified hardware architecture that can execute various computation kernels (feature aggregation, feature transformation, graph pooling). The proposed hardware design adopts the Scatter-Gather paradigm to efficiently handle the irregular computation patterns of various computation kernels. We deploy the proposed design on an embedded FPGA (AMD Xilinx ZCU104) and evaluate the performance using MSTAR dataset. Compared with the state-of-the-art CNNs, the proposed GNN achieves comparable accuracy with 1/3258 computation cost and 1/83 model size. Compared with the state-of-the-art CPU/GPU, our FPGA accelerator achieves 14.8x/2.5x speedup (latency) and is 62x/39x more energy efficient.
引用
收藏
页码:1 / 8
页数:8
相关论文
共 50 条
  • [21] Toward Low-Latency and Accurate State Synchronization for Programmable Networks
    Chen, Xiang
    Liu, Hongyan
    Huang, Qun
    Zhang, Dong
    Zhou, Haifeng
    Wu, Chunming
    Liu, Xuan
    Yang, Qiang
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2023, 31 (03) : 1400 - 1415
  • [22] Accelerating GNN-based SAR Automatic Target Recognition on HBM-enabled FPGA
    Zhang, Bingyi
    Kannan, Rajgopal
    Prasanna, Viktor
    Busart, Carl
    2023 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE, HPEC, 2023,
  • [23] Energy-efficient low-latency audio on android
    Balsini, Alessio
    Cucinotta, Tommaso
    Abeni, Luca
    Fernandes, Joel
    Burk, Phil
    Bellasi, Patrick
    Rasmussen, Morten
    JOURNAL OF SYSTEMS AND SOFTWARE, 2019, 152 : 182 - 195
  • [24] Adaptive boosting for SAR automatic target recognition
    Sun, Yijun
    Liu, Zhipeng
    Todorovic, Sinisa
    Li, Jian
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2007, 43 (01) : 112 - 125
  • [25] An Effective Method for SAR Automatic Target Recognition
    Li, Ying
    Gong, Hongli
    ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, PT I, 2010, 6319 : 440 - 447
  • [26] SAR Automatic Target Recognition with Less Labels
    Comer, Joseph F.
    Andrews, Reed W.
    Naderializadeh, Navid
    Kolouri, Soheil
    Hoffmann, Heiko
    AUTOMATIC TARGET RECOGNITION XXX, 2020, 11394
  • [27] Low-latency hand gesture recognition with a low resolution thermal imager
    Vandersteegen, Maarten
    Reusen, Wouter
    Van Beeck, Kristof
    Goedeme, Toon
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 440 - 449
  • [28] Low-latency implementation of coordinate conversion in virtex II pro FPGA
    Jablonski, G.
    Przygoda, K.
    MIXDES 2007: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON MIXED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS:, 2007, : 120 - 123
  • [29] A low-latency LSTM accelerator using balanced sparsity based on FPGA
    Jiang, Jingfei
    Xiao, Tao
    Xu, Jinwei
    Wen, Dong
    Gao, Lei
    Dou, Yong
    MICROPROCESSORS AND MICROSYSTEMS, 2022, 89
  • [30] A Scalable Architecture for Low-Latency Market-Data Processing on FPGA
    Tang, Qiu
    Su, Majing
    Jiang, Lei
    Yang, Jiajia
    Bai, Xu
    2016 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATION (ISCC), 2016, : 597 - 603