Accurate, Low-latency, Efficient SAR Automatic Target Recognition on FPGA

被引：7

作者：

Zhang, Bingyi ^{[1
]}

Kannan, Rajgopal ^{[2
]}

Prasanna, Viktor ^{[1
]}

Busart, Carl ^{[2
]}

机构：

[1] Univ Southern Calif, Los Angeles, CA 90007 USA

[2] DEVCOM US Army Res Lab, Washington, DC USA

来源：

2022 32ND INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, FPL | 2022年

基金：

美国国家科学基金会;

关键词：

SAR ATR; graph neural network (GNN); hardware architecture; NETWORK;

D O I：

10.1109/FPL57034.2022.00013

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Synthetic aperture radar (SAR) automatic target recognition (ATR) is the key technique for remote-sensing image recognition. The state-of-the-art convolutional neural networks (CNNs) for SAR ATR suffer from high computation cost and large memory footprint, making them unsuitable to be deployed on resource-limited platforms, such as small/micro satellites. In this paper, we propose a comprehensive GNN-based model-architecture co-design on FPGA to address the above issues. Model design: we design a novel graph neural network (GNN) for SAR ATR. The proposed GNN model incorporates GraphSAGE layer operators and attention mechanism, achieving comparable accuracy as the state-of-the-art work with near 1/100 computation cost. Then, we propose a pruning approach including weight pruning and input pruning. While weight pruning through lasso regression reduces most parameters without accuracy drop, input pruning eliminates most input pixels with negligible accuracy drop. Architecture design: to fully unleash the computation parallelism within the proposed model, we develop a novel unified hardware architecture that can execute various computation kernels (feature aggregation, feature transformation, graph pooling). The proposed hardware design adopts the Scatter-Gather paradigm to efficiently handle the irregular computation patterns of various computation kernels. We deploy the proposed design on an embedded FPGA (AMD Xilinx ZCU104) and evaluate the performance using MSTAR dataset. Compared with the state-of-the-art CNNs, the proposed GNN achieves comparable accuracy with 1/3258 computation cost and 1/83 model size. Compared with the state-of-the-art CPU/GPU, our FPGA accelerator achieves 14.8x/2.5x speedup (latency) and is 62x/39x more energy efficient.

引用

页码：1 / 8

页数：8

共 50 条

[41] Edge Coordinated Query Configuration for Low-Latency and Accurate Video Analytics
Yang, Peng
Lyu, Feng
Wu, Wen
Zhang, Ning
Yu, Li
Shen, Xuemin
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (07) : 4855 - 4864
[42] The Moving Pose: An Efficient 3D Kinematics Descriptor for Low-Latency Action Recognition and Detection
Zanfir, Mihai
Leordeanu, Marius
Sminchisescu, Cristian
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 2752 - 2759
[43] Low-Latency Neural Network for Efficient Hyperspectral Image Classification
Li, Chunchao
Li, Jun
Peng, Mingrui
Rasti, Behnood
Duan, Puhong
Tang, Xuebin
Ma, Xiaoguang
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2025, 18 : 7374 - 7390
[44] An efficient and low-latency MAC protocol for wireless sensor network
Gu, Zhichao
Sun, Jifeng
MOBILE AD-HOC AND SENSOR NETWORKS, PROCEEDINGS, 2007, 4864 : 209 - +
[45] An Efficient and Low-Latency Deep Inertial Odometer for Smartphone Positioning
Soyer, M. Serhat
Abdel-Qader, A.
Onbasli, Mehmet Cengiz
IEEE SENSORS JOURNAL, 2021, 21 (24) : 27676 - 27685
[46] A Low-Latency Algorithm and FPGA Design for the Min-Search of LDPC Decoders
Tzimpragos, Georgios
Kachris, Christoforos
Soudris, Dimitrios
Tomkos, Ioannis
PROCEEDINGS OF 2014 IEEE INTERNATIONAL PARALLEL & DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2014, : 269 - 274
[47] FPGA-based Low-Latency Audio Coprocessor for Networked Music Performance
Bert, Diego
Domini, Nicola
Peloso, Riccardo
Severi, Leonardo
Sacchetto, Matteo
Bianco, Andrea
Rottondi, Cristina
2023 4TH INTERNATIONAL SYMPOSIUM ON THE INTERNET OF SOUNDS, 2023, : 128 - 135
[48] GraphAGILE: An FPGA-based Overlay Accelerator for Low-latency GNN Inference
Zhang, Bingyi
Zeng, Hanqing
Prasanna, Viktor
arXiv, 2023,
[49] An FPGA-Based Low-Latency Accelerator for Randomly Wired Neural Networks
Kuramochi, Ryosuke
Nakahara, Hiroki
2020 30TH INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2020, : 298 - 303
[50] State-of-the-art of SAR automatic target recognition
Novak, LM
RECORD OF THE IEEE 2000 INTERNATIONAL RADAR CONFERENCE, 2000, : 836 - 843

← 1 2 3 4 5 →