UAPT: an underwater acoustic target recognition method based on pre-trained Transformer

Cited by: 0
Authors
Tang, Jun [1]
Ma, Enxue [1]
Qu, Yang [1]
Gao, Wenbo [1]
Zhang, Yuchen [1]
Gan, Lin [2]
Affiliations
[1] Tianjin University, School of Civil Engineering, Tianjin 300072, People's Republic of China
[2] Northwestern Polytechnical University, School of Automation, Xi'an 710072, People's Republic of China
Keywords
Underwater acoustic target recognition; Transformer; Transfer learning; Deep learning; Pre-train; MODEL
DOI
10.1007/s00530-024-01614-3
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
The Convolutional Neural Network (CNN) model in underwater acoustic target recognition (UATR) research reveals limitations arising from its inability to capture long-distance dependencies, impeding its capacity to focus on global information within the underwater acoustic signal. In contrast, the Transformer model has progressively emerged as the optimal choice in various studies, owing to its exclusive dependence on the attention mechanism for extracting global features from input data. Limited research utilizing the Transformer model in UATR has relied on an early ViT model, while in this paper, two refined Transformer models, namely Swin Transformer and Biformer, are adopted as the foundational networks, and a novel Swin Biformer model is proposed by harnessing the strengths of the two. Experimental results demonstrate the consistent superiority of the three models over CNN and ViT in UATR, and the Swin Biformer model remarkably attains the highest recognition accuracy of 94.3% evaluated on a dataset constructed from the Deepship database. At the same time, this paper proposes a UATR method based on pre-trained Transformer, the effectiveness of which is underscored by experimental findings as a recognition accuracy of approximately 97% was achieved on a generalized dataset derived from the Shipsear database. Even with limited data samples and more stringent classification requirements, the method maintains a recognition accuracy of over 90%, all while significantly reducing the training duration.
Pages: 15