Oceanship: A Large-Scale Dataset for Underwater Audio Target Recognition

被引:0
|
作者
Li, Zeyu [1 ,2 ]
Xiang, Suncheng [1 ,3 ]
Yu, Tong [1 ,2 ]
Gao, Jingsheng [1 ,2 ]
Ruan, Jiacheng [1 ,2 ]
Hu, Yanping [1 ,2 ]
Liu, Ting [1 ,2 ]
Fu, Yuzhuo [1 ,2 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai 200240, Peoples R China
[2] Sch Elect Informat & Elect Engn, Shanghai, Peoples R China
[3] Sch Biomed Engn, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
Underwater Acoustic Target Recognition; Audio Retrieval; Zero-Shot Classification;
D O I
10.1007/978-981-97-5591-2_40
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The recognition of underwater audio plays a significant role in identifying a vessel while it is in motion. Underwater target recognition tasks have a wide range of applications in areas such as marine environmental protection, detection of ship radiated noise, underwater noise control, and coastal vessel dispatch. The traditional UATR task involves training a network to extract features from audio data and predict the vessel type. The current UATR dataset exhibits shortcomings in both duration and sample quantity. In this paper, we propose Oceanship, a large-scale and diverse underwater audio dataset. This dataset comprises 15 categories, spans a total duration of 121 h, and includes comprehensive annotation information such as coordinates, velocity, vessel types, and timestamps. We compiled the dataset by crawling and organizing original communication data from the Ocean Communication Network (ONC) database between 2021 and 2022. While audio retrieval tasks are well-established in general audio classification, they have not been explored in the context of underwater audio recognition. Leveraging the Oceanship dataset, we introduce a baseline model named Oceannet for underwater audio retrieval. This model achieves a recall at 1 (R@1) accuracy of 67.11% and a recall at 5 (R@5) accuracy of 99.13% on the Deepship dataset.
引用
收藏
页码:475 / 486
页数:12
相关论文
共 50 条
  • [21] Training Convolutional Neural Network for Sketch Recognition on Large-Scale Dataset
    Zhou, Wen
    Jia, Jinyuan
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2020, 17 (01) : 82 - 89
  • [22] A Large-Scale Benchmark Dataset for Anomaly Detection and Rare Event Classification for Audio Forensics
    Abbasi, Ahmed
    Javed, Abdul Rehman Rehman
    Yasin, Amanullah
    Jalil, Zunera
    Kryvinska, Natalia
    Tariq, Usman
    IEEE ACCESS, 2022, 10 : 38885 - 38894
  • [23] A large-scale and PCR-referenced vocal audio dataset for COVID-19
    Budd, Jobie
    Baker, Kieran
    Karoune, Emma
    Coppock, Harry
    Patel, Selina
    Payne, Richard
    Tendero Canadas, Ana
    Titcomb, Alexander
    Hurley, David
    Egglestone, Sabrina
    Butler, Lorraine
    Mellor, Jonathon
    Nicholson, George
    Kiskin, Ivan
    Koutra, Vasiliki
    Jersakova, Radka
    Mckendry, Rachel A.
    Diggle, Peter
    Richardson, Sylvia
    Schuller, Bjoern W.
    Gilmour, Steven
    Pigoli, Davide
    Roberts, Stephen
    Packham, Josef
    Thornley, Tracey
    Holmes, Chris
    SCIENTIFIC DATA, 2024, 11 (01)
  • [24] Large-scale underwater fish recognition via deep adversarial learning
    Zhang, Zhixue
    Du, Xiujuan
    Jin, Long
    Wang, Shuqiao
    Wang, Lijuan
    Liu, Xiuxiu
    KNOWLEDGE AND INFORMATION SYSTEMS, 2022, 64 (02) : 353 - 379
  • [25] Large-scale underwater fish recognition via deep adversarial learning
    Zhixue Zhang
    Xiujuan Du
    Long Jin
    Shuqiao Wang
    Lijuan Wang
    Xiuxiu Liu
    Knowledge and Information Systems, 2022, 64 : 353 - 379
  • [26] Recognition technology of large-scale hybrid-distortion target
    Ren, Bing
    Ma, Jun
    Lan, Lan
    Wang, Wen-Sheng
    Ren, Bing, 1600, Chinese Optical Society (43):
  • [27] DMDD: A Large-Scale Dataset for Dataset Mentions Detection
    Pan, Huitong
    Zhang, Qi
    Dragut, Eduard
    Caragea, Cornelia
    Latecki, Longin Jan
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2023, 11 : 1132 - 1146
  • [28] Products-6K: A Large-Scale Groceries Product Recognition Dataset
    Georgiadis, Kostas
    Kordopatis-Zilos, Giorgos
    Kalaganis, Fotis P.
    Migkotzidis, Panagiotis
    Chatzilari, Elisavet
    Panakidou, Valasia
    Pantouvakis, Kyriakos
    Tortopidis, Savvas
    Papadopoulos, Symeon
    Nikolopoulos, Spiros
    Kompatsiaris, Ioannis
    THE 14TH ACM INTERNATIONAL CONFERENCE ON PERVASIVE TECHNOLOGIES RELATED TO ASSISTIVE ENVIRONMENTS, PETRA 2021, 2021, : 1 - 7
  • [29] I-Nema: a large-scale microscopic image dataset for nematode recognition
    Shenglin Lu
    Sheldon Fung
    Yihao Wang
    Xuequan Lu
    Wanli Ouyang
    Xue Qing
    Hongmei Li
    Neural Computing and Applications, 2025, 37 (4) : 2763 - 2773
  • [30] IP102: A Large-Scale Benchmark Dataset for Insect Pest Recognition
    Wu, Xiaoping
    Zhan, Chi
    Lai, Yu-Kun
    Cheng, Ming-Ming
    Yang, Jufeng
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 8779 - 8788