Oceanship: A Large-Scale Dataset for Underwater Audio Target Recognition

被引:0
|
作者
Li, Zeyu [1 ,2 ]
Xiang, Suncheng [1 ,3 ]
Yu, Tong [1 ,2 ]
Gao, Jingsheng [1 ,2 ]
Ruan, Jiacheng [1 ,2 ]
Hu, Yanping [1 ,2 ]
Liu, Ting [1 ,2 ]
Fu, Yuzhuo [1 ,2 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai 200240, Peoples R China
[2] Sch Elect Informat & Elect Engn, Shanghai, Peoples R China
[3] Sch Biomed Engn, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
Underwater Acoustic Target Recognition; Audio Retrieval; Zero-Shot Classification;
D O I
10.1007/978-981-97-5591-2_40
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The recognition of underwater audio plays a significant role in identifying a vessel while it is in motion. Underwater target recognition tasks have a wide range of applications in areas such as marine environmental protection, detection of ship radiated noise, underwater noise control, and coastal vessel dispatch. The traditional UATR task involves training a network to extract features from audio data and predict the vessel type. The current UATR dataset exhibits shortcomings in both duration and sample quantity. In this paper, we propose Oceanship, a large-scale and diverse underwater audio dataset. This dataset comprises 15 categories, spans a total duration of 121 h, and includes comprehensive annotation information such as coordinates, velocity, vessel types, and timestamps. We compiled the dataset by crawling and organizing original communication data from the Ocean Communication Network (ONC) database between 2021 and 2022. While audio retrieval tasks are well-established in general audio classification, they have not been explored in the context of underwater audio recognition. Leveraging the Oceanship dataset, we introduce a baseline model named Oceannet for underwater audio retrieval. This model achieves a recall at 1 (R@1) accuracy of 67.11% and a recall at 5 (R@5) accuracy of 99.13% on the Deepship dataset.
引用
收藏
页码:475 / 486
页数:12
相关论文
共 50 条
  • [31] SER30K: A Large-Scale Dataset for Sticker Emotion Recognition
    Liu, Shengzhe
    Zhang, Xin
    Yang, Jufeng
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022,
  • [32] MultiScene: A Large-Scale Dataset and Benchmark for Multiscene Recognition in Single Aerial Images
    Hua, Yuansheng
    Mou, Lichao
    Jin, Pu
    Zhu, Xiao Xiang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [33] Large-scale RDF Dataset Slicing
    Marx, Edgard
    Shekarpour, Saeedeh
    Auer, Soeren
    Ngomo, Axel-Cyrille Ngonga
    2013 IEEE SEVENTH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2013), 2013, : 228 - 235
  • [34] Euler Clustering on Large-scale Dataset
    Wu, Jian-Sheng
    Zheng, Wei-Shi
    Lai, Jian-Huang
    Suen, Ching Y.
    IEEE TRANSACTIONS ON BIG DATA, 2018, 4 (04) : 502 - 515
  • [35] Experimental Study on Echo Characteristics of Multistatic Large-Scale Underwater Complex Target
    Zhu Ling-Guo
    Zhao An-Bang
    Liu Wen-zhang
    Han Jing
    Wang Shu-tao
    Ma Zhong-cheng
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2018, 2018
  • [36] MAVD: The First Open Large-Scale Mandarin Audio-Visual Dataset with Depth Information
    Wang, Jianrong
    Huo, Yuchen
    Liu, Li
    Xu, Tianyi
    Li, Qi
    Li, Sen
    INTERSPEECH 2023, 2023, : 2113 - 2117
  • [37] The Jester Dataset: A Large-Scale Video Dataset of Human Gestures
    Materzynska, Joanna
    Berger, Guillaume
    Bax, Ingo
    Memisevic, Roland
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 2874 - 2882
  • [38] FishNet: A Large-scale Dataset and Benchmark for Fish Recognition, Detection, and Functional Trait Prediction
    Khan, Faizan Farooq
    Li, Xiang
    Temple, Andrew J.
    Elhoseiny, Mohamed
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 20439 - 20449
  • [39] EMS: A Large-Scale Eye Movement Dataset, Benchmark, and New Model for Schizophrenia Recognition
    Song, Yingjie
    Liu, Zhi
    Li, Gongyang
    Xie, Jiawei
    Wu, Qiang
    Zeng, Dan
    Xu, Lihua
    Zhang, Tianhong
    Wang, Jijun
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [40] Palmprint and Palmvein Recognition Based on DCNN and A New Large-Scale Contactless Palmvein Dataset
    Zhang, Lin
    Cheng, Zaixi
    Shen, Ying
    Wang, Dongqing
    SYMMETRY-BASEL, 2018, 10 (04):