A Large-Scale UAV Audio Dataset and Audio-Based UAV Classification Using CNN

Cited by: 8
Authors
Wang, Yaqin [1]
Chu, Zhiwei [1]
Ku, Ilmun [2]
Smith, E. Cho [1]
Matson, Eric T. [1]
Affiliations
[1] Purdue Univ, W Lafayette, IN 47907 USA
[2] Hankuk Univ Foreign Studies, Seoul, South Korea
Keywords
Drone Audio Dataset; UAV Classification; Machine Learning; Convolutional Neural Network; Parametric Representations
DOI
10.1109/IRC55401.2022.00039
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
The growing popularity and accessibility of UAVs may create potential threats. Researchers have been developing UAV detection and classification systems using various methods, including audio-based approaches. However, the number of publicly available UAV audio datasets is limited. To fill this gap, we selected 10 different UAVs, ranging from toy hand drones to Class I drones, and recorded a total of 5215 seconds of audio generated by the UAVs in flight. To the best of our knowledge, the proposed dataset is the largest UAV audio dataset to date. We further implemented a convolutional neural network (CNN) model for 10-class UAV classification and trained it on the collected data. The trained model achieves an overall test accuracy of 97.7% and a test loss of 0.085.
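As a rough illustration of how the two reported metrics relate to a 10-class classifier, the sketch below computes test accuracy and mean loss from per-clip class probabilities. This record does not state which loss function the authors used, so cross-entropy is an assumption here. The helper names `evaluate` and `one_hot_like` are hypothetical, and the probabilities are made-up toy values, not data from the paper.

```python
import math

NUM_CLASSES = 10  # the abstract reports a 10-class UAV classifier


def evaluate(probabilities, labels):
    """Return (accuracy, mean cross-entropy loss) for per-clip class probabilities.

    Cross-entropy is an assumed loss; the abstract only says "test loss".
    """
    correct = 0
    total_loss = 0.0
    for probs, label in zip(probabilities, labels):
        # Predicted class is the index with the highest probability.
        predicted = max(range(len(probs)), key=probs.__getitem__)
        correct += int(predicted == label)
        # Clamp the true-class probability to avoid log(0).
        total_loss += -math.log(max(probs[label], 1e-12))
    n = len(labels)
    return correct / n, total_loss / n


def one_hot_like(label, confidence):
    """Toy probability vector: `confidence` on `label`, the rest spread evenly."""
    rest = (1.0 - confidence) / (NUM_CLASSES - 1)
    return [confidence if i == label else rest for i in range(NUM_CLASSES)]


# Made-up predictions for three clips; the third one is misclassified.
probs = [one_hot_like(0, 0.9), one_hot_like(3, 0.8), one_hot_like(5, 0.6)]
labels = [0, 3, 7]
accuracy, loss = evaluate(probs, labels)
print(f"accuracy={accuracy:.3f}, loss={loss:.3f}")
```

Under this assumption, a low mean loss alongside high accuracy indicates that the model is both mostly correct and confident on the clips it classifies correctly.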
Pages: 186-189
Number of pages: 4