A Large-Scale UAV Audio Dataset and Audio-Based UAV Classification Using CNN

Cited by: 8
Authors
Wang, Yaqin [1]
Chu, Zhiwei [1]
Ku, Ilmun [2]
Smith, E. Cho [1]
Matson, Eric T. [1]
Affiliations
[1] Purdue Univ, W Lafayette, IN 47907 USA
[2] Hankuk Univ Foreign Studies, Seoul, South Korea
Keywords
Drone Audio Dataset; UAV Classification; Machine Learning; Convolutional Neural Network; Parametric Representations
DOI
10.1109/IRC55401.2022.00039
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Subject classification code
081202
Abstract
The increased popularity and accessibility of UAVs may create potential threats. Researchers have been developing UAV detection and classification systems using different methods, including audio-based approaches. However, the number of publicly available UAV audio datasets is limited. To fill this gap, we selected 10 different UAVs, ranging from toy hand drones to Class I drones, and recorded a total of 5215 seconds of audio data generated by the UAVs in flight. To the best of our knowledge, the proposed dataset is the largest audio dataset for UAVs so far. We further implemented a convolutional neural network (CNN) model for 10-class UAV classification and trained the model with the collected data. The overall test accuracy of the trained model is 97.7% and the test loss is 0.085.
Pages: 186-189
Page count: 4