Multi-scale semantic feature fusion and data augmentation for acoustic scene classification

Cited by: 23
Authors
Yang, Liping [1]
Tao, Lianjie [1]
Chen, Xinxing [1]
Gu, Xiaohua [2]
Affiliations
[1] Chongqing Univ, MOE, Key Lab Optoelect Technol & Syst, Chongqing 400044, Peoples R China
[2] Chongqing Univ Sci & Technol, Sch Elect Engn, Chongqing 401331, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Multi-scale feature learning; Convolutional neural networks; Data augmentation; Acoustic scene classification (ASC); Machine listening; NETWORKS;
DOI
10.1016/j.apacoust.2020.107238
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
This paper investigates a multi-scale semantic feature fusion and data augmentation approach for deep convolutional neural network (CNN) based acoustic scene classification. To combine the multi-scale semantic information of a CNN and improve the performance of acoustic scene classification, a multi-scale feature fusion framework, which consists of a simplified Xception backbone and a semantic feature fusion strategy, is presented. A novel label smoothing mixup data augmentation method, which is a generalization of mixup and label smoothing, is proposed to alleviate the overconfidence problem in network training. A spatial-mixup technique is presented to generate meaningful mixup virtual data for acoustic scene classification. Extensive experiments on synthetic data and a real acoustic scene classification dataset demonstrate that both multi-scale semantic feature fusion and label smoothing spatial-mixup data augmentation are effective for improving the acoustic scene classification performance of a deep neural network. © 2020 Elsevier Ltd. All rights reserved.
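As a rough illustration of how mixup and label smoothing can be combined, the following is a minimal sketch only. It applies standard label smoothing to one-hot targets and then standard mixup to a batch; it is not the paper's exact formulation, which this record does not give, and it does not attempt the spatial-mixup variant. The function names and the parameters `alpha` (Beta distribution shape) and `eps` (smoothing strength) are assumptions made for illustration.

```python
import random


def smooth_labels(one_hot, eps):
    """Standard label smoothing: move a fraction eps of the mass to a uniform distribution."""
    k = len(one_hot)
    return [(1.0 - eps) * v + eps / k for v in one_hot]


def label_smoothing_mixup(xs, ys, alpha=0.2, eps=0.1, rng=None):
    """Mix each sample with a randomly chosen partner from the same batch.

    xs: list of flat feature vectors (e.g. flattened spectrogram patches).
    ys: list of one-hot label vectors.
    alpha, eps: hypothetical defaults, not values taken from the paper.
    """
    rng = rng or random.Random()
    lam = rng.betavariate(alpha, alpha)           # mixing weight in [0, 1]
    perm = rng.sample(range(len(xs)), len(xs))    # random partner for each sample
    soft = [smooth_labels(y, eps) for y in ys]    # smooth targets before mixing

    x_mix = [
        [lam * a + (1.0 - lam) * b for a, b in zip(xs[i], xs[j])]
        for i, j in enumerate(perm)
    ]
    y_mix = [
        [lam * a + (1.0 - lam) * b for a, b in zip(soft[i], soft[j])]
        for i, j in enumerate(perm)
    ]
    return x_mix, y_mix, lam
```

Because the targets are smoothed before mixing, every mixed target still sums to 1 and no class probability falls below `eps / k`, which is one way to keep training targets from becoming fully confident.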
Pages: 10
Related papers
50 items in total
  • [1] A multi-scale semantic feature fusion method for remote sensing crop classification
    Huang, Xizhi
    Wang, Hong
    Li, Xiaobing
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2024, 224
  • [2] MULTI-SCALE FEATURE FUSION FOR HYPERSPECTRAL AND LIDAR DATA JOINT CLASSIFICATION
    Zhang, Maqun
    Gao, Feng
    Dong, Junyu
    Qi, Lin
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 2856 - 2859
  • [3] Semantic Segmentation of Point Cloud Scene via Multi-Scale Feature Aggregation and Adaptive Fusion
    Guo, Baoyun
    Sun, Xiaokai
    Li, Cailin
    Sun, Na
    Wang, Yue
    Yao, Yukai
    PHOTOGRAMMETRIC ENGINEERING AND REMOTE SENSING, 2024, 90 (09): : 553 - 563
  • [4] A MULTI-SCALE DEEP FEATURE LEARNING AND SEMANTIC ENHANCEMENT APPROACH FOR REMOTE SENSING SCENE CLASSIFICATION
    Huang, Hengyi
    Wang, Wenzhen
    Liao, Wenzhi
    Xiao, Liang
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 5419 - 5422
  • [5] Acoustic Scene Classification using Convolutional Neural Networks and Multi-Scale Multi-Feature Extraction
    Dang, An
    Vu, Toan H.
    Wang, Jia-Ching
    2018 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2018,
  • [6] Short-time acoustic scene recognition method using multi-scale feature fusion
    Wang, Meng
    Zhang, Pengyuan
    Shengxue Xuebao/Acta Acustica, 2022, 47 (06): : 717 - 726
  • [7] Scene Classification of High-Resolution Remote Sensing Image by Multi-scale and Multi-feature Fusion
    Huang H.
    Xu K.-J.
    Shi G.-Y.
    Huang, Hong (hhuang@cqu.edu.cn), 1824, Chinese Institute of Electronics (48): : 1824 - 1833
  • [8] Image scene classification based on multi-scale and contextual semantic information
    Zhang, Rui-Jie
    Li, Bi-Cheng
    Wei, Fu-Shan
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2014, 42 (04): : 646 - 652
  • [9] AtResNet: Residual Atrous CNN with Multi-scale Feature Representation for Low Complexity Acoustic Scene Classification
    Aswathy Madhu
    K. Suresh
    Circuits, Systems, and Signal Processing, 2022, 41 : 7035 - 7056
  • [10] AtResNet: Residual Atrous CNN with Multi-scale Feature Representation for Low Complexity Acoustic Scene Classification
    Madhu, Aswathy
    Suresh, K.
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2022, 41 (12) : 7035 - 7056