Multi-scale semantic feature fusion and data augmentation for acoustic scene classification

被引:23
|
作者
Yang, Liping [1 ]
Tao, Lianjie [1 ]
Chen, Xinxing [1 ]
Gu, Xiaohua [2 ]
机构
[1] Chongqing Univ, MOE, Key Lab Optoelect Technol & Syst, Chongqing 400044, Peoples R China
[2] Chongqing Univ Sci & Technol, Sch Elect Engn, Chongqing 401331, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-scale feature learning; Convolutional neural networks; Data augmentation; Acoustic scene classification (ASC); Machine listening; NETWORKS;
D O I
10.1016/j.apacoust.2020.107238
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper investigates a multi-scale semantic feature fusion and data augmentation approach for deep convolutional neural network (CNN) based acoustic scene classification. To ensemble the multi-scale semantic information of CNN and improve the performance of acoustic scene classification, a multi scale feature fusion framework, which consists of a simplified Xception backbone and a semantic feature fusion strategy, is presented. A novel label smoothing mixup data augmentation method, which is a generalization of mixup and label smoothing, is proposed to alleviate the over-confident problem of network training. A spatial-mixup technique is presented to generate meaningful mixup virtual data for acoustic scene classification. Extensive experiments on synthetic data and real acoustic scene classification dataset demonstrate that both multi-scale semantic feature fusion and label smoothing spatial-mixup data augmentation are effective for improving the acoustic scene classification performance of a deep neural network. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Semantic Segmentation Method Based on Residual and Multi-Scale Feature Fusion
    Xiu, Chunbo
    Su, Huan
    Su, Xuemiao
    PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 2078 - 2083
  • [22] Semantic Segmentation on Remote Sensing Images with Multi-Scale Feature Fusion
    Zhang J.
    Jin Q.
    Wang H.
    Da C.
    Xiang S.
    Pan C.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2019, 31 (09): : 1509 - 1517
  • [23] Scene Text Detection Based on Multi-Scale Pooling and Bidirectional Feature Fusion
    Wei, Zheliang
    Li, Yueyang
    Luo, Haichi
    Computer Engineering and Applications, 2024, 60 (02) : 154 - 161
  • [24] A CNN-Based Multi-Scale Pooling Strategy for Acoustic Scene Classification
    Huang, Rong
    Xie, Yue
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2024, E107D (01) : 153 - 156
  • [25] Typhoon Classification Model Based on Multi-Scale Convolution Feature Fusion
    Lu Peng
    Zou Peiqi
    Zou Guoliang
    LASER & OPTOELECTRONICS PROGRESS, 2019, 56 (16)
  • [26] STMSF: Swin Transformer with Multi-Scale Fusion for Remote Sensing Scene Classification
    Duan, Yingtao
    Song, Chao
    Zhang, Yifan
    Cheng, Puyu
    Mei, Shaohui
    REMOTE SENSING, 2025, 17 (04)
  • [27] MFFLNet: lightweight semantic segmentation network based on multi-scale feature fusion
    Wei Depeng
    Wang Huabin
    Multimedia Tools and Applications, 2024, 83 : 30073 - 30093
  • [28] Point Cloud Semantic Segmentation Network Based on Multi-Scale Feature Fusion
    Du, Jing
    Jiang, Zuning
    Huang, Shangfeng
    Wang, Zongyue
    Su, Jinhe
    Su, Songjian
    Wu, Yundong
    Cai, Guorong
    SENSORS, 2021, 21 (05) : 1 - 20
  • [29] Global and Local Multi-scale Feature Fusion for Object Detection and Semantic Segmentation
    Lim, Young-Chul
    Kang, Minsung
    2019 30TH IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV19), 2019, : 2557 - 2562
  • [30] MFFLNet: lightweight semantic segmentation network based on multi-scale feature fusion
    Wei Depeng
    Wang Huabin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (10) : 30073 - 30093