Multi-scale semantic feature fusion and data augmentation for acoustic scene classification

被引:23
|
作者
Yang, Liping [1 ]
Tao, Lianjie [1 ]
Chen, Xinxing [1 ]
Gu, Xiaohua [2 ]
机构
[1] Chongqing Univ, MOE, Key Lab Optoelect Technol & Syst, Chongqing 400044, Peoples R China
[2] Chongqing Univ Sci & Technol, Sch Elect Engn, Chongqing 401331, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-scale feature learning; Convolutional neural networks; Data augmentation; Acoustic scene classification (ASC); Machine listening; NETWORKS;
D O I
10.1016/j.apacoust.2020.107238
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper investigates a multi-scale semantic feature fusion and data augmentation approach for deep convolutional neural network (CNN) based acoustic scene classification. To ensemble the multi-scale semantic information of CNN and improve the performance of acoustic scene classification, a multi scale feature fusion framework, which consists of a simplified Xception backbone and a semantic feature fusion strategy, is presented. A novel label smoothing mixup data augmentation method, which is a generalization of mixup and label smoothing, is proposed to alleviate the over-confident problem of network training. A spatial-mixup technique is presented to generate meaningful mixup virtual data for acoustic scene classification. Extensive experiments on synthetic data and real acoustic scene classification dataset demonstrate that both multi-scale semantic feature fusion and label smoothing spatial-mixup data augmentation are effective for improving the acoustic scene classification performance of a deep neural network. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] MSFFNet: multi-scale feature fusion network with semantic optimization for crowd counting
    Rohra, Avinash
    Yin, Baoqun
    Bilal, Hazrat
    Kumar, Aakash
    Ali, Munawar
    Li, Yang
    PATTERN ANALYSIS AND APPLICATIONS, 2025, 28 (01)
  • [32] REFERENCE-BASED VIDEO COLORIZATION WITH MULTI-SCALE SEMANTIC FUSION AND TEMPORAL AUGMENTATION
    Liu, Yaxin
    Zhang, Xiaoyan
    Xu, Xiaogang
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 1924 - 1928
  • [33] Multi-Scale Graph-Based Feature Fusion for Few-Shot Remote Sensing Image Scene Classification
    Jiang, Nan
    Shi, Haowen
    Geng, Jie
    REMOTE SENSING, 2022, 14 (21)
  • [34] Multi-Scale Feature Fusion and Advanced Representation Learning for Multi Label Image Classification
    Zhong, Naikang
    Lin, Xiao
    Du, Wen
    Shi, Jin
    CMC-COMPUTERS MATERIALS & CONTINUA, 2025, 82 (03):
  • [35] Multi-Scale and Multi-Network Deep Feature Fusion for Discriminative Scene Classification of High-Resolution Remote Sensing Images
    Yuan, Baohua
    Sehra, Sukhjit Singh
    Chiu, Bernard
    REMOTE SENSING, 2024, 16 (21)
  • [36] Acoustic scene classification method based on multi-stream convolution and data augmentation
    Cao Y.
    Fei H.
    Li P.
    Zhang X.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2022, 50 (04): : 40 - 46
  • [37] Improved heterogeneous data fusion and multi-scale feature selection method for lung cancer subtype classification
    Zhang, Yanan
    Zhao, Juanjuan
    Qiang, Yan
    Yang, Xiaotang
    Wu, Wei
    Jia, Liye
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (01):
  • [38] Diagnosis of Arrhythmia Based on Multi-scale Feature Fusion and Imbalanced Data
    Cheng, Z.
    Liu, Zx
    Yang, Gl
    PROCEEDINGS OF 2022 7TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING TECHNOLOGIES, ICMLT 2022, 2022, : 92 - 98
  • [39] Multi-scale Remote Sensing Image Classification Based on Weighted Feature Fusion
    Cheng Yinzhu
    Liu Song
    Wang Nan
    Shi Yuetian
    Zhang Geng
    ACTA PHOTONICA SINICA, 2023, 52 (11)
  • [40] Improved Remote Sensing Image Classification Based on Multi-Scale Feature Fusion
    Zhang, Chengming
    Chen, Yan
    Yang, Xiaoxia
    Gao, Shuai
    Li, Feng
    Kong, Ailing
    Zu, Dawei
    Sun, Li
    REMOTE SENSING, 2020, 12 (02)