Multi-scale semantic feature fusion and data augmentation for acoustic scene classification

被引:23
|
作者
Yang, Liping [1 ]
Tao, Lianjie [1 ]
Chen, Xinxing [1 ]
Gu, Xiaohua [2 ]
机构
[1] Chongqing Univ, MOE, Key Lab Optoelect Technol & Syst, Chongqing 400044, Peoples R China
[2] Chongqing Univ Sci & Technol, Sch Elect Engn, Chongqing 401331, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-scale feature learning; Convolutional neural networks; Data augmentation; Acoustic scene classification (ASC); Machine listening; NETWORKS;
D O I
10.1016/j.apacoust.2020.107238
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper investigates a multi-scale semantic feature fusion and data augmentation approach for deep convolutional neural network (CNN) based acoustic scene classification. To ensemble the multi-scale semantic information of CNN and improve the performance of acoustic scene classification, a multi scale feature fusion framework, which consists of a simplified Xception backbone and a semantic feature fusion strategy, is presented. A novel label smoothing mixup data augmentation method, which is a generalization of mixup and label smoothing, is proposed to alleviate the over-confident problem of network training. A spatial-mixup technique is presented to generate meaningful mixup virtual data for acoustic scene classification. Extensive experiments on synthetic data and real acoustic scene classification dataset demonstrate that both multi-scale semantic feature fusion and label smoothing spatial-mixup data augmentation are effective for improving the acoustic scene classification performance of a deep neural network. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] IX-ResNet: fragmented multi-scale feature fusion for image classification
    Xue, Tao
    Hong, Yang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (18) : 27855 - 27865
  • [42] Research on Waste Plastics Classification Method Based on Multi-Scale Feature Fusion
    Cai, Zhenxing
    Yang, Jianhong
    Fang, Huaiying
    Ji, Tianchen
    Hu, Yangyang
    Wang, Xin
    SENSORS, 2022, 22 (20)
  • [43] Hyperspectral image classification based on octave convolution and multi-scale feature fusion
    Li, Zhiyong
    Wen, Bo
    Luo, Yunzhong
    Li, Qiaochu
    Song, Lulu
    PRECISION ENGINEERING-JOURNAL OF THE INTERNATIONAL SOCIETIES FOR PRECISION ENGINEERING AND NANOTECHNOLOGY, 2022, 75 : 80 - 94
  • [44] Multi-Scale Feature Fusion with Adaptive Weighting for Diabetic Retinopathy Severity Classification
    Fan, Runze
    Liu, Yuhong
    Zhang, Rongfen
    ELECTRONICS, 2021, 10 (12)
  • [45] HiFuse: Hierarchical multi-scale feature fusion network for medical image classification
    Huo, Xiangzuo
    Sun, Gang
    Tian, Shengwei
    Wang, Yan
    Yu, Long
    Long, Jun
    Zhang, Wendong
    Li, Aolun
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 87
  • [46] Fine-Grained Image Classification Based on Multi-Scale Feature Fusion
    Li Siyao
    Liu Yuhong
    Zhang Rongfen
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (12)
  • [47] IX-ResNet: fragmented multi-scale feature fusion for image classification
    Tao Xue
    Yang Hong
    Multimedia Tools and Applications, 2021, 80 : 27855 - 27865
  • [48] Research on the Classification of Complex Wheat Fields Based on Multi-Scale Feature Fusion
    Mu, Fei
    Chu, Hongli
    Shi, Shuaiqi
    Yuan, Minxin
    Liu, Qi
    Yang, Fuzeng
    AGRONOMY-BASEL, 2022, 12 (11):
  • [49] Multi-scale feature fusion and class weight loss for skin lesion classification
    Hu Z.
    Mei W.
    Chen H.
    Hou W.
    Computers in Biology and Medicine, 2024, 176
  • [50] Multi-Scale Deep Feature Fusion with Machine Learning Classifier for Birdsong Classification
    Li, Wei
    Lv, Danju
    Yu, Yueyun
    Zhang, Yan
    Gu, Lianglian
    Wang, Ziqian
    Zhu, Zhicheng
    APPLIED SCIENCES-BASEL, 2025, 15 (04):