MFF-Net: A multi-scale feature fusion network for birdsong classification

Cited by: 0
Authors
Zhou, Hongfang [1, 2]
Zheng, Kangyun [1, 2]
Zhu, Wenjing [4]
Tong, Jiahao [1, 2]
Cao, Chenhui [1, 2]
Pan, Heng [1, 2, 3]
Li, Junhuai [1, 2]
Affiliations
[1] Xian Univ Technol, Sch Comp Sci & Engn, Xian 710048, Peoples R China
[2] Shaanxi Key Lab Network Comp & Secur Technol, Xian 710048, Peoples R China
[3] Shaanxi Expressway Testing & Measuring Co Ltd, Xian 710000, Peoples R China
[4] Xian Univ Posts & Telecommun, Coll Econ & Management, Xian 710061, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Birdsong classification; Multi-scale feature fusion; Channel attention mechanism;
DOI
10.1016/j.apacoust.2025.110561
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
In this paper, we propose MFF-Net (Multi-scale Feature Fusion Network), a novel birdsong classification network that improves classification performance through multi-scale feature fusion. The network has four components: (1) a multi-scale feature extraction module that extracts features at different scales from the original sound; (2) a feature fusion module that uses a channel attention mechanism to integrate these features; (3) a feature replacement module that replaces low-weight features to enhance the feature representation; and (4) a classifier module that performs birdsong classification. The method was evaluated on two publicly available birdsong datasets and, to test its generalization performance, on an urban sound dataset (UrbanSound8K). MFF-Net achieved a classification accuracy of 96.83% on the BirdCLEF-13 dataset and competitive results on UrbanSound8K. These results highlight the robustness and effectiveness of MFF-Net in noisy and diverse environments.
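The abstract names the four stages but gives no implementation details, so the following is only a toy sketch of how such a pipeline could be wired together, not the authors' method. Everything in it is an assumption made for illustration: the function names, the use of moving-average windows as "scales", a softmax over per-channel means as a stand-in for a learned channel attention layer, and the rule that a low-weight channel is replaced by a copy of the highest-weight channel. The classifier stage is omitted.

```python
import math


def multi_scale_features(signal, window_sizes=(2, 4, 8), out_len=4):
    """Hypothetical multi-scale extractor: one feature channel per window size.

    Each channel is the signal smoothed by a moving average of width w,
    then pooled (mean absolute value) into out_len bins.
    """
    channels = []
    for w in window_sizes:
        smoothed = [sum(signal[i:i + w]) / w for i in range(len(signal) - w + 1)]
        if len(smoothed) < out_len:
            raise ValueError("signal too short for this window size")
        bin_size = len(smoothed) / out_len
        pooled = []
        for b in range(out_len):
            seg = smoothed[int(b * bin_size):int((b + 1) * bin_size)]
            pooled.append(sum(abs(x) for x in seg) / len(seg))
        channels.append(pooled)
    return channels


def channel_attention_weights(channels):
    """Simplified channel attention: squeeze each channel to its mean,
    then apply a softmax. A real network would learn this mapping."""
    squeezed = [sum(c) / len(c) for c in channels]
    peak = max(squeezed)
    exps = [math.exp(s - peak) for s in squeezed]
    total = sum(exps)
    return [e / total for e in exps]


def replace_low_weight(channels, weights, threshold):
    """Assumed replacement rule: any channel whose weight is below
    threshold is replaced by a copy of the highest-weight channel."""
    best = channels[weights.index(max(weights))]
    return [list(best) if w < threshold else list(c)
            for c, w in zip(channels, weights)]


def fuse(channels, weights):
    """Weighted sum of channels, giving one fused feature vector."""
    return [sum(w * c[i] for c, w in zip(channels, weights))
            for i in range(len(channels[0]))]


# Usage on a synthetic signal (a sine wave stands in for a birdsong clip).
signal = [math.sin(0.3 * i) for i in range(64)]
channels = multi_scale_features(signal)
weights = channel_attention_weights(channels)
channels = replace_low_weight(channels, weights, threshold=0.2)
fused = fuse(channels, weights)
```

The fused vector would then be passed to a classifier. In a trained network the attention weights and the replacement criterion would come from learned parameters rather than the fixed rules used here.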
Pages: 15