MFF-Net: A multi-scale feature fusion network for birdsong classification

被引:0
|
作者
Zhou, Hongfang [1 ,2 ]
Zheng, Kangyun [1 ,2 ]
Zhu, Wenjing [4 ]
Tong, Jiahao [1 ,2 ]
Cao, Chenhui [1 ,2 ]
Pan, Heng [1 ,2 ,3 ]
Li, Junhuai [1 ,2 ]
机构
[1] Xian Univ Technol, Sch Comp Sci & Engn, Xian 710048, Peoples R China
[2] Shaanxi Key Lab Network Comp & Secur Technol, Xian 710048, Peoples R China
[3] Shaanxi Expressway Testing & Measuring Co Ltd, Xian 710000, Peoples R China
[4] Xi Univ Posts & Telecommun, Coll Econ & Management, Xian 710061, Peoples R China
基金
中国国家自然科学基金;
关键词
Birdsong classification; Multi-scale feature fusion; Channel attention mechanism;
D O I
10.1016/j.apacoust.2025.110561
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we propose a novel birdsong classification network, MFF-Net(Multi-scale Feature Fusion Network), which enhances classification performance through multi-scale feature fusion. The network is composed of four components. The first one is a multi-scale feature extraction module that extracts different scale features from the original sound. The second one is a feature fusion module utilizing a channel attention mechanism to integrate these features effectively. The third one is a feature replacement module designed to replace low-weight features and enhance feature representation. And the fourth one is a classifier module that performs birdsong classification. The proposed method was evaluated on two publicly available birdsong datasets and an urban sound dataset(Urbansound8k) to test its generalization performance. Experimental results showed that MFF-Net achieved a classification accuracy of 96.83 % on the BirdCLEF-13 dataset and demonstrated good generalization performance on the urban sound dataset (UrbanSound8k), achieving competitive results. These results highlight the robustness and effectiveness of MFF-Net in noisy and diverse environments.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Birdsong classification based on multi feature channel fusion
    Zhihua Liu
    Wenjie Chen
    Aibin Chen
    Guoxiong Zhou
    Jizheng Yi
    Multimedia Tools and Applications, 2022, 81 : 15469 - 15490
  • [22] Birdsong classification based on multi feature channel fusion
    Liu, Zhihua
    Chen, Wenjie
    Chen, Aibin
    Zhou, Guoxiong
    Yi, Jizheng
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (11) : 15469 - 15490
  • [23] Birdsong classification based on multi-feature fusion
    Yan, Na
    Chen, Aibin
    Zhou, Guoxiong
    Zhang, Zhiqiang
    Liu, Xiangyong
    Wang, Jianwu
    Liu, Zhihua
    Chen, Wenjie
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (30) : 36529 - 36547
  • [24] Birdsong classification based on multi-feature fusion
    Na Yan
    Aibin Chen
    Guoxiong Zhou
    Zhiqiang Zhang
    Xiangyong Liu
    Jianwu Wang
    Zhihua Liu
    Wenjie Chen
    Multimedia Tools and Applications, 2021, 80 : 36529 - 36547
  • [25] MFF-DTA: Multi-scale feature fusion for drug-target affinity prediction
    Tang, Xiwei
    Ma, Wanjun
    Yang, Mengyun
    Li, Wenjun
    METHODS, 2024, 231 : 1 - 7
  • [26] Ship fine-grained classification network based on multi-scale feature fusion
    Chen, Lisu
    Wang, Qian
    Zhu, Enyan
    Feng, Daolun
    Wu, Huafeng
    Liu, Tao
    OCEAN ENGINEERING, 2025, 318
  • [27] Multi-scale high and low feature fusion attention network for intestinal image classification
    Li, Sheng
    Zhu, Beibei
    Guo, Xinran
    Ye, Shufang
    Ye, Jietong
    Zhuang, Yongwei
    He, Xiongxiong
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (06) : 2877 - 2886
  • [28] MSFF: A Multi-Scale Feature Fusion Convolutional Neural Network for Hyperspectral Image Classification
    Gong, Gu
    Wang, Xiaopeng
    Zhang, Jiahua
    Shang, Xiaodi
    Pan, Zhicheng
    Li, Zhiyuan
    Zhang, Junshi
    ELECTRONICS, 2025, 14 (04):
  • [29] Multi-scale high and low feature fusion attention network for intestinal image classification
    Sheng Li
    Beibei Zhu
    Xinran Guo
    Shufang Ye
    Jietong Ye
    Yongwei Zhuang
    Xiongxiong He
    Signal, Image and Video Processing, 2023, 17 : 2877 - 2886
  • [30] Facial Expression Image Classification Based on Multi-scale Feature Fusion Residual Network
    Zhao, Yuxi
    Wang, Chunzhi
    Zhou, Xianjing
    Liu, Hu
    Communications in Computer and Information Science, 2023, 1811 CCIS : 105 - 118