Improving sound event detection through enhanced feature extraction and attention mechanisms

Cited by: 0
Authors
Zhang, Dongping [1]
Wu, Siyi [1]
Lu, Zhanhong [2]
Zhang, Zhehao [3]
Hu, Haimiao [4]
Yu, Jiabin [1]
Affiliations
[1] China Jiliang Univ, Coll Informat Engn, Hangzhou 310018, Peoples R China
[2] Hangzhou Hikvis Digital Technol Co Ltd, Hangzhou 310051, Peoples R China
[3] Hangzhou Aihua Intelligent Technol Co Ltd, Hangzhou 311422, Peoples R China
[4] Beihang Univ, Hangzhou Innovat Inst, Hangzhou 310051, Peoples R China
DOI
10.1007/s11704-025-41108-7
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Pages: 3
Related papers
50 in total
  • [31] SSW-YOLO: Enhanced Blood Cell Detection with Improved Feature Extraction and Multi-scale Attention
    Sun, Hai
    Wan, Xiaorong
    Tang, Shouguo
    Li, Yingna
    JOURNAL OF IMAGING INFORMATICS IN MEDICINE, 2025,
  • [32] IMPROVING SOUND EVENT DETECTION METRICS: INSIGHTS FROM DCASE 2020
    Ferroni, Giacomo
    Turpault, Nicolas
    Azcarreta, Juan
    Tuveri, Francesco
    Serizel, Romain
    Bilen, Cagdas
    Krstulovic, Sacha
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 631 - 635
  • [33] Enhanced mechanisms of pooling and channel attention for deep learning feature maps
    Li, Hengyi
    Yue, Xuebin
    Meng, Lin
    PEERJ COMPUTER SCIENCE, 2022, 8
  • [34] Enhanced mechanisms of pooling and channel attention for deep learning feature maps
    Li H.
    Yue X.
    Meng L.
    PeerJ Computer Science, 2022, 8
  • [35] Sound Event Detection: A Journey Through DCASE Challenge Series
    Khandelwal, Tanmay
    Das, Rohan Kumar
    Chng, Eng Siong
    APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2024, 13 (01) : 1 - 63
  • [36] Joining Sound Event Detection and Localization Through Spatial Segregation
    Trowitzsch, Ivo
    Schymura, Christopher
    Kolossa, Dorothea
    Obermayer, Klaus
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 487 - 502
  • [37] CNN-TRANSFORMER WITH SELF-ATTENTION NETWORK FOR SOUND EVENT DETECTION
    Wakayama, Keigo
    Saito, Shoichiro
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 806 - 810
  • [38] Polyphonic sound event localization and detection based on Multiple Attention Fusion ResNet
    Zhang S.
    Zhang Y.
    Liao Y.
    Pang K.
    Wan Z.
    Zhou S.
    Mathematical Biosciences and Engineering, 2024, 21 (02) : 2004 - 2023
  • [39] Sound Event Localization and Detection Using Parallel Multi-attention Enhancement
    Zhengyu Chen
    Qinghua Huang
    Circuits, Systems, and Signal Processing, 2024, 43 (1) : 545 - 567
  • [40] Sound Event Localization and Detection Using Parallel Multi-attention Enhancement
    Chen, Zhengyu
    Huang, Qinghua
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2024, 43 (01) : 545 - 567