RECOGNITION AND RETRIEVAL OF SOUND EVENTS USING SPARSE CODING CONVOLUTIONAL NEURAL NETWORK

被引:0
|
作者
Wang, Chien-Yao [1 ]
Santoso, Andri [1 ]
Mathulaprangsan, Seksan [1 ]
Chiang, Chin-Chin [1 ]
Wu, Chung-Hsien [2 ]
Wang, Jia-Ching [1 ]
机构
[1] Natl Cent Univ, Dept Comp Sci & Informat Engn, Taoyuan, Taiwan
[2] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan, Taiwan
关键词
Sparse coding convolutional neural network; sound event recognition; sound event retrieval; IMAGE FEATURE; CLASSIFICATION;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper proposes a novel deep convolutional neural network (CNN), called sparse coding convolutional neural network (SC-CNN), to address the problem of sound event recognition and retrieval task. Unlike the general framework of a CNN, in which feature learning process is performed hierarchically, the proposed framework models the whole memorizing procedures in the human brain, including encoding, storage, and recollection. Sound data from the RWCP sound scene dataset with added noise from NOISEX-92 noise dataset are used to compare the performance of the proposed system with the state-of-the-art baselines. The experimental results indicated that the proposed SC-CNN outperformed the state-of-the-art systems in sound event recognition and retrieval. In the sound event recognition task, the proposed system achieved an accuracy of 94.6%, 100% and 100% under 0db, 10db and clean noise conditions, respectively. In the retrieval task, the proposed system improves the mAP rate of the general CNN by approximately 6%.
引用
收藏
页码:589 / 594
页数:6
相关论文
共 50 条
  • [21] Chinese Herbal Medicine Image Recognition and Retrieval by Convolutional Neural Network
    Sun, Xin
    Qian, Huinan
    PLOS ONE, 2016, 11 (06):
  • [22] Animal Sound Classification Using A Convolutional Neural Network
    Sasmaz, Emre
    Tek, F. Boray
    2018 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2018, : 625 - 629
  • [23] Visual recognition for urban traffic data retrieval and analysis in major events using convolutional neural networks
    Pi, Yalong
    Duffield, Nick
    Behzadan, Amir H.
    Lomax, Tim
    COMPUTATIONAL URBAN SCIENCE, 2022, 2 (01):
  • [24] Visual recognition for urban traffic data retrieval and analysis in major events using convolutional neural networks
    Yalong Pi
    Nick Duffield
    Amir H. Behzadan
    Tim Lomax
    Computational Urban Science, 2
  • [25] Convolutional Neural Networks Analyzed via Convolutional Sparse Coding
    Papyan, Vardan
    Romano, Yaniv
    Elad, Michael
    JOURNAL OF MACHINE LEARNING RESEARCH, 2017, 18 : 1 - 52
  • [26] Character Recognition for Automotive Parts Coding Based on Convolutional Neural Network
    Li, Daiyang
    Tang, Qian
    Zhou, Hao
    Li, Yi
    Su, Qiguang
    Lin, Zhaofu
    2020 4TH INTERNATIONAL CONFERENCE ON MACHINE VISION AND INFORMATION TECHNOLOGY (CMVIT 2020), 2020, 1518
  • [27] Lightweight Environmental Sound Recognition Using Convolutional Neural Networks
    Tang, Liwen
    Du, Zhouyang
    Wang, Yawen
    Xiao, Zhuoling
    Lin, Jiazhen
    Zhang, Xiaoyan
    2021 6TH INTERNATIONAL CONFERENCE ON UK-CHINA EMERGING TECHNOLOGIES (UCET 2021), 2021, : 215 - 220
  • [28] ROBUST SOUND EVENT RECOGNITION USING CONVOLUTIONAL NEURAL NETWORKS
    Zhang, Haomin
    McLoughlin, Ian
    Song, Yan
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 559 - 563
  • [29] Sparse Direct Convolutional Neural Network
    Daultani, Vijay
    Ohno, Yoshiyuki
    Ishizaka, Kazuhisa
    ADVANCES IN NEURAL NETWORKS, PT I, 2017, 10261 : 293 - 303
  • [30] Image Denoising Using Convolutional Sparse Coding Network with Dry Friction
    Zhang, Yali
    Wang, Xiaofan
    Wang, Fengpin
    Wang, Jinjia
    COMPUTER VISION - ACCV 2022, PT I, 2023, 13841 : 587 - 601