RECOGNITION AND RETRIEVAL OF SOUND EVENTS USING SPARSE CODING CONVOLUTIONAL NEURAL NETWORK

被引:0
|
作者
Wang, Chien-Yao [1 ]
Santoso, Andri [1 ]
Mathulaprangsan, Seksan [1 ]
Chiang, Chin-Chin [1 ]
Wu, Chung-Hsien [2 ]
Wang, Jia-Ching [1 ]
机构
[1] Natl Cent Univ, Dept Comp Sci & Informat Engn, Taoyuan, Taiwan
[2] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan, Taiwan
关键词
Sparse coding convolutional neural network; sound event recognition; sound event retrieval; IMAGE FEATURE; CLASSIFICATION;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper proposes a novel deep convolutional neural network (CNN), called sparse coding convolutional neural network (SC-CNN), to address the problem of sound event recognition and retrieval task. Unlike the general framework of a CNN, in which feature learning process is performed hierarchically, the proposed framework models the whole memorizing procedures in the human brain, including encoding, storage, and recollection. Sound data from the RWCP sound scene dataset with added noise from NOISEX-92 noise dataset are used to compare the performance of the proposed system with the state-of-the-art baselines. The experimental results indicated that the proposed SC-CNN outperformed the state-of-the-art systems in sound event recognition and retrieval. In the sound event recognition task, the proposed system achieved an accuracy of 94.6%, 100% and 100% under 0db, 10db and clean noise conditions, respectively. In the retrieval task, the proposed system improves the mAP rate of the general CNN by approximately 6%.
引用
收藏
页码:589 / 594
页数:6
相关论文
共 50 条
  • [31] A convolutional neural network with sparse representation
    Yang, Guoan
    Yang, Junjie
    Lu, Zhengzhi
    Liu, Deyang
    KNOWLEDGE-BASED SYSTEMS, 2020, 209
  • [32] SUnCNN: Sparse Unmixing Using Unsupervised Convolutional Neural Network
    Rasti, Behnood
    Koirala, Bikram
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [33] DATA DRIVEN CONVOLUTIONAL SPARSE CODING FOR VISUAL RECOGNITION
    Zeng, Yijie
    Chen, Jichao
    Huang, Guang-Bin
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 2736 - 2740
  • [34] Manufacturing Feature Recognition With a Sparse Voxel-Based Convolutional Neural Network
    Vatandoust, Farzad
    Yan, Xiaoliang
    Rosen, David
    Melkote, Shreyes N.
    JOURNAL OF COMPUTING AND INFORMATION SCIENCE IN ENGINEERING, 2025, 25 (03)
  • [35] KNOWLEDGE TRANSFER FROM WEAKLY LABELED AUDIO USING CONVOLUTIONAL NEURAL NETWORK FOR SOUND EVENTS AND SCENES
    Kumar, Anurag
    Khadkevich, Maksim
    Fugen, Christian
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 326 - 330
  • [36] Recognition of Chinese food using convolutional neural network
    Teng, Jianing
    Zhang, Dong
    Lee, Dah-Jye
    Chou, Yao
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (09) : 11155 - 11172
  • [37] Genre Recognition of Artworks using Convolutional Neural Network
    Hosainl, Md Kamran
    Harun-Ur-Rashid
    Taher, Tasnova Bintee
    Rahman, Mohammad Masudur
    2020 23RD INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (ICCIT 2020), 2020,
  • [38] Facial Expression Recognition Using Convolutional Neural Network
    Agrawal, Ved
    Bamb, Chirag
    Mata, Harsh
    Dhunde, Harshal
    Hablani, Ramchand
    SMART TRENDS IN COMPUTING AND COMMUNICATIONS, VOL 4, SMARTCOM 2024, 2024, 948 : 267 - 278
  • [39] Recognition of Chinese food using convolutional neural network
    Jianing Teng
    Dong Zhang
    Dah-Jye Lee
    Yao Chou
    Multimedia Tools and Applications, 2019, 78 : 11155 - 11172
  • [40] Medical image retrieval using deep convolutional neural network
    Qayyum, Adnan
    Anwar, Syed Muhammad
    Awais, Muhammad
    Majid, Muhammad
    NEUROCOMPUTING, 2017, 266 : 8 - 20