RECOGNITION AND RETRIEVAL OF SOUND EVENTS USING SPARSE CODING CONVOLUTIONAL NEURAL NETWORK

Cited by: 0
Authors
Wang, Chien-Yao [1]
Santoso, Andri [1]
Mathulaprangsan, Seksan [1]
Chiang, Chin-Chin [1]
Wu, Chung-Hsien [2]
Wang, Jia-Ching [1]
Affiliations
[1] Natl Cent Univ, Dept Comp Sci & Informat Engn, Taoyuan, Taiwan
[2] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan, Taiwan
Keywords
Sparse coding convolutional neural network; sound event recognition; sound event retrieval; IMAGE FEATURE; CLASSIFICATION;
DOI
Not available
Chinese Library Classification (CLC) number
TP31 [Computer Software];
Discipline classification code
081202; 0835;
Abstract
This paper proposes a novel deep convolutional neural network (CNN), called the sparse coding convolutional neural network (SC-CNN), for sound event recognition and retrieval. Unlike the general CNN framework, in which the feature learning process is performed hierarchically, the proposed framework models the whole memorizing procedure of the human brain, including encoding, storage, and recollection. Sound data from the RWCP sound scene dataset, with added noise from the NOISEX-92 noise dataset, are used to compare the performance of the proposed system with state-of-the-art baselines. The experimental results indicate that the proposed SC-CNN outperforms the state-of-the-art systems in both sound event recognition and retrieval. In the sound event recognition task, the proposed system achieved accuracies of 94.6%, 100%, and 100% under the 0 dB, 10 dB, and clean conditions, respectively. In the retrieval task, the proposed system improved the mean average precision (mAP) of the general CNN by approximately 6%.
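The abstract reports results for noise-corrupted test data and for retrieval measured by mAP. The sketch below illustrates both ideas in plain Python. It is not the authors' code: it assumes the 0 dB and 10 dB conditions refer to signal-to-noise ratio, and it assumes each ranked retrieval list contains every relevant item for its query. The function names `mix_at_snr`, `average_precision`, and `mean_average_precision` are hypothetical.

```python
import math


def mix_at_snr(signal, noise, snr_db):
    """Add `noise` to `signal`, scaled so the mixture has the requested SNR in dB.

    Assumption: the dB figures in the abstract are signal-to-noise ratios.
    """
    if len(signal) != len(noise):
        raise ValueError("signal and noise must have the same length")
    p_signal = sum(x * x for x in signal) / len(signal)
    p_noise = sum(n * n for n in noise) / len(noise)
    if p_noise == 0:
        raise ValueError("noise has zero power")
    # SNR_dB = 10 * log10(p_signal / (gain**2 * p_noise)), solved for gain.
    gain = math.sqrt(p_signal / (p_noise * 10 ** (snr_db / 10)))
    return [x + gain * n for x, n in zip(signal, noise)]


def average_precision(ranked_relevance):
    """Average precision for one query.

    `ranked_relevance` is a list of booleans in ranked order (True = relevant).
    Assumption: the list contains every relevant item for the query.
    """
    hits = 0
    precision_sum = 0.0
    for rank, relevant in enumerate(ranked_relevance, start=1):
        if relevant:
            hits += 1
            precision_sum += hits / rank  # precision at this rank
    return precision_sum / hits if hits else 0.0


def mean_average_precision(queries):
    """Mean of the per-query average precision values."""
    return sum(average_precision(q) for q in queries) / len(queries)


if __name__ == "__main__":
    clean = [1.0, -1.0, 1.0, -1.0]
    noise = [0.5, 0.5, -0.5, -0.5]
    print(mix_at_snr(clean, noise, 0))
    print(mean_average_precision([[True, False, True], [False, True]]))
```

A lower `snr_db` gives the noise a larger gain, so the 0 dB condition is the hardest of the three reported conditions.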
Pages: 589 - 594
Page count: 6
Related papers
50 in total
  • [1] Sound Events Recognition and Retrieval Using Multi-Convolutional-Channel Sparse Coding Convolutional Neural Networks
    Wang, Chien-Yao
    Tai, Tzu-Chiang
    Wang, Jia-Ching
    Santoso, Andri
    Mathulaprangsan, Seksan
    Chiang, Chin-Chin
    Wu, Chung-Hsien
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 1875 - 1887
  • [2] Bird Sound Recognition Using a Convolutional Neural Network
    Incze, Agnes
    Jancso, Henrietta-Bernadett
    Szilagyi, Zoltan
    Farkas, Attila
    Sulyok, Csaba
    2018 IEEE 16TH INTERNATIONAL SYMPOSIUM ON INTELLIGENT SYSTEMS AND INFORMATICS (SISY 2018), 2018, : 295 - 300
  • [3] Multispecies bird sound recognition using a fully convolutional neural network
    María Teresa García-Ordás
    Sergio Rubio-Martín
    José Alberto Benítez-Andrades
    Hector Alaiz-Moretón
    Isaías García-Rodríguez
    Applied Intelligence, 2023, 53 : 23287 - 23300
  • [4] Multispecies bird sound recognition using a fully convolutional neural network
    Garcia-Ordas, Maria Teresa
    Rubio-Martin, Sergio
    Benitez-Andrades, Jose Alberto
    Alaiz-Moreton, Hector
    Garcia-Rodriguez, Isaias
    APPLIED INTELLIGENCE, 2023, 53 (20) : 23287 - 23300
  • [5] A Method of Speech Coding for Speech Recognition Using a Convolutional Neural Network
    Kubanek, Mariusz
    Bobulski, Janusz
    Kulawik, Joanna
    SYMMETRY-BASEL, 2019, 11 (09) : 1 - 12
  • [6] Insect Sound Recognition Based on Convolutional Neural Network
    Dong, Xue
    Yan, Ning
    Wei, Ying
    2018 IEEE 3RD INTERNATIONAL CONFERENCE ON IMAGE, VISION AND COMPUTING (ICIVC), 2018, : 855 - 859
  • [7] High-resolution CT Image Retrieval Using Sparse Convolutional Neural Network
    Lei, Yang
    Xu, Dong
    Zhou, Zhengyang
    Higgins, Kristin
    Dong, Xue
    Liu, Tian
    Shim, Hyunsuk
    Mao, Hui
    Curran, Walter J.
    Yang, Xiaofeng
    MEDICAL IMAGING 2018: PHYSICS OF MEDICAL IMAGING, 2018, 10573
  • [8] Convolutional Neural Networks Based on Sparse Coding for Human Postures Recognition
    Yang, Ning
    Li, Yawei
    Yang, Yuliang
    Zhu, Mengyu
    AOPC 2017: OPTICAL SENSING AND IMAGING TECHNOLOGY AND APPLICATIONS, 2017, 10462
  • [9] Convolutional Sparse Coding for Face Recognition
    Jin, Junwei
    Chen, C. L. Philip
    2017 4TH INTERNATIONAL CONFERENCE ON INFORMATION, CYBERNETICS AND COMPUTATIONAL SOCIAL SYSTEMS (ICCSS), 2017, : 137 - 141
  • [10] Clothing recognition based on deep sparse convolutional neural network
    Xiang, Jun
    Pan, Ruru
    Gao, Weidong
    INTERNATIONAL JOURNAL OF CLOTHING SCIENCE AND TECHNOLOGY, 2022, 34 (01) : 119 - 133