RECOGNITION AND RETRIEVAL OF SOUND EVENTS USING SPARSE CODING CONVOLUTIONAL NEURAL NETWORK

被引：0

作者：

Wang, Chien-Yao ^{[1
]}

Santoso, Andri ^{[1
]}

Mathulaprangsan, Seksan ^{[1
]}

Chiang, Chin-Chin ^{[1
]}

Wu, Chung-Hsien ^{[2
]}

Wang, Jia-Ching ^{[1
]}

机构：

[1] Natl Cent Univ, Dept Comp Sci & Informat Engn, Taoyuan, Taiwan

[2] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan, Taiwan

来源：

2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME) | 2017年

关键词：

Sparse coding convolutional neural network; sound event recognition; sound event retrieval; IMAGE FEATURE; CLASSIFICATION;

D O I：

暂无

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

This paper proposes a novel deep convolutional neural network (CNN), called sparse coding convolutional neural network (SC-CNN), to address the problem of sound event recognition and retrieval task. Unlike the general framework of a CNN, in which feature learning process is performed hierarchically, the proposed framework models the whole memorizing procedures in the human brain, including encoding, storage, and recollection. Sound data from the RWCP sound scene dataset with added noise from NOISEX-92 noise dataset are used to compare the performance of the proposed system with the state-of-the-art baselines. The experimental results indicated that the proposed SC-CNN outperformed the state-of-the-art systems in sound event recognition and retrieval. In the sound event recognition task, the proposed system achieved an accuracy of 94.6%, 100% and 100% under 0db, 10db and clean noise conditions, respectively. In the retrieval task, the proposed system improves the mAP rate of the general CNN by approximately 6%.

引用

页码：589 / 594

页数：6

共 50 条

[31] A convolutional neural network with sparse representation
Yang, Guoan
Yang, Junjie
Lu, Zhengzhi
Liu, Deyang
KNOWLEDGE-BASED SYSTEMS, 2020, 209
[32] SUnCNN: Sparse Unmixing Using Unsupervised Convolutional Neural Network
Rasti, Behnood
Koirala, Bikram
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[33] DATA DRIVEN CONVOLUTIONAL SPARSE CODING FOR VISUAL RECOGNITION
Zeng, Yijie
Chen, Jichao
Huang, Guang-Bin
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 2736 - 2740
[34] Manufacturing Feature Recognition With a Sparse Voxel-Based Convolutional Neural Network
Vatandoust, Farzad
Yan, Xiaoliang
Rosen, David
Melkote, Shreyes N.
JOURNAL OF COMPUTING AND INFORMATION SCIENCE IN ENGINEERING, 2025, 25 (03)
[35] KNOWLEDGE TRANSFER FROM WEAKLY LABELED AUDIO USING CONVOLUTIONAL NEURAL NETWORK FOR SOUND EVENTS AND SCENES
Kumar, Anurag
Khadkevich, Maksim
Fugen, Christian
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 326 - 330
[36] Recognition of Chinese food using convolutional neural network
Teng, Jianing
Zhang, Dong
Lee, Dah-Jye
Chou, Yao
MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (09) : 11155 - 11172
[37] Genre Recognition of Artworks using Convolutional Neural Network
Hosainl, Md Kamran
Harun-Ur-Rashid
Taher, Tasnova Bintee
Rahman, Mohammad Masudur
2020 23RD INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (ICCIT 2020), 2020,
[38] Facial Expression Recognition Using Convolutional Neural Network
Agrawal, Ved
Bamb, Chirag
Mata, Harsh
Dhunde, Harshal
Hablani, Ramchand
SMART TRENDS IN COMPUTING AND COMMUNICATIONS, VOL 4, SMARTCOM 2024, 2024, 948 : 267 - 278
[39] Recognition of Chinese food using convolutional neural network
Jianing Teng
Dong Zhang
Dah-Jye Lee
Yao Chou
Multimedia Tools and Applications, 2019, 78 : 11155 - 11172
[40] Medical image retrieval using deep convolutional neural network
Qayyum, Adnan
Anwar, Syed Muhammad
Awais, Muhammad
Majid, Muhammad
NEUROCOMPUTING, 2017, 266 : 8 - 20

← 1 2 3 4 5 →