Acoustic Scene Classification Using A Deeper Training Method for Convolution Neural Network

被引:0
|
作者
Tan Doan [1 ]
Hung Nguyen [1 ]
Dat Thanh Ngo [1 ]
Lam Pham [1 ]
Ha Hoang Kha [1 ]
机构
[1] Ho Chi Minh City Univ Technol, Fac Elect & Elect Engn, VNU HCM, Ho Chi Minh City, Vietnam
关键词
Acoustic scene classification; deep learning; convolutional neural network; Gammatone spectrogram;
D O I
10.1109/isee2.2019.8921365
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we present a deep learning framework applied for acoustic scene classification (ASC) recognizing the environmental sounds. Since an audio scene related to a given location potentially contains numerous sound events, only few of these events supply helpful information on the scene, which makes the acoustic scene classification task become a very complex problem. To confront this challenge, we suggest a novel architecture consisting of two basic processes. The front-end process approaches a spectrogram feature, using Gammatone filters. Regarding the back-end classification, we propose a novel convolutional neural network (CNN) architecture that enforces the network deeply learning middle convolutional layers. Our experiments conducted over DCASE2016 task 1A dataset offer the highest classification accuracy of 84.4% as compared to 72.5% of DCASE2016 baseline.
引用
收藏
页码:63 / 67
页数:5
相关论文
共 50 条
  • [1] Wider or Deeper Neural Network Architecture for Acoustic Scene Classification with Mismatched Recording Devices
    Lam Pham
    Khoa Tran
    Dat Ngo
    Hieu Tang
    Son Phan
    Schindler, Alexander
    PROCEEDINGS OF THE 4TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA IN ASIA, MMASIA 2022, 2022,
  • [2] Acoustic scene classification using projection Kervolutional neural network
    Mulimani, Manjunath
    Nandi, Ritika
    Koolagudi, Shashidhar G.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (06) : 9447 - 9457
  • [3] Acoustic scene classification using projection Kervolutional neural network
    Manjunath Mulimani
    Ritika Nandi
    Shashidhar G Koolagudi
    Multimedia Tools and Applications, 2023, 82 : 9447 - 9457
  • [4] Acoustic Scene Classification Using Bilinear Pooling on Time-liked and Frequency-liked Convolution Neural Network
    Kek, Xing Yong
    Chin, Cheng Siong
    Li, Ye
    2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), 2019, : 3189 - 3194
  • [5] A Convolutional Neural Network Approach for Acoustic Scene Classification
    Valenti, Michele
    Squartini, Stefano
    Diment, Aleksandr
    Parascandolo, Giambattista
    Virtanen, Tuomas
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 1547 - 1554
  • [6] Large-scale text classification with deeper and wider convolution neural network
    Huang M.
    Huang W.
    International Journal of Simulation and Process Modelling, 2020, 15 (1-2) : 120 - 133
  • [7] Acoustic Scene Classification Using Self-Determination Convolutional Neural Network
    Wang, Chien-Yao
    Santoso, Andri
    Wang, Jia-Ching
    2017 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC 2017), 2017, : 19 - 22
  • [8] Analysis of Deep Neural Network Models for Acoustic Scene Classification
    Basbug, Ahmet Melih
    Sert, Mustafa
    2019 27TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2019,
  • [9] A Time Delay Convolutional Neural Network for Acoustic Scene Classification
    Lee, Younglo
    Park, Sangwook
    Ko, Hanseok
    2018 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2018,
  • [10] High Performance Neural Network based Acoustic Scene Classification
    Prakruthi, U. S.
    Kiran, Divya
    Ramasangu, Hariharan
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON INVENTIVE SYSTEMS AND CONTROL (ICISC 2018), 2018, : 781 - 784