LEARNING ENVIRONMENTAL SOUNDS WITH END-TO-END CONVOLUTIONAL NEURAL NETWORK

被引:0
|
作者
Tokozume, Yuji [1 ]
Harada, Tatsuya [1 ]
机构
[1] Univ Tokyo, Tokyo, Japan
关键词
Environmental sound classification; convolutional neural network; end-to-end system; feature learning;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Environmental sound classification (ESC) is usually conducted based on handcrafted features such as the log-mel feature. Meanwhile, end-to-end classification systems perform feature extraction jointly with classification and have achieved success particularly in image classification. In the same manner, if environmental sounds could be directly learned from the raw waveforms, we would be able to extract a new feature effective for classification that could not have been designed by humans, and thi s new feature could improve the classification performance. In this paper, we propose a novel end-to-end ESC system using a convolutional neural network (CNN). The classification accuracy of our system on ESC-50 is 5.1% higher than that achieved when using logmel-CNN with the static log-mel feature. Moreover, we achieve a 6.5% improvement in classification accuracy over the state-of-the-art logmel-CNN with the static and delta log-mel feature, simply by combining our system and logmel-CNN.
引用
收藏
页码:2721 / 2725
页数:5
相关论文
共 50 条
  • [41] End-to-end recognition of slab identification numbers using a deep convolutional neural network
    Lee, Sang Jun
    Yun, Jong Pil
    Koo, Gyogwon
    Kim, Sang Woo
    KNOWLEDGE-BASED SYSTEMS, 2017, 132 : 1 - 10
  • [42] End-to-End Convolutional Neural Network Feature Extraction for Remote Sensed Images Classification
    Alem, Abebaw
    Kumar, Shailender
    APPLIED ARTIFICIAL INTELLIGENCE, 2022, 36 (01)
  • [43] Airport Detection Using End-to-End Convolutional Neural Network with Hard Example Mining
    Cai, Bowen
    Jiang, Zhiguo
    Zhang, Haopeng
    Zhao, Danpei
    Yao, Yuan
    REMOTE SENSING, 2017, 9 (11)
  • [44] An end-to-end convolutional network for estimating the essential matrix
    Yang, Ruiqi
    Zhang, Junhua
    Li, Bo
    IMAGE AND VISION COMPUTING, 2023, 130
  • [45] End-to-End Object Detection with Fully Convolutional Network
    Wang, Jianfeng
    Song, Lin
    Li, Zeming
    Sun, Hongbin
    Sun, Jian
    Zheng, Nanning
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 15844 - 15853
  • [46] End-to-end learning of convolutional neural net and dynamic programming for left ventricle segmentation
    Nguyen, Nhat M.
    Ray, Nilanjan
    MEDICAL IMAGING WITH DEEP LEARNING, VOL 121, 2020, 121 : 555 - 569
  • [47] Streaming Convolutional Neural Networks for End-to-End Learning With Multi-Megapixel Images
    Pinckaers, Hans
    van Ginneken, Bram
    Litjens, Geert
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (03) : 1581 - 1590
  • [48] End-to-End Control Chart Pattern Classification Using a 1D Convolutional Neural Network and Transfer Learning
    Cheng, Chuen-Sheng
    Ho, Ying
    Chiu, Tzu-Cheng
    PROCESSES, 2021, 9 (09)
  • [49] Inter-subject transfer learning with an end-to-end deep convolutional neural network for EEG-based BCI
    Fahimi, Fatemeh
    Zhang, Zhuo
    Goh, Wooi Boon
    Lee, Tih-Shi
    Ang, Kai Keng
    Guan, Cuntai
    JOURNAL OF NEURAL ENGINEERING, 2019, 16 (02)