Lightweight deep convolutional neural network for background sound classification in speech signals

被引:6
|
作者
Dayal, Aveen [1 ]
Yeduri, Sreenivasa Reddy [1 ]
Koduru, Balu Harshavardan [1 ]
Jaiswal, Rahul Kumar [1 ]
Soumya, J. [2 ]
Srinivas, M. B. [2 ]
Pandey, Om Jee [3 ]
Cenkeramaddi, Linga Reddy [1 ]
机构
[1] Univ Agder, Dept ICT, N-4879 Grimstad, Norway
[2] Birla Inst Technol & Sci Pilani, Hyderabad, India
[3] IIT BHU Varanasi, Dept Elect Engn, Varanasi 221005, Uttar Pradesh, India
来源
关键词
RECOGNITION; PATTERN;
D O I
10.1121/10.0010257
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Recognizing background information in human speech signals is a task that is extremely useful in a wide range of practical applications, and many articles on background sound classification have been published. It has not, however, been addressed with background embedded in real-world human speech signals. Thus, this work proposes a lightweight deep convolutional neural network (CNN) in conjunction with spectrograms for an efficient background sound classification with practical human speech signals. The proposed model classifies 11 different background sounds such as airplane, airport, babble, car, drone, exhibition, helicopter, restaurant, station, street, and train sounds embedded in human speech signals. The proposed deep CNN model consists of four convolution layers, four max-pooling layers, and one fully connected layer. The model is tested on human speech signals with varying signal-to-noise ratios (SNRs). Based on the results, the proposed deep CNN model utilizing spectrograms achieves an overall background sound classification accuracy of 95.2% using the human speech signals with a wide range of SNRs. It is also observed that the proposed model outperforms the benchmark models in terms of both accuracy and inference time when evaluated on edge computing devices. (C) 2022 Acoustical Society of America.
引用
收藏
页码:2773 / 2786
页数:14
相关论文
共 50 条
  • [41] Contextual background modeling using deep convolutional neural network
    Vijayan, Midhula
    Mohan, R.
    Raguraman, Preeth
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (15-16) : 11083 - 11105
  • [42] Deep Convolutional Neural Network Approach for Classification of Poems
    Deshmukh, Rushali
    Kiwelekar, Arvind W.
    INTELLIGENT HUMAN COMPUTER INTERACTION, IHCI 2021, 2022, 13184 : 74 - 88
  • [43] A novel deep convolutional neural network for arrhythmia classification
    Dang, Hao
    Sun, Muyi
    Zhang, Guanhong
    Zhou, Xiaoguang
    Chang, Qing
    Xu, Xiangdong
    2019 INTERNATIONAL CONFERENCE ON ADVANCED MECHATRONIC SYSTEMS (ICAMECHS), 2019, : 7 - 11
  • [44] Fetal Distress Classification with Deep Convolutional Neural Network
    Singh, Harman Deep
    Saini, Munish
    Kaur, Jasdeep
    CURRENT WOMENS HEALTH REVIEWS, 2021, 17 (01) : 60 - 73
  • [45] DeepDocClassifier: Document Classification with Deep Convolutional Neural Network
    Afzal, Muhammad Zeshan
    Capobianco, Samuele
    Malik, Muhammad Imran
    Marinai, Simone
    Breuel, Thomas M.
    Dengel, Andreas
    Liwicki, Marcus
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 1111 - 1115
  • [46] A deep residual convolutional neural network for mineral classification
    Agrawal, Neelam
    Govil, Himanshu
    ADVANCES IN SPACE RESEARCH, 2023, 71 (08) : 3186 - 3202
  • [47] Fingerprint Classification using a Deep Convolutional Neural Network
    Pandya, Bhavesh
    Cosma, Georgina
    Alani, Ali A.
    Taherkhani, Aboozar
    Bharadi, Vinayak
    McGinnity, T. M.
    2018 4TH INTERNATIONAL CONFERENCE ON INFORMATION MANAGEMENT (ICIM2018), 2018, : 86 - 91
  • [48] Gemstone Classification Using Deep Convolutional Neural Network
    Chakraborty B.
    Mukherjee R.
    Das S.
    Journal of The Institution of Engineers (India): Series B, 2024, 105 (04) : 773 - 785
  • [49] Contextual background modeling using deep convolutional neural network
    Midhula Vijayan
    R. Mohan
    Preeth Raguraman
    Multimedia Tools and Applications, 2020, 79 : 11083 - 11105
  • [50] A deep convolutional neural network for video sequence background subtraction
    Babaee, Mohammadreza
    Duc Tung Dinh
    Rigoll, Gerhard
    PATTERN RECOGNITION, 2018, 76 : 635 - 649