Ensemble System of Deep Neural Networks for Single-Channel Audio Separation

被引:2
|
作者
Al-Kaltakchi, Musab T. S. [1 ]
Mohammad, Ahmad Saeed [2 ]
Woo, Wai Lok [3 ]
机构
[1] Mustansiriyah Univ, Coll Engn, Dept Elect Engn, Baghdad, Iraq
[2] Mustansiriyah Univ, Coll Engn, Dept Comp Engn, Baghdad, Iraq
[3] Northumbria Univ, Dept Comp & Informat Sci, Newcastle Upon Tyne NE1 8ST, England
关键词
single-channel audio separation; deep neural networks; ideal binary mask; feature fusion; EXTREME LEARNING-MACHINE; NONNEGATIVE MATRIX FACTORIZATION; SPEECH SEPARATION; ALGORITHM;
D O I
10.3390/info14070352
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Speech separation is a well-known problem, especially when there is only one sound mixture available. Estimating the Ideal Binary Mask (IBM) is one solution to this problem. Recent research has focused on the supervised classification approach. The challenge of extracting features from the sources is critical for this method. Speech separation has been accomplished by using a variety of feature extraction models. The majority of them, however, are concentrated on a single feature. The complementary nature of various features have not been thoroughly investigated. In this paper, we propose a deep neural network (DNN) ensemble architecture to completely explore the complimentary nature of the diverse features obtained from raw acoustic features. We examined the penultimate discriminative representations instead of employing the features acquired from the output layer. The learned representations were also fused to produce a new features vector, which was then classified by using the Extreme Learning Machine (ELM). In addition, a genetic algorithm (GA) was created to optimize the parameters globally. The results of the experiments showed that our proposed system completely considered various features and produced a high-quality IBM under different conditions.
引用
收藏
页数:24
相关论文
共 50 条
  • [41] TASNET: TIME-DOMAIN AUDIO SEPARATION NETWORK FOR REAL-TIME, SINGLE-CHANNEL SPEECH SEPARATION
    Luo, Yi
    Mesgarani, Nima
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 696 - 700
  • [42] SINGLE-CHANNEL SOURCE SEPARATION OF AUDIO SIGNALS USING BARK SCALE WAVELET PACKET DECOMPOSITION
    Litvin, Yevgeni
    Cohen, Israel
    2009 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING, 2009, : 305 - 308
  • [43] Single-Channel Source Separation of Audio Signals Using Bark Scale Wavelet Packet Decomposition
    Litvin, Yevgeni
    Cohen, Israel
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2011, 65 (03): : 339 - 350
  • [44] Deep clustering-based single-channel speech separation and recent advances
    Aihara, Ryo
    Wichern, Gordon
    Le Roux, Jonathan
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2020, 41 (02) : 465 - 471
  • [45] Single-Channel Source Separation of Audio Signals Using Bark Scale Wavelet Packet Decomposition
    Yevgeni Litvin
    Israel Cohen
    Journal of Signal Processing Systems, 2011, 65 : 339 - 350
  • [46] An Ensemble of Convolutional Neural Networks for Audio Classification
    Nanni, Loris
    Maguolo, Gianluca
    Brahnam, Sheryl
    Paci, Michelangelo
    APPLIED SCIENCES-BASEL, 2021, 11 (13):
  • [47] Single channel audio source separation
    Gao, Bin
    Woo, W.L.
    Dlay, S.S.
    WSEAS Transactions on Signal Processing, 2008, 4 (04): : 173 - 182
  • [48] A blind signal separation method for single-channel electromagnetic surveillance system
    Pang, Lihui
    Qi, Zhilong
    Li, Shuai
    Tang, Bin
    INTERNATIONAL JOURNAL OF ELECTRONICS, 2015, 102 (10) : 1634 - 1651
  • [49] GAUSSIAN DENSITY GUIDED DEEP NEURAL NETWORK FOR SINGLE-CHANNEL SPEECH ENHANCEMENT
    Chai, Li
    Du, Jun
    Wang, Yan-nan
    2017 IEEE 27TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING, 2017,
  • [50] NOISE-ADAPTIVE DEEP NEURAL NETWORK FOR SINGLE-CHANNEL SPEECH ENHANCEMENT
    Chung, Hanwook
    Kim, Taesup
    Plourde, Eric
    Champagne, Benoit
    2018 IEEE 28TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2018,