Ensemble System of Deep Neural Networks for Single-Channel Audio Separation

被引:2
|
作者
Al-Kaltakchi, Musab T. S. [1 ]
Mohammad, Ahmad Saeed [2 ]
Woo, Wai Lok [3 ]
机构
[1] Mustansiriyah Univ, Coll Engn, Dept Elect Engn, Baghdad, Iraq
[2] Mustansiriyah Univ, Coll Engn, Dept Comp Engn, Baghdad, Iraq
[3] Northumbria Univ, Dept Comp & Informat Sci, Newcastle Upon Tyne NE1 8ST, England
关键词
single-channel audio separation; deep neural networks; ideal binary mask; feature fusion; EXTREME LEARNING-MACHINE; NONNEGATIVE MATRIX FACTORIZATION; SPEECH SEPARATION; ALGORITHM;
D O I
10.3390/info14070352
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Speech separation is a well-known problem, especially when there is only one sound mixture available. Estimating the Ideal Binary Mask (IBM) is one solution to this problem. Recent research has focused on the supervised classification approach. The challenge of extracting features from the sources is critical for this method. Speech separation has been accomplished by using a variety of feature extraction models. The majority of them, however, are concentrated on a single feature. The complementary nature of various features have not been thoroughly investigated. In this paper, we propose a deep neural network (DNN) ensemble architecture to completely explore the complimentary nature of the diverse features obtained from raw acoustic features. We examined the penultimate discriminative representations instead of employing the features acquired from the output layer. The learned representations were also fused to produce a new features vector, which was then classified by using the Extreme Learning Machine (ELM). In addition, a genetic algorithm (GA) was created to optimize the parameters globally. The results of the experiments showed that our proposed system completely considered various features and produced a high-quality IBM under different conditions.
引用
收藏
页数:24
相关论文
共 50 条
  • [21] Improving Single-Network Single-Channel Separation of Musical Audio with Convolutional Layers
    Roma, Gerard
    Green, Owen
    Tremblay, Pierre Alexandre
    LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION (LVA/ICA 2018), 2018, 10891 : 306 - 315
  • [22] Unsupervised Single-Channel Speech Separation via Deep Neural Network for Different Gender Mixtures
    Wang, Yannan
    Du, Jun
    Dai, Li-Rong
    Lee, Chin-Hui
    2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
  • [23] Deep Clustering in Complex Domain for Single-Channel Speech Separation
    Liu, Runling
    Tang, Yu
    Mang, Hongwei
    2022 IEEE 17TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2022, : 1463 - 1468
  • [24] A VQ-based Single-Channel Audio Separation for Music/Speech Mixtures
    Asgari, Meysam
    Fallah, Mahdi
    Mehrizi, Elahe Abouie
    Mostafavi, Ali
    UKSIM 2009: ELEVENTH INTERNATIONAL CONFERENCE ON COMPUTER MODELLING AND SIMULATION, 2009, : 223 - +
  • [25] Single-Channel Signal Separation and Deconvolution with Generative Adversarial Networks
    Kong, Qiuqiang
    Xu, Yong
    Jackson, Philip J. B.
    Wang, Wenwu
    Plumbley, Mark D.
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2747 - 2753
  • [26] Single Channel Speech Source Separation Using Hierarchical Deep Neural Networks
    Noorani, Seyed Majid
    Seyedin, Sanaz
    2020 28TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2020, : 466 - 470
  • [27] Joint Optimization of Perceptual Gain Function and Deep Neural Networks for Single-Channel Speech Enhancement
    Han, Wei
    Zhang, Xiongwei
    Min, Gang
    Zhou, Xingyu
    Sun, Meng
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2017, E100A (02) : 714 - 717
  • [28] JOINT OPTIMIZATION OF AUDIBLE NOISE SUPPRESSION AND DEEP NEURAL NETWORKS FOR SINGLE-CHANNEL SPEECH ENHANCEMENT
    Han, Wei
    Zhang, Xiongwei
    Min, Gang
    Sun, Meng
    Yang, Jibin
    2016 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2016,
  • [29] Deep Neural Network for Supervised Single-Channel Speech Enhancement
    Saleem, Nasir
    Irfan Khattak, Muhammad
    Ali, Muhammad Yousaf
    Shafi, Muhammad
    ARCHIVES OF ACOUSTICS, 2019, 44 (01) : 3 - 12
  • [30] Joint constraint algorithm based on deep neural network with dual outputs for single-channel speech separation
    Sun, Linhui
    Zhu, Ge
    Li, Pingan
    SIGNAL IMAGE AND VIDEO PROCESSING, 2020, 14 (07) : 1387 - 1395