Non-negative Matrix Based Optimization Scheme for Blind Source Separation in Automatic Speech Recognition System

被引:0
|
作者
Santosh, Kumar S. [1 ,2 ]
Bharathi, S. H. [3 ]
Archana, M. [4 ]
机构
[1] SVCE, Dept E&CE, Bengaluru, India
[2] Reva Univ, Bengaluru, India
[3] Reva Univ, Sch ECE, Bengaluru, India
[4] SJCIT, Dept Math, Chikkaballapur, India
关键词
Non negative matrix; Automatic Spech Recognition; Blind source seperation; AUDIO SOURCE SEPARATION;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Recently, use of automatic speech recognition system is demanded for various applications such as security, word to text conversion etc. During the speech signal acquisition, other unwanted signals from various sources are added to the original signal which degrades the performance of ASR system. These unwanted signals are called as noise or mixing of sources which are caused due to multi-user recording, echo effect etc. this issues motivates to develop an efficient algorithm for audio demixing or source separation. To address this issue in this work we propose a new approach for source separation method using nonnegative factorization method. Proposed work utilized source mixing signal modelling, filter bank designing and source separation algorithm implementation. Modelling of signal is performed by combining two different channels which are acquired from different source, this signal is called mixture signal. Later a filter bank is designed using scattering algorithm based on wavelet transform method and a optimization problem is formulated for audio demixing. Experimental study shows the robustness of proposed model by considering various implementation scenarios.
引用
收藏
页码:782 / 787
页数:6
相关论文
共 50 条
  • [21] MULTICHANNEL BLIND SOURCE SEPARATION BASED ON NON-NEGATIVE TENSOR FACTORIZATION IN WAVENUMBER DOMAIN
    Mitsufuji, Yuki
    Koyama, Shoichi
    Saruwatari, Hiroshi
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 56 - 60
  • [22] Non-Negative Blind Source Separation Algorithm Based on Minimum Aperture Simplicial Cone
    Ouedraogo, Wendyam Serge Boris
    Souloumiac, Antoine
    Jaidane, Meriem
    Jutten, Christian
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2014, 62 (02) : 376 - 389
  • [23] Speech recognition in mixed sound of speech and music based on vector quantization and non-negative matrix factorization
    Nakano, Shoichi
    Yamamoto, Kazumasa
    Nakagawa, Seiichi
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1792 - 1795
  • [24] A supervised non-negative matrix factorization model for speech emotion recognition
    Hou, Mixiao
    Li, Jinxing
    Lu, Guangming
    SPEECH COMMUNICATION, 2020, 124 : 13 - 20
  • [25] Perceptually Weighted Non-negative Matrix Factorization for Blind Single-Channel Music Source Separation
    Kirbiz, S.
    Gunsel, B.
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 226 - 229
  • [26] Perceptually enhanced blind single-channel music source separation by Non-negative Matrix Factorization
    Kirbiz, S.
    Gunsel, B.
    DIGITAL SIGNAL PROCESSING, 2013, 23 (02) : 646 - 658
  • [27] A PERCEPTUALLY ENHANCED BLIND SINGLE-CHANNEL AUDIO SOURCE SEPARATION BY NON-NEGATIVE MATRIX FACTORIZATION
    Kirbiz, S.
    Gunsel, B.
    18TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2010), 2010, : 731 - 735
  • [28] SPEECH EMOTION RECOGNITION USING TRANSFER NON-NEGATIVE MATRIX FACTORIZATION
    Song, Peng
    Ou, Shifeng
    Zheng, Wenming
    Jin, Yun
    Zhao, Li
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5180 - 5184
  • [29] SAR Automatic Target Recognition via Non-negative Matrix Approximations
    Riasati, Vahid
    Srinivas, Umamahesh
    Monga, Vishal
    AUTOMATIC TARGET RECOGNITION XXII, 2012, 8391
  • [30] Sound Source Separation Based on Multichannel Non-negative Matrix Factorization with Weighted Averaging
    Yamamoto, Tsuyoshi
    Uenohara, Shingo
    Nishijima, Keisuke
    Furuya, Ken'ichi
    COMPLEX, INTELLIGENT AND SOFTWARE INTENSIVE SYSTEMS, 2021, 1194 : 177 - 187