Non-negative Matrix Based Optimization Scheme for Blind Source Separation in Automatic Speech Recognition System

被引:0
|
作者
Santosh, Kumar S. [1 ,2 ]
Bharathi, S. H. [3 ]
Archana, M. [4 ]
机构
[1] SVCE, Dept E&CE, Bengaluru, India
[2] Reva Univ, Bengaluru, India
[3] Reva Univ, Sch ECE, Bengaluru, India
[4] SJCIT, Dept Math, Chikkaballapur, India
关键词
Non negative matrix; Automatic Spech Recognition; Blind source seperation; AUDIO SOURCE SEPARATION;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Recently, use of automatic speech recognition system is demanded for various applications such as security, word to text conversion etc. During the speech signal acquisition, other unwanted signals from various sources are added to the original signal which degrades the performance of ASR system. These unwanted signals are called as noise or mixing of sources which are caused due to multi-user recording, echo effect etc. this issues motivates to develop an efficient algorithm for audio demixing or source separation. To address this issue in this work we propose a new approach for source separation method using nonnegative factorization method. Proposed work utilized source mixing signal modelling, filter bank designing and source separation algorithm implementation. Modelling of signal is performed by combining two different channels which are acquired from different source, this signal is called mixture signal. Later a filter bank is designed using scattering algorithm based on wavelet transform method and a optimization problem is formulated for audio demixing. Experimental study shows the robustness of proposed model by considering various implementation scenarios.
引用
收藏
页码:782 / 787
页数:6
相关论文
共 50 条
  • [41] Audio Source Separation Method Based on Beamspace-domain Multichannel Non-negative Matrix Factorization, Part I: Beamspace-domain Multichannel Non-negative Matrix Factorization system
    Lee, Seokjin
    Park, Sang Ha
    Sung, Koeng-Mo
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2012, 31 (05): : 317 - 331
  • [42] NON-NEGATIVE SOURCE-FILTER DYNAMICAL SYSTEM FOR SPEECH ENHANCEMENT
    Simsekli, Umut
    Le Roux, Jonathan
    Hershey, John R.
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [43] A Robust Blind Source Separation Algorithm Based on Non-negative Matrix Factorization and Frequency-Sliding Generalized Cross-Correlation
    Wang, Shiting
    Zhou, Yi
    Yang, Xiuxiang
    Liu, Hongqing
    2021 IEEE STATISTICAL SIGNAL PROCESSING WORKSHOP (SSP), 2021, : 231 - 235
  • [44] CUSTOM SIZED NON-NEGATIVE MATRIX FACTOR DECONVOLUTION FOR SOUND SOURCE SEPARATION
    Becker, Julian M.
    Rohlfing, Christian
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [45] Computational decomposition of molecular signatures based on blind source separation of non-negative dependent sources with NMF
    Zhang, JY
    Wei, L
    Wang, Y
    2003 IEEE XIII WORKSHOP ON NEURAL NETWORKS FOR SIGNAL PROCESSING - NNSP'03, 2003, : 409 - 418
  • [46] A convex analysis based criterion for blind separation of non-negative sources
    Chan, Tsung-Han
    Ma, Wing-Kin
    Chi, Chong-Yung
    Wang, Yue
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PTS 1-3, PROCEEDINGS, 2007, : 961 - +
  • [47] Non-negative matrix factorization for target recognition
    Long, Hong-Lin
    Pi, Yi-Ming
    Cao, Zong-Jie
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2010, 38 (06): : 1425 - 1429
  • [48] Non-negative matrix factorization for face recognition
    Guillamet, D
    Vitriá, J
    TOPICS IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2002, 2504 : 336 - 344
  • [49] Blind source separation of molecular components of the human skin in vivo: non-negative matrix factorization of Raman microspectroscopy data
    Yakimov, B. P.
    Venets, A. V.
    Schleusener, J.
    Fadeev, V. V.
    Lademann, J.
    Shirshin, E. A.
    Darvin, M. E.
    ANALYST, 2021, 146 (10) : 3185 - 3196
  • [50] Estimation of the number of blind sources based on non-negative matrix factorization
    Li, Ning
    Shi, Tielin
    Zhongguo Jixie Gongcheng/China Mechanical Engineering, 2007, 18 (22): : 2734 - 2737