Non-negative Matrix Based Optimization Scheme for Blind Source Separation in Automatic Speech Recognition System

Cited by: 0

Authors:

Santosh, Kumar S. ^{[1,2]}

Bharathi, S. H. ^{[3]}

Archana, M. ^{[4]}

Affiliations:

[1] SVCE, Dept E&CE, Bengaluru, India

[2] Reva Univ, Bengaluru, India

[3] Reva Univ, Sch ECE, Bengaluru, India

[4] SJCIT, Dept Math, Chikkaballapur, India

Source:

PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON COMMUNICATION AND ELECTRONICS SYSTEMS (ICCES) | 2016

Keywords:

Non negative matrix; Automatic Speech Recognition; Blind source separation; AUDIO SOURCE SEPARATION;

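DOI:

Not available

Chinese Library Classification:

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Discipline classification codes:

0808 ; 0809 ;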

Abstract:

Automatic speech recognition (ASR) systems are increasingly in demand for applications such as security and word-to-text conversion. During speech signal acquisition, unwanted signals from various sources are added to the original signal, which degrades the performance of the ASR system. These unwanted signals, referred to as noise or source mixing, are caused by multi-user recording, echo effects, and similar conditions. This issue motivates the development of an efficient algorithm for audio demixing, also called source separation. To address it, this work proposes a new approach to source separation based on non-negative factorization. The proposed work comprises three parts: modelling of the source mixing signal, design of a filter bank, and implementation of the source separation algorithm. The signal is modelled by combining two different channels acquired from different sources; the result is called the mixture signal. A filter bank is then designed using a scattering algorithm based on the wavelet transform, and an optimization problem is formulated for audio demixing. An experimental study shows the robustness of the proposed model across various implementation scenarios.
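
The abstract does not give the optimization problem itself, so the following is only a minimal sketch of the general idea of non-negative factorization applied to a mixture, not the authors' method. It assumes the standard multiplicative updates for the Euclidean cost, a synthetic non-negative matrix `V` standing in for the filter bank output of a two-source mixture, and a hypothetical helper name `nmf`; the wavelet scattering filter bank described in the abstract is not reproduced.

```python
import numpy as np


def nmf(V, rank, n_iter=200, eps=1e-9, seed=0):
    """Factor a non-negative matrix V (features x frames) as V ~ W @ H.

    Uses multiplicative updates for the Euclidean (Frobenius) cost.
    This is a generic illustration, not the scheme from the paper.
    """
    rng = np.random.default_rng(seed)
    n_features, n_frames = V.shape
    W = rng.random((n_features, rank)) + eps  # non-negative basis vectors
    H = rng.random((rank, n_frames)) + eps    # non-negative activations
    errors = []
    for _ in range(n_iter):
        # Multiplicative updates keep W and H non-negative by construction.
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
        errors.append(np.linalg.norm(V - W @ H))
    return W, H, errors


# Synthetic stand-in for a mixture: two non-negative "sources" summed.
rng = np.random.default_rng(1)
S1 = rng.random((16, 2)) @ rng.random((2, 40))
S2 = rng.random((16, 2)) @ rng.random((2, 40))
V = S1 + S2  # assumed mixture representation (features x frames)

W, H, errors = nmf(V, rank=4)

# Soft masks split the mixture into one estimate per component.
eps = 1e-9
model = W @ H + eps
parts = [V * (W[:, k:k + 1] @ H[k:k + 1, :]) / model for k in range(W.shape[1])]
```

Each entry of `parts` is one component's share of the mixture. Grouping components into sources is a separate step that this sketch leaves out, since the abstract does not say how the paper performs it.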

Pages: 782 - 787

Page count: 6

50 items in total

[21] MULTICHANNEL BLIND SOURCE SEPARATION BASED ON NON-NEGATIVE TENSOR FACTORIZATION IN WAVENUMBER DOMAIN
Mitsufuji, Yuki
Koyama, Shoichi
Saruwatari, Hiroshi
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 56 - 60
[22] Non-Negative Blind Source Separation Algorithm Based on Minimum Aperture Simplicial Cone
Ouedraogo, Wendyam Serge Boris
Souloumiac, Antoine
Jaidane, Meriem
Jutten, Christian
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2014, 62 (02) : 376 - 389
[23] Speech recognition in mixed sound of speech and music based on vector quantization and non-negative matrix factorization
Nakano, Shoichi
Yamamoto, Kazumasa
Nakagawa, Seiichi
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1792 - 1795
[24] A supervised non-negative matrix factorization model for speech emotion recognition
Hou, Mixiao
Li, Jinxing
Lu, Guangming
SPEECH COMMUNICATION, 2020, 124 : 13 - 20
[25] Perceptually Weighted Non-negative Matrix Factorization for Blind Single-Channel Music Source Separation
Kirbiz, S.
Gunsel, B.
2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 226 - 229
[26] Perceptually enhanced blind single-channel music source separation by Non-negative Matrix Factorization
Kirbiz, S.
Gunsel, B.
DIGITAL SIGNAL PROCESSING, 2013, 23 (02) : 646 - 658
[27] A PERCEPTUALLY ENHANCED BLIND SINGLE-CHANNEL AUDIO SOURCE SEPARATION BY NON-NEGATIVE MATRIX FACTORIZATION
Kirbiz, S.
Gunsel, B.
18TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2010), 2010, : 731 - 735
[28] SPEECH EMOTION RECOGNITION USING TRANSFER NON-NEGATIVE MATRIX FACTORIZATION
Song, Peng
Ou, Shifeng
Zheng, Wenming
Jin, Yun
Zhao, Li
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5180 - 5184
[29] SAR Automatic Target Recognition via Non-negative Matrix Approximations
Riasati, Vahid
Srinivas, Umamahesh
Monga, Vishal
AUTOMATIC TARGET RECOGNITION XXII, 2012, 8391
[30] Sound Source Separation Based on Multichannel Non-negative Matrix Factorization with Weighted Averaging
Yamamoto, Tsuyoshi
Uenohara, Shingo
Nishijima, Keisuke
Furuya, Ken'ichi
COMPLEX, INTELLIGENT AND SOFTWARE INTENSIVE SYSTEMS, 2021, 1194 : 177 - 187
