Post-processing of Translated Speech by Pole Modification and Residual Enhancement to Improve Perceptual Quality

Authors
Arya, Lalaram [1]
Prasanna, S. R. Mahadeva [1]
Affiliation
[1] Indian Inst Technol, Dharwad 580011, Karnataka, India
Keywords
Speech-to-speech translation (S2ST); Weighted LP residual; Speech enhancement; Pole modification
DOI
10.1007/978-3-031-48309-7_19
Chinese Library Classification (CLC)
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
The perceptual quality of translated speech depends on the amount of speech data used to train the translation system. Quality is poor when the system is trained on little data and improves as more training data is gradually added. This work demonstrates that post-processing the translated speech with signal processing techniques can improve its perceptual quality. First, the residual of the translated speech is replaced with the original residual of the target speech. Next, it is replaced with a weighted residual obtained by speech enhancement. Pole modification of the translated speech is also performed. Finally, the weighted residual and pole modification are combined. All of the experiments show an improvement in perceptual quality.
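The abstract does not give implementation details, so the following is only a minimal sketch of the two generic operations it names, under stated assumptions: linear prediction (LP) by the autocorrelation method, residual replacement by exciting one signal's all-pole filter with another signal's residual, and pole modification approximated by uniform radial scaling of the LP poles. The function names, the toy signals, the LP order of 4, and the scaling factor 0.97 are all hypothetical; the whole signal is treated as one frame, and the enhancement-based residual weighting is not shown.

```python
import math

def lp_coefficients(x, order):
    """Autocorrelation-method LP via Levinson-Durbin.
    Returns [1, a1, ..., ap] for A(z) = 1 + sum_k a_k z^-k."""
    n = len(x)
    r = [sum(x[i] * x[i + k] for i in range(n - k)) for k in range(order + 1)]
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err                      # reflection coefficient
        new_a = a[:]
        for j in range(1, i):
            new_a[j] = a[j] + k * a[i - j]
        new_a[i] = k
        a = new_a
        err *= 1.0 - k * k                  # updated prediction error
    return a

def inverse_filter(x, a):
    """LP residual: e[n] = sum_k a[k] * x[n-k] (FIR filter A(z))."""
    p = len(a) - 1
    return [sum(a[k] * x[n - k] for k in range(min(p, n) + 1))
            for n in range(len(x))]

def synthesize(e, a):
    """All-pole filter 1/A(z): y[n] = e[n] - sum_{k>=1} a[k] * y[n-k]."""
    y = []
    for n in range(len(e)):
        acc = e[n]
        for k in range(1, min(len(a) - 1, n) + 1):
            acc -= a[k] * y[n - k]
        y.append(acc)
    return y

def scale_poles(a, gamma):
    """Replace A(z) by A(z/gamma), which multiplies every pole radius by gamma.
    Assumption: a stand-in for pole modification, not the authors' rule.
    gamma > 1 can push poles outside the unit circle (unstable filter)."""
    return [c * gamma ** k for k, c in enumerate(a)]

# Toy stand-ins for translated and target speech (hypothetical signals).
n_samples = 200
translated = [math.sin(0.3 * n) + 0.5 * math.sin(1.1 * n)
              + 0.1 * math.sin(0.05 * n * n) for n in range(n_samples)]
target = [math.sin(0.25 * n) + 0.4 * math.sin(0.9 * n)
          + 0.1 * math.sin(0.07 * n * n) for n in range(n_samples)]

order = 4
a_trans = lp_coefficients(translated, order)
a_targ = lp_coefficients(target, order)
res_trans = inverse_filter(translated, a_trans)
res_targ = inverse_filter(target, a_targ)

# Sanity check: a signal's own residual through its own filter rebuilds it.
rebuilt = synthesize(res_trans, a_trans)

# Residual replacement: translated-speech filter, target-speech residual.
swapped = synthesize(res_targ, a_trans)

# Combined: scaled poles plus the replaced residual.
a_mod = scale_poles(a_trans, 0.97)
combined = synthesize(res_targ, a_mod)
```

In practice the analysis would run frame by frame with windowing and overlap-add, and a pole modification would more likely target individual formant poles than scale all of them uniformly.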
Pages: 222-232
Page count: 11