Post-processing of Translated Speech by Pole Modification and Residual Enhancement to Improve Perceptual Quality

Cited by: 0
Authors
Arya, Lalaram [1]
Prasanna, S. R. Mahadeva [1]
Affiliations
[1] Indian Inst Technol, Dharwad 580011, Karnataka, India
Source
SPEECH AND COMPUTER, SPECOM 2023, PT I | 2023 / Vol. 14338
Keywords
Speech-to-speech translation (S2ST); Weighted LP residual; Speech enhancement; Pole modification
DOI
10.1007/978-3-031-48309-7_19
Chinese Library Classification
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
The perceptual quality of translated speech depends on the quantity of speech data used for training. Translated speech quality is poor when the system is trained on little data, and it improves as more speech data is gradually added for training. This work demonstrates the significance of post-processing translated speech with signal processing to improve perceptual quality. First, the original residual of the target speech is used to replace the residual of the translated speech. The residual is then replaced with a weighted residual obtained by speech enhancement. Pole modification of the translated speech is also performed. Finally, the weighted residual and pole modification are combined. All the experiments show an improvement in perceptual quality.
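The abstract does not spell out how the residual or the poles are manipulated, so the following is only a minimal sketch of the general linear-prediction (LP) machinery such post-processing usually builds on, not the authors' method. It estimates LP coefficients for one frame, extracts the LP residual by inverse filtering, and resynthesizes the frame through the all-pole filter. As a stand-in for pole modification it uses a simple, well-known operation: replacing A(z) with A(z/gamma), which scales every pole radius by gamma. All function names, the synthetic test signal, the order of 2, and gamma = 0.95 are assumptions chosen for illustration; windowing, framing, and residual weighting are omitted.

```python
import math
import random

def autocorr(x, max_lag):
    """Autocorrelation r[0..max_lag] of one frame (no normalization)."""
    n = len(x)
    return [sum(x[i] * x[i + k] for i in range(n - k)) for k in range(max_lag + 1)]

def levinson_durbin(r, order):
    """Solve the LP normal equations; returns [1, a1, ..., ap] for A(z)."""
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err                      # reflection coefficient
        prev = a[:]
        for j in range(1, i):
            a[j] = prev[j] + k * prev[i - j]
        a[i] = k
        err *= 1.0 - k * k                  # updated prediction error
    return a

def lp_residual(x, a):
    """Inverse filter A(z): e[n] = sum_k a[k] * x[n-k], zero initial state."""
    p = len(a) - 1
    return [sum(a[k] * x[n - k] for k in range(min(p, n) + 1))
            for n in range(len(x))]

def lp_synthesize(e, a):
    """All-pole filter 1/A(z): y[n] = e[n] - sum_{k>=1} a[k] * y[n-k]."""
    y = []
    for n in range(len(e)):
        acc = e[n]
        for k in range(1, min(len(a) - 1, n) + 1):
            acc -= a[k] * y[n - k]
        y.append(acc)
    return y

def scale_pole_radii(a, gamma):
    """Illustrative pole modification: A(z/gamma) moves each pole p to gamma*p."""
    return [c * gamma ** k for k, c in enumerate(a)]

# Synthetic frame (assumption): noise shaped by one resonance at radius 0.9.
random.seed(0)
true_a = [1.0, -2 * 0.9 * math.cos(math.pi / 4), 0.81]
x = lp_synthesize([random.gauss(0.0, 1.0) for _ in range(400)], true_a)

a = levinson_durbin(autocorr(x, 2), 2)   # spectral envelope of the frame
e = lp_residual(x, a)                    # excitation (LP residual)
y = lp_synthesize(e, a)                  # unmodified resynthesis, recovers x
a_mod = scale_pole_radii(a, 0.95)        # poles pulled toward the origin
y_mod = lp_synthesize(e, a_mod)          # frame with a flattened resonance
```

In this decomposition, residual replacement amounts to passing a different excitation (for example, a residual taken from another signal or an enhanced one) to `lp_synthesize` while keeping the same coefficients; choosing gamma above 1 would sharpen resonances instead, at the risk of instability if a pole leaves the unit circle.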
Pages: 222-232
Page count: 11