Post-processing of Translated Speech by Pole Modification and Residual Enhancement to Improve Perceptual Quality

被引:0
|
作者
Arya, Lalaram [1 ]
Prasanna, S. R. Mahadeva [1 ]
机构
[1] Indian Inst Technol, Dharwad 580011, Karnataka, India
来源
SPEECH AND COMPUTER, SPECOM 2023, PT I | 2023年 / 14338卷
关键词
Speech-to-speech translation (S2ST); Weighted LP residual; Speech enhancement; Pole modification;
D O I
10.1007/978-3-031-48309-7_19
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The perceptual quality of translated speech depends on the quantity of speech data used for training. The translation speech quality is poor when the system is trained with less data. The quality improves by gradually adding more speech data for training. This work demonstrates the significance of post-processing of translated speech by signal processing for improving perceptual quality. Initially, the target speech original residual is used to replace the translated speech residual. It is then replaced using the weighted residual obtained by speech enhancement. The pole modification of translated speech is also done. Finally, both weighted residual and pole modifications are combined. All the experiments show improvement in perceptual quality.
引用
收藏
页码:222 / 232
页数:11
相关论文
共 50 条
  • [41] Post-Processing of the Recognized Speech for Web Presentation of Large Audio Archive
    Bohac, Marek
    Blavka, Karel
    Kucharova, Michaela
    Skodova, Svatava
    2012 35TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2012, : 441 - 445
  • [42] A POST-PROCESSING TECHNIQUE FOR REGENERATION OF OVER-ATTENUATED SPEECH COMPONENTS
    Ding, Huijun
    Soon, Ing Yann
    Koh, Soo Ngee
    Yeo, Chai Kiat
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3889 - +
  • [43] Improvement of the post-processing algorithm in AMR-NB speech codec
    Hou, Jingyu
    Zhao, Shenghui
    Li, Huinan
    PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE OF INFORMATION AND COMMUNICATION TECHNOLOGY, 2021, 183 : 349 - 354
  • [44] Improvement of Speech Recognition Accuracy Using Post-processing of Recognized Text
    Rudzionis, Vytautas
    Malukas, Ugnius
    Danieliene, Renata
    INFORMATION AND SOFTWARE TECHNOLOGIES, ICIST 2022, 2022, 1665 : 265 - 270
  • [45] The effectiveness of corpus-induced dependency grammars for post-processing speech
    Harper, MP
    White, CM
    Wang, W
    Johnson, MT
    Helzerman, RA
    6TH APPLIED NATURAL LANGUAGE PROCESSING CONFERENCE/1ST MEETING OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE AND PROCEEDINGS OF THE ANLP-NAACL 2000 STUDENT RESEARCH WORKSHOP, 2000, : A102 - A109
  • [46] Multi-objective Learning and Mask-based Post-processing for Deep Neural Network based Speech Enhancement
    Xu, Yong
    Du, Jun
    Huang, Zhen
    Dai, Li-Rong
    Lee, Chin-Hui
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1508 - 1512
  • [47] A multi-objective learning speech enhancement algorithm based on IRM post-processing with joint estimation of SCNN and TCNN
    Li, Ruwei
    Sun, Xiaoyue
    Li, Tao
    Zhao, Fengnian
    DIGITAL SIGNAL PROCESSING, 2020, 101
  • [48] Modification of the CVD-graphene resistivity by post-processing sample annealing
    Tonkov, D. N.
    Gasumyants, V. E.
    Vasilyeva, E. S.
    Koltsova, T. S.
    Larionova, T., V
    Tolochko, O., V
    CHINESE JOURNAL OF PHYSICS, 2021, 74 : 256 - 261
  • [49] Denoising diffusion post-processing for low-light image enhancement
    Panagiotou, Savvas
    Bosman, Anna S.
    PATTERN RECOGNITION, 2024, 156
  • [50] Post-processing 4DCT to improve delineation of heart substructures
    Van Herk, M.
    McWilliam, A.
    Banfill, K.
    Faivre-Finn, C.
    RADIOTHERAPY AND ONCOLOGY, 2020, 152 : S967 - S967