Post-processing of Translated Speech by Pole Modification and Residual Enhancement to Improve Perceptual Quality

被引：0

作者：

Arya, Lalaram ^{[1
]}

Prasanna, S. R. Mahadeva ^{[1
]}

机构：

[1] Indian Inst Technol, Dharwad 580011, Karnataka, India

来源：

SPEECH AND COMPUTER, SPECOM 2023, PT I | 2023年 / 14338卷

关键词：

Speech-to-speech translation (S2ST); Weighted LP residual; Speech enhancement; Pole modification;

D O I：

10.1007/978-3-031-48309-7_19

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

The perceptual quality of translated speech depends on the quantity of speech data used for training. The translation speech quality is poor when the system is trained with less data. The quality improves by gradually adding more speech data for training. This work demonstrates the significance of post-processing of translated speech by signal processing for improving perceptual quality. Initially, the target speech original residual is used to replace the translated speech residual. It is then replaced using the weighted residual obtained by speech enhancement. The pole modification of translated speech is also done. Finally, both weighted residual and pole modifications are combined. All the experiments show improvement in perceptual quality.

引用

页码：222 / 232

页数：11

共 50 条

[41] Post-Processing of the Recognized Speech for Web Presentation of Large Audio Archive
Bohac, Marek
Blavka, Karel
Kucharova, Michaela
Skodova, Svatava
2012 35TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2012, : 441 - 445
[42] A POST-PROCESSING TECHNIQUE FOR REGENERATION OF OVER-ATTENUATED SPEECH COMPONENTS
Ding, Huijun
Soon, Ing Yann
Koh, Soo Ngee
Yeo, Chai Kiat
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3889 - +
[43] Improvement of the post-processing algorithm in AMR-NB speech codec
Hou, Jingyu
Zhao, Shenghui
Li, Huinan
PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE OF INFORMATION AND COMMUNICATION TECHNOLOGY, 2021, 183 : 349 - 354
[44] Improvement of Speech Recognition Accuracy Using Post-processing of Recognized Text
Rudzionis, Vytautas
Malukas, Ugnius
Danieliene, Renata
INFORMATION AND SOFTWARE TECHNOLOGIES, ICIST 2022, 2022, 1665 : 265 - 270
[45] The effectiveness of corpus-induced dependency grammars for post-processing speech
Harper, MP
White, CM
Wang, W
Johnson, MT
Helzerman, RA
6TH APPLIED NATURAL LANGUAGE PROCESSING CONFERENCE/1ST MEETING OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE AND PROCEEDINGS OF THE ANLP-NAACL 2000 STUDENT RESEARCH WORKSHOP, 2000, : A102 - A109
[46] Multi-objective Learning and Mask-based Post-processing for Deep Neural Network based Speech Enhancement
Xu, Yong
Du, Jun
Huang, Zhen
Dai, Li-Rong
Lee, Chin-Hui
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1508 - 1512
[47] A multi-objective learning speech enhancement algorithm based on IRM post-processing with joint estimation of SCNN and TCNN
Li, Ruwei
Sun, Xiaoyue
Li, Tao
Zhao, Fengnian
DIGITAL SIGNAL PROCESSING, 2020, 101
[48] Modification of the CVD-graphene resistivity by post-processing sample annealing
Tonkov, D. N.
Gasumyants, V. E.
Vasilyeva, E. S.
Koltsova, T. S.
Larionova, T., V
Tolochko, O., V
CHINESE JOURNAL OF PHYSICS, 2021, 74 : 256 - 261
[49] Denoising diffusion post-processing for low-light image enhancement
Panagiotou, Savvas
Bosman, Anna S.
PATTERN RECOGNITION, 2024, 156
[50] Post-processing 4DCT to improve delineation of heart substructures
Van Herk, M.
McWilliam, A.
Banfill, K.
Faivre-Finn, C.
RADIOTHERAPY AND ONCOLOGY, 2020, 152 : S967 - S967

← 1 2 3 4 5 →