Post-processing of Translated Speech by Pole Modification and Residual Enhancement to Improve Perceptual Quality

被引：0

作者：

Arya, Lalaram ^{[1
]}

Prasanna, S. R. Mahadeva ^{[1
]}

机构：

[1] Indian Inst Technol, Dharwad 580011, Karnataka, India

来源：

SPEECH AND COMPUTER, SPECOM 2023, PT I | 2023年 / 14338卷

关键词：

Speech-to-speech translation (S2ST); Weighted LP residual; Speech enhancement; Pole modification;

D O I：

10.1007/978-3-031-48309-7_19

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

The perceptual quality of translated speech depends on the quantity of speech data used for training. The translation speech quality is poor when the system is trained with less data. The quality improves by gradually adding more speech data for training. This work demonstrates the significance of post-processing of translated speech by signal processing for improving perceptual quality. Initially, the target speech original residual is used to replace the translated speech residual. It is then replaced using the weighted residual obtained by speech enhancement. The pole modification of translated speech is also done. Finally, both weighted residual and pole modifications are combined. All the experiments show improvement in perceptual quality.

引用

页码：222 / 232

页数：11

共 50 条

[21] A post-processing approach to improve emotion recognition rates
Pittermann, Johannes
Pittermann, Angela
2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 708 - +
[22] Noisy speech enhancement using harmonic-noise model and codebook-based post-processing
Zavarehei, Esfandiar
Vaseghi, Saeed
Yan, Qin
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (04): : 1194 - 1203
[23] Enhancement of noisy speech by spectral subtraction and residual modification
Krishnamoorthy, P.
Prasanna, S. R. Mahadeva
2006 ANNUAL IEEE INDIA CONFERENCE, 2006, : 124 - +
[24] LSTM-Based Iterative Mask Estimation and Post-Processing for Multi-Channel Speech Enhancement
Tu, Yan-Hui
Du, Jun
Sun, Lei
Lee, Chin-Hui
2017 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC 2017), 2017, : 488 - 491
[25] Low complexity perceptual post-processing of MPEG-4 sequences
Jung, J
Le Maguet, Y
Gobert, J
Delcorso, S
IMAGE AND VIDEO COMMUNICATIONS AND PROCESSING 2003, PTS 1 AND 2, 2003, 5022 : 248 - 259
[26] Post-processing of automatic segmentation of speech using dynamic programming
Szymanski, Marcin
Grocholewski, Stefan
TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2006, 4188 : 523 - 530
[27] Post-processing for real-time quality enhancement of MPEG-coded video sequences
Atzori, L
De Natale, FGB
Granelli, F
2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 975 - 978
[28] A new perceptual post filter for single channel speech enhancement
Alam, Md. Jahangir
O'Shaughnessy, Douglas
Selouani, Sid-Ahmed
PROCEEDINGS OF ICECE 2008, VOLS 1 AND 2, 2008, : 386 - +
[29] Signal subspace speech enhancement with perceptual post-filtering
Klein, M
Kabal, P
2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 537 - 540
[30] Application of Bezier functions to the post-processing enhancement of decompressed images
Mayer, J
Langdon, GG
THIRTY-FIRST ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2, 1998, : 239 - 242

← 1 2 3 4 5 →