Synthesis fidelity and time-varying spectral change in vowels

Cited by: 36

Authors:

Assmann, PF

Katz, WF

Affiliations:

[1] Univ Texas, Sch Behav & Brain Sci, Richardson, TX 75083 USA

[2] Univ Texas, Callier Ctr Commun Disorders, Richardson, TX 75083 USA

Source:

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA | 2005 / Vol. 117 / No. 02

DOI:

10.1121/1.1852549

Chinese Library Classification number:

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

Recent studies have shown that synthesized versions of American English vowels are less accurately identified when the natural time-varying spectral changes are eliminated by holding the formant frequencies constant over the duration of the vowel. A limitation of these experiments has been that vowels produced by formant synthesis are generally less accurately identified than the natural vowels after which they are modeled. To overcome this limitation, a high-quality speech analysis-synthesis system (STRAIGHT) was used to synthesize versions of 12 American English vowels spoken by adults and children. Vowels synthesized with STRAIGHT were identified as accurately as the natural versions, in contrast with previous results from our laboratory showing identification rates 9%-12% lower for the same vowels synthesized using the cascade formant model. Consistent with earlier studies, identification accuracy was not reduced when the fundamental frequency was held constant across the vowel. However, elimination of time-varying changes in the spectral envelope using STRAIGHT led to a greater reduction in accuracy (23%) than was previously found with cascade formant synthesis (11%). A statistical pattern recognition model, applied to acoustic measurements of the natural and synthesized vowels, predicted both the higher identification accuracy for vowels synthesized using STRAIGHT compared to formant synthesis, and the greater effects of holding the formant frequencies constant over time with STRAIGHT synthesis. Taken together, the experiment and modeling results suggest that formant estimation errors and incorrect rendering of spectral and temporal cues by cascade formant synthesis contribute to lower identification accuracy and underestimation of the role of time-varying spectral change in vowels. © 2005 Acoustical Society of America.

Pages: 886-895

Page count: 10
