Comparison of perceived prosodic boundaries and global characteristics of voice fundamental frequency contours in Mandarin speech

被引：0

作者：

Gu, Wentao ^{[1
,2
]}

Hirose, Keikichi ^{[1
]}

Fujisaki, Hiroya ^{[1
]}

机构：

[1] Univ Tokyo, Bunkyo Ku, 7-3-1 Hongo, Tokyo 1138656, Japan

[2] Chinese Uni of Hong Kong, Hong Hom, Hong Kong, Peoples R China

来源：

CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS | 2006年 / 4274卷

关键词：

prosodic hierarchy; perceived prosodic boundary; F-0; contour; phrase; command-response model; Mandarin; perception; production;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Although there have been many studies on the prosodic structure of spoken Mandarin as well as many proposals for labeling the prosody of spoken Mandarin, the labeling of prosodic boundaries in all the existing annotation systems relies on auditory perception, and lacks a direct relation to the acoustic process of prosody generation. Besides, perception-based annotation cannot ensure a high degree of consistency and reliability. In the present study, we investigate the phrasing of spoken Mandarin from the production point of view, by using an acoustic model for generating F-0 contours. The relationship between perceived prosodic boundaries at various layers and phrase commands derived from the model-based analysis of F-0 contours is then revealed. The results indicate that a perception-based prosody labeling system cannot describe the prosodic structure as accurately as the model for F-0 contour generation.

引用

页码：31 / +

页数：2

共 50 条

[31] ULTRASONIC REGISTRATION OF FUNDAMENTAL FREQUENCY OF A VOICE DURING NORMAL SPEECH
HOLMER, NG
RUNDQVIST, HE
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1975, 58 (05): : 1073 - 1077
[32] Effects of Fundamental Frequency Contours on Sentence Recognition in Mandarin-Speaking Children With Cochlear Implants
Huang, Wanting
Wong, Lena L. N.
Chen, Fei
Liu, Haihong
Liang, Wei
JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2020, 63 (11): : 3855 - 3864
[33] Improving Prosodic Boundaries Prediction for Mandarin Speech Synthesis by Using Enhanced Embedding Feature and Model Fusion Approach
Zheng, Yibin
Li, Ya
Wen, Zhengqi
Ding, Xingguang
Tao, Jianhua
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3201 - 3205
[34] A method for automatic extraction of model parameters from fundamental frequency contours of speech
Narusawa, S
Minematsu, N
Hirose, K
Fujisaki, H
2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 509 - 512
[35] DISCRIMINATION OF FUNDAMENTAL FREQUENCY CONTOURS IN SYNTHETIC SPEECH - IMPLICATIONS FOR MODELS OF PITCH PERCEPTION
KLATT, DH
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1973, 53 (01): : 8 - 16
[36] Analysing fundamental frequency contours and local speech rate in map task dialogs
Mixdorff, H
Pfitzinger, HR
SPEECH COMMUNICATION, 2005, 46 (3-4) : 310 - 325
[37] Pre-processing of fundamental frequency contours of speech for automatic parameter extraction
Fujisaki, H
Narusawa, S
Maruno, M
2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 722 - 725
[38] Automatic parameter extraction of fundamental frequency contours of speech based on a generative model
Fujisaki, H
Ohno, S
Tomita, O
ICSP '96 - 1996 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1996, : 729 - 732
[39] Speaking rate and fundamental frequency as speech cues to perceived age
Hamsberger, James D.
Shrivastav, Rahul
Brown, W. S., Jr.
Rothman, Howard
Hollien, Harry
JOURNAL OF VOICE, 2008, 22 (01) : 58 - 69
[40] ANALYSIS OF STUTTERERS VOICE ONSET TIMES AND FUNDAMENTAL-FREQUENCY CONTOURS DURING FLUENCY
HEALEY, EC
GUTKIN, B
JOURNAL OF SPEECH AND HEARING RESEARCH, 1984, 27 (02): : 219 - 225

← 1 2 3 4 5 →