Comparison of perceived prosodic boundaries and global characteristics of voice fundamental frequency contours in Mandarin speech

Cited by: 0
Authors
Gu, Wentao [1, 2]
Hirose, Keikichi [1]
Fujisaki, Hiroya [1]
Affiliations
[1] Univ Tokyo, Bunkyo Ku, 7-3-1 Hongo, Tokyo 1138656, Japan
[2] Chinese Univ of Hong Kong, Hong Hom, Hong Kong, Peoples R China
Keywords
prosodic hierarchy; perceived prosodic boundary; F-0; contour; phrase; command-response model; Mandarin; perception; production
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Although there have been many studies on the prosodic structure of spoken Mandarin as well as many proposals for labeling the prosody of spoken Mandarin, the labeling of prosodic boundaries in all the existing annotation systems relies on auditory perception, and lacks a direct relation to the acoustic process of prosody generation. Besides, perception-based annotation cannot ensure a high degree of consistency and reliability. In the present study, we investigate the phrasing of spoken Mandarin from the production point of view, by using an acoustic model for generating F-0 contours. The relationship between perceived prosodic boundaries at various layers and phrase commands derived from the model-based analysis of F-0 contours is then revealed. The results indicate that a perception-based prosody labeling system cannot describe the prosodic structure as accurately as the model for F-0 contour generation.
Pages: 31 / +
Page count: 2