Comparison of perceived prosodic boundaries and global characteristics of voice fundamental frequency contours in Mandarin speech

被引:0
|
作者
Gu, Wentao [1 ,2 ]
Hirose, Keikichi [1 ]
Fujisaki, Hiroya [1 ]
机构
[1] Univ Tokyo, Bunkyo Ku, 7-3-1 Hongo, Tokyo 1138656, Japan
[2] Chinese Uni of Hong Kong, Hong Hom, Hong Kong, Peoples R China
关键词
prosodic hierarchy; perceived prosodic boundary; F-0; contour; phrase; command-response model; Mandarin; perception; production;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Although there have been many studies on the prosodic structure of spoken Mandarin as well as many proposals for labeling the prosody of spoken Mandarin, the labeling of prosodic boundaries in all the existing annotation systems relies on auditory perception, and lacks a direct relation to the acoustic process of prosody generation. Besides, perception-based annotation cannot ensure a high degree of consistency and reliability. In the present study, we investigate the phrasing of spoken Mandarin from the production point of view, by using an acoustic model for generating F-0 contours. The relationship between perceived prosodic boundaries at various layers and phrase commands derived from the model-based analysis of F-0 contours is then revealed. The results indicate that a perception-based prosody labeling system cannot describe the prosodic structure as accurately as the model for F-0 contour generation.
引用
收藏
页码:31 / +
页数:2
相关论文
共 50 条
  • [1] Quantitative and structural modeling of voice fundamental frequency contours of speech in Mandarin
    Ni, Jinfu
    Hirose, Keikichi
    SPEECH COMMUNICATION, 2006, 48 (08) : 989 - 1008
  • [2] The roles of fundamental frequency contours and sentence context in Mandarin Chinese speech intelligibility
    Wang, Jiuju
    Shu, Hua
    Zhang, Linjun
    Liu, Zhaoxing
    Zhang, Yang
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2013, 134 (01): : EL91 - EL97
  • [3] The roles of fundamental frequency contours and sentence context in Mandarin Chinese speech intelligibility
    Wang, J. (wangjiuju@gmail.com), 1600, Acoustical Society of America (134):
  • [4] Detection of prosodic word boundaries by statistical modeling of mora transitions of fundamental frequency contours and its use for continuous speech recognition
    Hirose, K
    Iwano, K
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1763 - 1766
  • [5] FUNDAMENTAL FREQUENCY CONTOURS AT SYNTACTIC BOUNDARIES
    COOPER, WE
    SORENSEN, JM
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1977, 62 (03): : 683 - 692
  • [6] ANALYSIS OF FUNDAMENTAL FREQUENCY CONTOURS IN SPEECH
    LEVITT, H
    RABINER, LR
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1971, 49 (02): : 569 - &
  • [7] Analysis of Voice Fundamental Frequency Contours of Continuing and Terminating Prosodic Phrases in Four Swiss German Dialects
    Leemann, Adrian
    Hirose, Keikichi
    Fujisaki, Hiroya
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2399 - 2402
  • [8] Effects of Semantic Context and Fundamental Frequency Contours on Mandarin Speech Recognition by Second Language Learners
    Zhang, Linjun
    Li, Yu
    Wu, Han
    Li, Xin
    Shu, Hua
    Zhang, Yang
    Li, Ping
    FRONTIERS IN PSYCHOLOGY, 2016, 7
  • [9] Generative Modeling of Voice Fundamental Frequency Contours
    Kameoka, Hirokazu
    Yoshizato, Kota
    Ishihara, Tatsuma
    Kadowaki, Kento
    Ohishi, Yasunori
    Kashino, Kunio
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (06) : 1042 - 1053
  • [10] CHARACTERIZATION OF FUNDAMENTAL-FREQUENCY CONTOURS OF SPEECH
    MAEDA, S
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1974, 56 : S33 - S33