Comparison of perceived prosodic boundaries and global characteristics of voice fundamental frequency contours in Mandarin speech

被引:0
|
作者
Gu, Wentao [1 ,2 ]
Hirose, Keikichi [1 ]
Fujisaki, Hiroya [1 ]
机构
[1] Univ Tokyo, Bunkyo Ku, 7-3-1 Hongo, Tokyo 1138656, Japan
[2] Chinese Uni of Hong Kong, Hong Hom, Hong Kong, Peoples R China
关键词
prosodic hierarchy; perceived prosodic boundary; F-0; contour; phrase; command-response model; Mandarin; perception; production;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Although there have been many studies on the prosodic structure of spoken Mandarin as well as many proposals for labeling the prosody of spoken Mandarin, the labeling of prosodic boundaries in all the existing annotation systems relies on auditory perception, and lacks a direct relation to the acoustic process of prosody generation. Besides, perception-based annotation cannot ensure a high degree of consistency and reliability. In the present study, we investigate the phrasing of spoken Mandarin from the production point of view, by using an acoustic model for generating F-0 contours. The relationship between perceived prosodic boundaries at various layers and phrase commands derived from the model-based analysis of F-0 contours is then revealed. The results indicate that a perception-based prosody labeling system cannot describe the prosodic structure as accurately as the model for F-0 contour generation.
引用
收藏
页码:31 / +
页数:2
相关论文
共 50 条
  • [41] Detection of phrase boundaries in Japanese by low-pass filtering of fundamental frequency contours
    Sakurai, A
    Hirose, K
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 817 - 820
  • [42] The Role of Perceived Voice and Speech Characteristics in Vocal Emotion Communication
    Tanja Bänziger
    Sona Patel
    Klaus R. Scherer
    Journal of Nonverbal Behavior, 2014, 38 : 31 - 52
  • [43] The Role of Perceived Voice and Speech Characteristics in Vocal Emotion Communication
    Baenziger, Tanja
    Patel, Sona
    Scherer, Klaus R.
    JOURNAL OF NONVERBAL BEHAVIOR, 2014, 38 (01) : 31 - 52
  • [44] WaveVC: Speech and Fundamental Frequency Consistent Raw Audio Voice Conversion
    Ko, Kyungdeuk
    Kim, Donghyeon
    Oh, Kyungseok
    Ko, Hanseok
    NEURAL PROCESSING LETTERS, 2024, 56 (04)
  • [45] Control of Prosodic Focus in Corpus-based Generation of Fundamental Frequency Contours Based on the Generation Process Model
    Hirose, Keikichi
    Ochi, Keiko
    Minematsu, Nobuaki
    2010 IEEE 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS (ICSP2010), VOLS I-III, 2010, : 629 - 632
  • [46] Generation of Fundamental Frequency Contours for Thai Speech Synthesis using Tone Nucleus Model
    Krityakien, Oraphan
    Hirose, Keikichi
    Minematsu, Nobuaki
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1036 - 1040
  • [47] FRED - AN INTERACTIVE GRAPHICS PROGRAM TO MODIFY FUNDAMENTAL-FREQUENCY CONTOURS IN RESYNTHESIZED SPEECH
    SILVERMAN, KEA
    BEHAVIOR RESEARCH METHODS INSTRUMENTS & COMPUTERS, 1986, 18 (04): : 395 - 397
  • [48] SELECTED SPEECH AND FUNDAMENTAL FREQUENCY CHARACTERISTICS OF PATIENTS WITH ACROMEGALY
    WEINBERG, B
    DEXTER, R
    HORII, Y
    JOURNAL OF SPEECH AND HEARING DISORDERS, 1975, 40 (02): : 253 - 259
  • [49] Voice fundamental frequency differences and speech recognition with noise and speech maskers in cochlear implant recipients
    Meister, Hartmut
    Walger, Martin
    Lang-Roth, Ruth
    Mueller, Verena
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2020, 147 (01): : EL19 - EL24
  • [50] SPEAKING FUNDAMENTAL FREQUENCY-CHARACTERISTICS ASSOCIATED WITH VOICE PATHOLOGIES
    MURRY, T
    JOURNAL OF SPEECH AND HEARING DISORDERS, 1978, 43 (03): : 374 - 379