Tone recognition in continuous Cantonese speech using supratone models

被引:16
|
作者
Qian, Yao [1 ]
Lee, Tan
Soong, Frank K.
机构
[1] Chinese Univ Hong Kong, Dept Elect Engn, Shatin, Hong Kong, Peoples R China
[2] Microsoft Res Asia, Beijing Sigma Ctr, Beijing 100080, Peoples R China
来源
关键词
D O I
10.1121/1.2717413
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper studies automatic tone recognition in continuous Cantonese speech. Cantonese is a major Chinese dialect that is known for being rich in tones. Tone information serves as a useful knowledge source for automatic speech recognition of Cantonese. Cantonese tone recognition is difficult because the tones have similar shapes of pitch contours. The tones are differentiated mainly by their relative pitch heights. In natural speech, the pitch level of a tone may shift up and down and the F0 ranges of different tones overlap with each other, making them acoustically indistinguishable within the domain of a syllable. Our study shows that the relative pitch heights are largely preserved between neighboring tones. A novel method of supratone modeling is proposed for Cantonese tone recognition. Each supratone model characterizes the F0 contour of two or three tones in succession. The tone sequence of a continuous utterance is formed as an overlapped concatenation of supratone units. The most likely tone sequence is determined under phonological constraints on syllable-tone combinations. The proposed method attains an accuracy of 74.68% in speaker-independent tone recognition experiments. In particular, the confusion among the tones with similar contour shapes is greatly resolved. (C) 2007 Acoustical Society of America.
引用
收藏
页码:2936 / 2945
页数:10
相关论文
共 50 条
  • [1] Tone recognition of continuous Cantonese speech based on support vector machines
    Peng, G
    Wang, WSY
    SPEECH COMMUNICATION, 2005, 45 (01) : 49 - 62
  • [2] Tone recognition for Chinese speech: A comparative study of Mandarin and Cantonese
    Peng, G
    Zheng, HY
    Wang, WSY
    2004 INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2004, : 233 - 236
  • [3] A TONE RECOGNITION FRAMEWORK FOR CONTINUOUS MANDARIN SPEECH
    He, Lei
    Hao, Jie
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1575 - 1578
  • [4] Tone Modeling for Continuous Mandarin Speech Recognition
    Cao, Yang
    Zhang, Shuwu
    Huang, Taiyi
    Xu, Bo
    International Journal of Speech Technology, 2004, 7 (2-3) : 115 - 128
  • [5] Tone recognition of Vietnamese continuous speech using hidden Markov model
    Quang, Nguyen Hong
    Pascal, Nocera
    Eric, Castelli
    Van Loan, Trinh
    2008 SECOND INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND ELECTRONICS, 2008, : 233 - +
  • [6] Tone Recognition of Continuous Speech of Standard Chinese Using Neural Network and Tone Nucleus Model
    Hirose, Keikichi
    Hu, Hui
    Wang, Xiaodong
    Minematsu, Nobuaki
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2394 - +
  • [7] TONE RECOGNITION OF ISOLATED CANTONESE SYLLABLES
    LEE, T
    CHING, PC
    CHAN, LW
    CHENG, YH
    MAK, B
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995, 3 (03): : 204 - 209
  • [8] Continuous speech recognition using linear dynamic models
    Ma, Tao
    Srinivasan, Sundararajan
    Lazarou, Georgios
    Picone, Joseph
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2014, 17 (01) : 11 - 16
  • [9] Continuous Malayalam Speech Recognition Using Hidden Markov Models
    Mohamed, Anuj
    Nair, K. N. Ramachandran
    PROCEEDINGS OF THE FIRST AMRITA ACM-W CELEBRATION OF WOMEN IN COMPUTING IN INDIA (A2WIC), 2010,
  • [10] Tone Hyperarticulation in Cantonese Infant-Directed Speech
    Xu, Nan
    Burnham, Denis
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 624 - 624