Exploiting Context-Dependency and Acoustic Resolution of Universal Speech Attribute Models in Spoken Language Recognition

Cited by: 0
Authors
Siniscalchi, Sabato Marco [1]
Reed, Jeremy [2]
Svendsen, Torbjorn [3]
Lee, Chin-Hui [2]
Affiliations
[1] Univ Enna Kore, Dept Telemat, Enna, Italy
[2] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA USA
[3] NTNU, Dept Elect & Telecommun, Trondheim, Norway
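Keywords
language identification; latent semantic analysis
DOI
Not available
Chinese Library Classification
TM [Electrical Engineering]; TN [Electronics and Communication Technology]
Discipline classification codes
0808; 0809
Abstract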
This paper expands a previously proposed universal acoustic characterization approach to spoken language identification (LID) by studying different ways of modeling attributes to improve language recognition. The motivation is to describe any spoken language with a common set of fundamental units. Thus, a spoken utterance is first tokenized into a sequence of universal attributes, and a vector space modeling approach then delivers the final LID decision. Context-dependent attribute models are now used to better capture spectral and temporal characteristics. An approach that expands the set of attributes to increase the acoustic resolution is also studied. Our experiments show that tokenization accuracy positively affects LID results, producing a 2.8% absolute improvement over our previous performance on the 30-second NIST 2003 task. This result also compares favorably with the best results on the same task known to the authors when the tokenizers are trained on language-dependent OGI-TS data.
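The two-stage pipeline described in the abstract (tokenize an utterance into attribute symbols, then classify it in a vector space) can be illustrated with a minimal sketch. This is not the authors' system: the attribute labels (`stop`, `vowel`, `nasal`, and so on), the bigram count features, the cosine-to-centroid decision rule, and the toy data are all assumptions chosen only to keep the example short and runnable. A real front end would produce the token sequences from audio, and the paper's keywords suggest latent semantic analysis rather than raw counts.

```python
from collections import Counter
from math import sqrt


def ngram_counts(tokens, n=2):
    """Count n-grams of attribute tokens in one utterance."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def normalize(counts):
    """Scale a sparse count vector to unit Euclidean length."""
    norm = sqrt(sum(v * v for v in counts.values()))
    return {k: v / norm for k, v in counts.items()} if norm else {}


def cosine(a, b):
    """Dot product of two sparse vectors that are already unit length."""
    return sum(w * b.get(k, 0.0) for k, w in a.items())


def train_centroids(labeled_utterances, n=2):
    """Average the unit-length utterance vectors of each language."""
    sums = {}
    for lang, tokens in labeled_utterances:
        vec = normalize(ngram_counts(tokens, n))
        acc = sums.setdefault(lang, Counter())
        for k, v in vec.items():
            acc[k] += v
    return {lang: normalize(acc) for lang, acc in sums.items()}


def identify(tokens, centroids, n=2):
    """Return the language whose centroid is closest in cosine similarity."""
    vec = normalize(ngram_counts(tokens, n))
    return max(centroids, key=lambda lang: cosine(vec, centroids[lang]))


# Hypothetical tokenizer output: attribute sequences for two made-up languages.
train = [
    ("lang_a", ["stop", "vowel", "nasal", "vowel", "stop", "vowel"]),
    ("lang_b", ["fricative", "vowel", "fricative", "vowel", "glide", "vowel"]),
]
centroids = train_centroids(train)
print(identify(["stop", "vowel", "nasal", "vowel"], centroids))
```

Because every language shares the same attribute inventory, all utterances map into one common feature space, which is what lets a single back-end classifier serve all languages. Raising `n`, or enlarging the attribute set, increases the number of distinct features, loosely mirroring the abstract's point about context dependency and acoustic resolution.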
Pages: 2726 / +
Page count: 2