Universal attribute characterization of spoken languages for automatic spoken language recognition

被引:41
|
作者
Siniscalchi, Sabato Marco [1 ]
Reed, Jeremy [2 ]
Svendsen, Torbjorn [3 ]
Lee, Chin-Hui [4 ]
机构
[1] Kore Univ Enna, Fac Engn & Architecture, Enna, Sicily, Italy
[2] Georgia Inst Technol, Georgia Tech Res Inst, Atlanta, GA 30332 USA
[3] Norwegian Univ Sci & Technol, Dept Elect & Telecommun, N-7491 Trondheim, Norway
[4] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA
来源
COMPUTER SPEECH AND LANGUAGE | 2013年 / 27卷 / 01期
关键词
Spoken language recognition; Vector space model; Latentsemantic analysis; Artificial neural network; Support vectormachine; Phonetic features; NEURAL-NETWORKS; DESIGN;
D O I
10.1016/j.csl.2012.05.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel universal acoustic characterization approach to spoken language recognition (LRE). The key idea is to describe any spoken language with a common set of fundamental units that can be defined "universally" across all spoken languages. In this study, speech attributes, such as manner and place of articulation, are chosen to form this unit inventory and used to build a set of language-universal attribute models with data-driven modeling techniques. The vector space modeling approach to LRE is adopted, where a spoken utterance is first decoded into a sequence of attributes independently of its language. Then, a feature vector is generated by using co-occurrence statistics of manner or place units, and the final LRE decision is implemented with a vector space language classifier. Several architectural configurations will be studied, and it will be shown that best performance is attained using a maximal figure-of-merit language classifier. Experimental evidence not only demonstrates the feasibility of the proposed techniques, but it also shows that the proposed technique attains comparable performance to standard approaches on the LRE tasks investigated in this work when the same experimental conditions are adopted. (C) 2012 Elsevier Ltd. All rights reserved.
引用
收藏
页码:209 / 227
页数:19
相关论文
共 50 条
  • [1] Exploring Universal Attribute Characterization of Spoken Languages for Spoken Language Recognition
    Siniscalchi, Sabato Marco
    Reed, Jeremy
    Svendsen, Torbjorn
    Lee, Chin-Hui
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 168 - +
  • [2] Automatic Word Recognition for Bangla Spoken Language
    Zinnat, Sara Binte
    Marzia, Razia
    Siddique, Asheque
    Hossain, Md. Imamul
    Abdullah, Md.
    Huda, Mohammad Nurul
    2014 INTERNATIONAL CONFERENCE ON SIGNAL PROPAGATION AND COMPUTER TECHNOLOGY (ICSPCT 2014), 2014, : 470 - 475
  • [3] Exploiting Context-Dependency and Acoustic Resolution of Universal Speech Attribute Models in Spoken Language Recognition
    Siniscalchi, Sabato Marco
    Reed, Jeremy
    Svendsen, Torbjorn
    Lee, Chin-Hui
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2726 - +
  • [4] AUTOMATIC RECOGNITION OF SPOKEN DIGITS
    DAVIS, KH
    BIDDULPH, R
    BALASHEK, S
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1952, 24 (06): : 637 - 642
  • [5] AUTOMATIC RECOGNITION OF SPOKEN NUMERALS
    SEBESTYEN, G
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1960, 32 (11): : 1516 - 1517
  • [6] AUTOMATIC RECOGNITION OF SPOKEN WORDS
    VONKELLER, TG
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1968, 44 (01): : 385 - +
  • [7] PROSODIC ATTRIBUTE MODEL FOR SPOKEN LANGUAGE IDENTIFICATION
    Ng, Raymond W. M.
    Leung, Cheung-Chi
    Lee, Tan
    Ma, Bin
    Li, Haizhou
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5022 - 5025
  • [8] Automatic Technologies for Processing Spoken Sign Languages
    Karpov, Alexey
    Kipyatkova, Irina
    Zelezny, Milos
    SLTU-2016 5TH WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGIES FOR UNDER-RESOURCED LANGUAGES, 2016, 81 : 201 - 207
  • [9] Efficient Language Model Adaptation for Automatic Speech Recognition of Spoken Translations
    Pelemans, Joris
    Vanallemeersch, Tom
    Demuynck, Kris
    Van Hamme, Hugo
    Wambacq, Patrick
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2262 - 2266
  • [10] Automatic Evaluation of Fluency in Spoken Language
    Audhkhasi, Kartik
    IETE TECHNICAL REVIEW, 2009, 26 (02) : 108 - 114