Universal attribute characterization of spoken languages for automatic spoken language recognition

被引:41
|
作者
Siniscalchi, Sabato Marco [1 ]
Reed, Jeremy [2 ]
Svendsen, Torbjorn [3 ]
Lee, Chin-Hui [4 ]
机构
[1] Kore Univ Enna, Fac Engn & Architecture, Enna, Sicily, Italy
[2] Georgia Inst Technol, Georgia Tech Res Inst, Atlanta, GA 30332 USA
[3] Norwegian Univ Sci & Technol, Dept Elect & Telecommun, N-7491 Trondheim, Norway
[4] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA
来源
COMPUTER SPEECH AND LANGUAGE | 2013年 / 27卷 / 01期
关键词
Spoken language recognition; Vector space model; Latentsemantic analysis; Artificial neural network; Support vectormachine; Phonetic features; NEURAL-NETWORKS; DESIGN;
D O I
10.1016/j.csl.2012.05.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel universal acoustic characterization approach to spoken language recognition (LRE). The key idea is to describe any spoken language with a common set of fundamental units that can be defined "universally" across all spoken languages. In this study, speech attributes, such as manner and place of articulation, are chosen to form this unit inventory and used to build a set of language-universal attribute models with data-driven modeling techniques. The vector space modeling approach to LRE is adopted, where a spoken utterance is first decoded into a sequence of attributes independently of its language. Then, a feature vector is generated by using co-occurrence statistics of manner or place units, and the final LRE decision is implemented with a vector space language classifier. Several architectural configurations will be studied, and it will be shown that best performance is attained using a maximal figure-of-merit language classifier. Experimental evidence not only demonstrates the feasibility of the proposed techniques, but it also shows that the proposed technique attains comparable performance to standard approaches on the LRE tasks investigated in this work when the same experimental conditions are adopted. (C) 2012 Elsevier Ltd. All rights reserved.
引用
收藏
页码:209 / 227
页数:19
相关论文
共 50 条
  • [31] Languages spoken in Asia
    不详
    ANTHROPOLOGIE, 1948, 52 (03): : 520 - 521
  • [32] Automatic Spoken Language Identification by Digital Signal Processing Methods. Tatar and Russian Languages
    Latypov, Rustam
    Nigmatullin, Ruslan
    Stolov, Evgeni
    INFORMATION AND SOFTWARE TECHNOLOGIES (ICIST 2017), 2017, 756 : 539 - 549
  • [33] Robust numeric recognition in spoken language dialogue
    Rahim, M
    Riccardi, G
    Saul, L
    Wright, J
    Buntschuh, B
    Gorin, A
    SPEECH COMMUNICATION, 2001, 34 (1-2) : 195 - 212
  • [34] OPTIMIZING PLLR FEATURES FOR SPOKEN LANGUAGE RECOGNITION
    Diez, Mireia
    Varona, Amparo
    Penagarikano, Mikel
    Javier Rodriguez-Fuentes, Luis
    Bordel, German
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 779 - 784
  • [35] SPOKEN LANGUAGE UNDERSTANDING WITHOUT SPEECH RECOGNITION
    Chen, Yuan-Ping
    Price, Ryan
    Bangalore, Srinivas
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 6189 - 6193
  • [36] Spoken Language Recognition: From Fundamentals to Practice
    Li, Haizhou
    Ma, Bin
    Lee, Kong Aik
    PROCEEDINGS OF THE IEEE, 2013, 101 (05) : 1136 - 1159
  • [37] Application of GMM Models to Spoken Language Recognition
    Dustor, Adam
    Szwarc, Pawel
    MIXDES 2009: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE MIXED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2009, : 603 - 606
  • [38] A Syllable Structure Approach to Spoken Language Recognition
    Lee, Ruei-Hung Alex
    Jang, Jyh-Shing Roger
    STATISTICAL LANGUAGE AND SPEECH PROCESSING, SLSP 2018, 2018, 11171 : 56 - 66
  • [39] Spoken Language Recognition in the Latent Topic Simplex
    Lee, Kong Aik
    You, Chang Huai
    Hautamaeki, Ville
    Larcher, Anthony
    Li, Haizhou
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2944 - 2947
  • [40] Spoken language recognition using ensemble classifiers
    Ma, Bin
    Li, Haizhou
    Tong, Rong
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (07): : 2053 - 2062