Exploring Universal Attribute Characterization of Spoken Languages for Spoken Language Recognition

Cited by: 0
Authors
Siniscalchi, Sabato Marco [1]
Reed, Jeremy [2]
Svendsen, Torbjorn [1]
Lee, Chin-Hui [2]
Affiliations
[1] NTNU, Dept Elect & Telecommun, Trondheim, Norway
[2] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA
Keywords
Language recognition; vector space modeling; phonetic features
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We propose a novel universal acoustic characterization approach to spoken language identification (LID), in which any spoken language is described with a common set of fundamental units defined "universally." Specifically, manner and place of articulation form this unit inventory and are used to build a set of universal attribute models with data-driven techniques. Using vector space modeling approaches to LID, a spoken utterance is first decoded into a sequence of attributes. Then, a feature vector consisting of co-occurrence statistics of attribute units is created, and the final LID decision is made by a set of vector space language classifiers. Although the present study is still at a preliminary stage, promising results comparable to those of acoustically rich phone-based LID systems have already been obtained on the NIST 2003 LID task. The results provide clear insight for further performance improvements and encourage continued exploration of the proposed framework.
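The pipeline described in the abstract (attribute sequence, then co-occurrence statistics, then vector space classification) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration: the attribute labels in `ATTRIBUTES`, the use of bigram frequencies as the co-occurrence statistic, and the cosine nearest-centroid rule standing in for the vector space language classifiers. The abstract does not specify the attribute inventory, the statistics, or the classifier, and the attribute decoder itself is not modeled.

```python
from collections import Counter
from itertools import product
import math

# Hypothetical attribute labels; the abstract does not list the actual inventory.
ATTRIBUTES = ["vowel", "stop", "nasal", "fricative", "silence"]
# Fixed ordering of all attribute pairs, so every vector has the same layout.
BIGRAMS = list(product(ATTRIBUTES, repeat=2))


def cooccurrence_vector(tokens):
    """Map a decoded attribute sequence to normalized bigram frequencies."""
    counts = Counter(zip(tokens, tokens[1:]))
    total = sum(counts.values()) or 1  # avoid division by zero on short input
    return [counts[b] / total for b in BIGRAMS]


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def train_centroids(labeled):
    """Average the feature vectors of each language's training sequences."""
    centroids = {}
    for lang, seqs in labeled.items():
        vecs = [cooccurrence_vector(s) for s in seqs]
        centroids[lang] = [sum(col) / len(vecs) for col in zip(*vecs)]
    return centroids


def identify(tokens, centroids):
    """Return the language whose centroid is most similar to the utterance."""
    vec = cooccurrence_vector(tokens)
    return max(centroids, key=lambda lang: cosine(vec, centroids[lang]))


# Toy, made-up attribute sequences (not data from the paper).
train = {
    "lang_a": [["stop", "vowel", "stop", "vowel", "nasal", "vowel"]],
    "lang_b": [["fricative", "stop", "fricative", "vowel", "fricative", "stop"]],
}
centroids = train_centroids(train)
guess = identify(["stop", "vowel", "nasal", "vowel", "stop", "vowel"], centroids)
print(guess)
```

On this toy data the test sequence shares all of its bigrams with the `lang_a` example and none with `lang_b`, so the sketch picks `lang_a`. A real system would replace the hand-written sequences with the output of trained attribute models and would likely use a stronger classifier.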
Pages: 168 / +
Page count: 2