Universal attribute characterization of spoken languages for automatic spoken language recognition

被引:41
|
作者
Siniscalchi, Sabato Marco [1 ]
Reed, Jeremy [2 ]
Svendsen, Torbjorn [3 ]
Lee, Chin-Hui [4 ]
机构
[1] Kore Univ Enna, Fac Engn & Architecture, Enna, Sicily, Italy
[2] Georgia Inst Technol, Georgia Tech Res Inst, Atlanta, GA 30332 USA
[3] Norwegian Univ Sci & Technol, Dept Elect & Telecommun, N-7491 Trondheim, Norway
[4] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA
来源
COMPUTER SPEECH AND LANGUAGE | 2013年 / 27卷 / 01期
关键词
Spoken language recognition; Vector space model; Latentsemantic analysis; Artificial neural network; Support vectormachine; Phonetic features; NEURAL-NETWORKS; DESIGN;
D O I
10.1016/j.csl.2012.05.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel universal acoustic characterization approach to spoken language recognition (LRE). The key idea is to describe any spoken language with a common set of fundamental units that can be defined "universally" across all spoken languages. In this study, speech attributes, such as manner and place of articulation, are chosen to form this unit inventory and used to build a set of language-universal attribute models with data-driven modeling techniques. The vector space modeling approach to LRE is adopted, where a spoken utterance is first decoded into a sequence of attributes independently of its language. Then, a feature vector is generated by using co-occurrence statistics of manner or place units, and the final LRE decision is implemented with a vector space language classifier. Several architectural configurations will be studied, and it will be shown that best performance is attained using a maximal figure-of-merit language classifier. Experimental evidence not only demonstrates the feasibility of the proposed techniques, but it also shows that the proposed technique attains comparable performance to standard approaches on the LRE tasks investigated in this work when the same experimental conditions are adopted. (C) 2012 Elsevier Ltd. All rights reserved.
引用
收藏
页码:209 / 227
页数:19
相关论文
共 50 条
  • [41] SPOKEN LANGUAGE RECOGNITION ON A DSP ARRAY PROCESSOR
    GLINSKI, S
    ROE, D
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1994, 5 (07) : 697 - 703
  • [42] Language Modeling for Speech Recognition of Spoken Cantonese
    Yeung, Yu Ting
    Cao, Houwei
    Zheng, N. H.
    Lee, Tan
    Ching, P. C.
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1570 - 1573
  • [43] STATE AND THE LANGUAGES SPOKEN IN IT
    KNAPPERT, J
    LINGUISTICS, 1978, (214) : 69 - 76
  • [44] TechWare: Speaker and Spoken Language Recognition Resources
    Li, Haizhou
    Ma, Bin
    IEEE SIGNAL PROCESSING MAGAZINE, 2010, 27 (06) : 139 - 142
  • [45] Universal Adversarial Attacks On Spoken Language Assessment Systems
    Raina, Vyas
    Gales, Mark J. F.
    Knill, Kate M.
    INTERSPEECH 2020, 2020, : 3855 - 3859
  • [46] SPOKEN-SPOKEN, SPOKEN-WRITTEN SPOKEN-RECITED + DISCOURSE AND LANGUAGE
    NENCIONI, G
    STRUMENTI CRITICI, 1976, (29) : 1 - 55
  • [47] How speaker tongue and name source language affect the automatic recognition of spoken names
    Reveil, Bert
    Martens, Jean-Pierre
    D'hoore, Bart
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2971 - +
  • [48] Target-Aware Language Models for Spoken Language Recognition
    Tong, Rong
    Ma, Bin
    Li, Haizhou
    Chang, Eng Siong
    Lee, Kong-Aik
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 200 - +
  • [49] Spoken Language Identification of Four Tibeto-Burman languages
    Chakraborty, Joyshree
    Sarmah, Priyankoo
    Vijaya, Samudra
    PROCEEDINGS OF 2020 23RD CONFERENCE OF THE ORIENTAL COCOSDA INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDISATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (ORIENTAL-COCOSDA 2020), 2020, : 106 - 110
  • [50] DEVELOPMENT OF A SPOKEN LANGUAGE IDENTIFICATION SYSTEM FOR SOUTH AFRICAN LANGUAGES
    Peche, M.
    Davel, M. H.
    Barnard, E.
    SAIEE AFRICA RESEARCH JOURNAL, 2009, 100 (04): : 97 - 103