Exploring Universal Attribute Characterization of Spoken Languages for Spoken Language Recognition

被引:0
|
作者
Siniscalchi, Sabato Marco [1 ]
Reed, Jeremy [2 ]
Svendsen, Torbjorn [1 ]
Lee, Chin-Hui [2 ]
机构
[1] NTNU, Dept Elect & Telecommun, Trondheim, Norway
[2] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA
关键词
Language recognition; vector space modeling; phonetic features;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel universal acoustic characterization approach to spoken language identification (LID), in which any spoken language is described with a common set of fundamental units defined "universally." Specifically, manner and place of articulation form this unit inventory and are used to build a set of universal attribute models with data-driven techniques. Using the vector space modeling approaches to LID a spoken utterance is first decoded into a sequence of attributes. Then, a feature vector consisting of co-occurrence statistics of attribute units is created, and the final LID decision is implemented with a set of vector space language classifiers. Although the present study is just in its preliminary stage, promising results comparable to acoustically rich phone-based LID systems have already been obtained on the NIST 2003 LID task. The results provide clear insight for further performance improvements and encourage a continuing exploration of the proposed framework.
引用
收藏
页码:168 / +
页数:2
相关论文
共 50 条
  • [1] Universal attribute characterization of spoken languages for automatic spoken language recognition
    Siniscalchi, Sabato Marco
    Reed, Jeremy
    Svendsen, Torbjorn
    Lee, Chin-Hui
    COMPUTER SPEECH AND LANGUAGE, 2013, 27 (01): : 209 - 227
  • [2] Exploiting Context-Dependency and Acoustic Resolution of Universal Speech Attribute Models in Spoken Language Recognition
    Siniscalchi, Sabato Marco
    Reed, Jeremy
    Svendsen, Torbjorn
    Lee, Chin-Hui
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2726 - +
  • [3] PROSODIC ATTRIBUTE MODEL FOR SPOKEN LANGUAGE IDENTIFICATION
    Ng, Raymond W. M.
    Leung, Cheung-Chi
    Lee, Tan
    Ma, Bin
    Li, Haizhou
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5022 - 5025
  • [4] Cultivating Spoken Language Technologies for Unwritten Languages
    Reitmaier, Thomas
    Raju, Dani Kalarikalayil
    Klejch, Ondrej
    Wallington, Electra
    Markl, Nina
    Pearson, Jennifer
    Jones, Matt
    Bell, Peter
    Robinson, Simon
    PROCEEDINGS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYTEMS, CHI 2024, 2024,
  • [5] Cultivating Spoken Language Technologies for Unwriten Languages
    Swansea University, Swansea, United Kingdom
    不详
    不详
    不详
    Conf Hum Fact Comput Syst Proc,
  • [6] Spoken language recognition with relevance feedback
    Tong, Rong
    Li, Haizhou
    Ma, Bin
    Chng, Eng Siong
    Cho, Siu-Yeung
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 861 - +
  • [7] Spoken Language Recognition With Prosodic Features
    Ng, Raymond W. M.
    Lee, Tan
    Leung, Cheung-Chi
    Ma, Bin
    Li, Haizhou
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (09): : 1841 - 1853
  • [8] Discriminative vector for spoken language recognition
    Ma, Bin
    Tong, Rong
    Li, Haizhou
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 1001 - +
  • [9] Spoken Language Treebanks in Universal Dependencies: an Overview
    Dobrovoljc, Kaja
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 1798 - 1806
  • [10] Gesture links language and cognition for spoken and signed languages
    Kita, Sotaro
    Emmorey, Karen
    NATURE REVIEWS PSYCHOLOGY, 2023, 2 (07): : 407 - 420