Exploring Universal Attribute Characterization of Spoken Languages for Spoken Language Recognition

被引：0

作者：

Siniscalchi, Sabato Marco ^{[1
]}

Reed, Jeremy ^{[2
]}

Svendsen, Torbjorn ^{[1
]}

Lee, Chin-Hui ^{[2
]}

机构：

[1] NTNU, Dept Elect & Telecommun, Trondheim, Norway

[2] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA

来源：

INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 | 2009年

关键词：

Language recognition; vector space modeling; phonetic features;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose a novel universal acoustic characterization approach to spoken language identification (LID), in which any spoken language is described with a common set of fundamental units defined "universally." Specifically, manner and place of articulation form this unit inventory and are used to build a set of universal attribute models with data-driven techniques. Using the vector space modeling approaches to LID a spoken utterance is first decoded into a sequence of attributes. Then, a feature vector consisting of co-occurrence statistics of attribute units is created, and the final LID decision is implemented with a set of vector space language classifiers. Although the present study is just in its preliminary stage, promising results comparable to acoustically rich phone-based LID systems have already been obtained on the NIST 2003 LID task. The results provide clear insight for further performance improvements and encourage a continuing exploration of the proposed framework.

引用

页码：168 / +

页数：2

共 50 条

[21] Spoken Language Recognition: From Fundamentals to Practice
Li, Haizhou
Ma, Bin
Lee, Kong Aik
PROCEEDINGS OF THE IEEE, 2013, 101 (05) : 1136 - 1159
[22] Application of GMM Models to Spoken Language Recognition
Dustor, Adam
Szwarc, Pawel
MIXDES 2009: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE MIXED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2009, : 603 - 606
[23] A Syllable Structure Approach to Spoken Language Recognition
Lee, Ruei-Hung Alex
Jang, Jyh-Shing Roger
STATISTICAL LANGUAGE AND SPEECH PROCESSING, SLSP 2018, 2018, 11171 : 56 - 66
[24] Spoken Language Recognition in the Latent Topic Simplex
Lee, Kong Aik
You, Chang Huai
Hautamaeki, Ville
Larcher, Anthony
Li, Haizhou
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2944 - 2947
[25] Spoken language recognition using ensemble classifiers
Ma, Bin
Li, Haizhou
Tong, Rong
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (07): : 2053 - 2062
[26] SPOKEN LANGUAGE RECOGNITION ON A DSP ARRAY PROCESSOR
GLINSKI, S
ROE, D
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1994, 5 (07) : 697 - 703
[27] Language Modeling for Speech Recognition of Spoken Cantonese
Yeung, Yu Ting
Cao, Houwei
Zheng, N. H.
Lee, Tan
Ching, P. C.
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1570 - 1573
[28] STATE AND THE LANGUAGES SPOKEN IN IT
KNAPPERT, J
LINGUISTICS, 1978, (214) : 69 - 76
[29] TechWare: Speaker and Spoken Language Recognition Resources
Li, Haizhou
Ma, Bin
IEEE SIGNAL PROCESSING MAGAZINE, 2010, 27 (06) : 139 - 142
[30] Universal Adversarial Attacks On Spoken Language Assessment Systems
Raina, Vyas
Gales, Mark J. F.
Knill, Kate M.
INTERSPEECH 2020, 2020, : 3855 - 3859

← 1 2 3 4 5 →