Subspace-Based Feature Representation and Learning for Language Recognition

Cited by: 0

Authors:

Shih, Yu-Chin [1]

Lee, Hung-Shin [1]

Wang, Hsin-Min

Jeng, Shyh-Kang [1]

Affiliations:

[1] Natl Taiwan Univ, Dept Elect Engn, Taipei 10764, Taiwan

Source:

13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3 | 2012

Keywords:

language recognition; subspace-based learning; IDENTIFICATION;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification code:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper presents a novel subspace-based approach for phonotactic language recognition. The whole framework is divided into two parts: the speech feature representation and the subspace-based learning algorithm. First, the phonetic information as well as the contextual relationship, possessed by spoken utterances, are more abundantly retrieved by likelihood computation and feature concatenation through the decoding processed by an automatic speech recognizer. It is assumed that the extracted phone frames reside in a lower dimensional eigen-subspace, in which the structure of data can be approximately captured. Each utterance is further represented by a fixed-dimensional linear subspace. Second, to measure the similarity between two utterances, suitable non-Euclidean metrics are explored and applied to non-linear discriminant analysis in a kernel fashion, followed by a back-end classifier, such as the k-nearest neighbor (K-NN) classifier. The results of experiments on the OGI-TS database demonstrate that the proposed framework outperforms the well-known vector space modeling based method with relative reductions of 38.90% and 27.13% on the 1-to-50-second and 3-second data sets respectively in equal error rate (EER).
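The abstract's idea of representing each utterance as a fixed-dimensional linear subspace and comparing utterances with a non-Euclidean metric can be illustrated with a minimal sketch. Everything specific here is an assumption for illustration, not the paper's confirmed method: the abstract does not say which metric is used, so the projection kernel and projection metric on subspaces are swapped in as one common choice; the function names, the uncentered SVD, the feature dimension, and the subspace dimension are all hypothetical.

```python
import numpy as np


def utterance_subspace(frames, dim):
    """Return an orthonormal basis (D x dim) for the dominant subspace of an utterance.

    frames: array of shape (num_frames, D), one feature vector per row.
    Assumption: an uncentered SVD is used; the paper may construct the subspace differently.
    """
    # Left singular vectors of the D x num_frames matrix, ordered by singular value.
    u, _, _ = np.linalg.svd(frames.T, full_matrices=False)
    return u[:, :dim]


def projection_kernel(basis_a, basis_b):
    """Projection kernel ||A^T B||_F^2, i.e. the sum of squared cosines of the principal angles."""
    return np.linalg.norm(basis_a.T @ basis_b, "fro") ** 2


def projection_distance(basis_a, basis_b):
    """Projection metric sqrt(dim - sum_i cos^2(theta_i)) between two subspaces of equal dimension."""
    dim = basis_a.shape[1]
    # Clip tiny negative values caused by floating-point round-off.
    return np.sqrt(max(dim - projection_kernel(basis_a, basis_b), 0.0))


# Hypothetical usage: two utterances of different lengths map to subspaces of the same size.
rng = np.random.default_rng(0)
utt_a = utterance_subspace(rng.normal(size=(200, 10)), dim=3)
utt_b = utterance_subspace(rng.normal(size=(150, 10)), dim=3)
print(projection_kernel(utt_a, utt_b), projection_distance(utt_a, utt_b))
```

A kernel matrix built from such pairwise values is the kind of input a kernel discriminant analysis and a k-nearest-neighbor back end could consume; those stages are not shown here.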


Pages: 2059-2062

Number of pages: 4

50 records in total

[1] Subspace-Based Representation and Learning for Phonotactic Spoken Language Recognition
Lee, Hung-Shin
Tsao, Yu
Jeng, Shyh-Kang
Wang, Hsin-Min
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 3065 - 3079
[2] Complemented subspace-based weighted collaborative representation model for imbalanced learning
Li, Yanting
Jin, Junwei
Tao, Hongwei
Xiao, Yang
Liang, Jing
Chen, C. L. Philip
APPLIED SOFT COMPUTING, 2024, 153
[3] Subspace-Based Face Recognition on an FPGA
Pizarro, Pablo
Figueroa, Miguel
ENGINEERING APPLICATIONS OF NEURAL NETWORKS, PT I, 2011, 363 : 84 - 89
[4] SUBSPACE-BASED PHONOTACTIC LANGUAGE RECOGNITION USING MULTIVARIATE DYNAMIC LINEAR MODELS
Lee, Hung-Shin
Shih, Yu-Chin
Wang, Hsin-Min
Jeng, Shyh-Kang
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 6870 - 6874
[5] Subspace-based learning for face retrieval
Alarmel Mangai, M., 2012, Praise Worthy Prize (07)
[6] Cancelable Biometric Recognition With ECGs: Subspace-Based
Wu, Shun-Chi
Chen, Peng-Tzu
Swindlehurst, A. Lee
Hung, Pei-Lun
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2019, 14 (05) : 1323 - 1336
[7] Face recognition approach by subspace extended sparse representation and discriminative feature learning
Liao, Mengmeng
Gu, Xiaodong
NEUROCOMPUTING, 2020, 373 : 35 - 49
[8] Subspace-based Feature Alignment for Unsupervised Domain Adaptation
Yi, Eojindl
Kim, Junmo
2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 10163 - 10169
[9] Slow feature subspace: A video representation based on slow feature analysis for action recognition
Beleza, Suzana Rita Alves
Shimomoto, Erica K.
Souza, Lincon S.
Fukui, Kazuhiro
MACHINE LEARNING WITH APPLICATIONS, 2023, 14
[10] Subspace-based feature extraction on multi-physiological measurements of automobile drivers for distress recognition
Esener, Idil Isikli
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2021, 66 (66)
