Subspace-Based Feature Representation and Learning for Language Recognition

被引:0
|
作者
Shih, Yu-Chin [1 ]
Lee, Hung-Shin [1 ]
Wang, Hsin-Min
Jeng, Shyh-Kang [1 ]
机构
[1] Natl Taiwan Univ, Dept Elect Engn, Taipei 10764, Taiwan
关键词
language recognition; subspace-based learning; IDENTIFICATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a novel subspace-based approach for phonotactic language recognition. The whole framework is divided into two parts: the speech feature representation and the subspace-based learning algorithm. First, the phonetic information as well as the contextual relationship, possessed by spoken utterances, are more abundantly retrieved by likelihood computation and feature concatenation through the decoding processed by an automatic speech recognizer. It is assumed that the extracted phone frames reside in a lower dimensional eigen-subspace, in which the structure of data can be approximately captured. Each utterance is further represented by a fixed-dimensional linear subspace. Second, to measure the similarity between two utterances, suitable non-Euclidean metrics are explored and applied to non-linear discriminant analysis in a kernel fashion, followed by a back-end classifier, such as the k-nearest neighbor (K-NN) classifier. The results of experiments on the OGI-TS database demonstrate that the proposed framework outperforms the well-known vector space modeling based method with relative reductions of 38.90% and 27.13% on the 1-to-50-second and 3-second data sets respectively in equal error rate (EER).
引用
收藏
页码:2059 / 2062
页数:4
相关论文
共 50 条
  • [1] Subspace-Based Representation and Learning for Phonotactic Spoken Language Recognition
    Lee, Hung-Shin
    Tsao, Yu
    Jeng, Shyh-Kang
    Wang, Hsin-Min
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 3065 - 3079
  • [2] Complemented subspace-based weighted collaborative representation model for imbalanced learning
    Li, Yanting
    Jin, Junwei
    Tao, Hongwei
    Xiao, Yang
    Liang, Jing
    Chen, C. L. Philip
    APPLIED SOFT COMPUTING, 2024, 153
  • [3] Subspace-Based Face Recognition on an FPGA
    Pizarro, Pablo
    Figueroa, Miguel
    ENGINEERING APPLICATIONS OF NEURAL NETWORKS, PT I, 2011, 363 : 84 - 89
  • [4] SUBSPACE-BASED PHONOTACTIC LANGUAGE RECOGNITION USING MULTIVARIATE DYNAMIC LINEAR MODELS
    Lee, Hung-Shin
    Shih, Yu-Chin
    Wang, Hsin-Min
    Jeng, Shyh-Kang
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 6870 - 6874
  • [6] Cancelable Biometric Recognition With ECGs: Subspace-Based
    Wu, Shun-Chi
    Chen, Peng-Tzu
    Swindlehurst, A. Lee
    Hung, Pei-Lun
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2019, 14 (05) : 1323 - 1336
  • [7] Face recognition approach by subspace extended sparse representation and discriminative feature learning
    Liao, Mengmeng
    Gu, Xiaodong
    NEUROCOMPUTING, 2020, 373 : 35 - 49
  • [8] Subspace-based Feature Alignment for Unsupervised Domain Adaptation
    Yi, Eojindl
    Kim, Junmo
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 10163 - 10169
  • [9] Slow feature subspace: A video representation based on slow feature analysis for action recognition
    Beleza, Suzana Rita Alves
    Shimomoto, Erica K.
    Souza, Lincon S.
    Fukui, Kazuhiro
    MACHINE LEARNING WITH APPLICATIONS, 2023, 14
  • [10] Subspace-based feature extraction on multi-physiological measurements of automobile drivers for distress recognition
    Esener, Idil Isikli
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2021, 66 (66)