Variable Selection Linear Regression for Robust Speech Recognition

Cited by: 1
Authors
Tsao, Yu [1]
Hu, Ting-Yao [2]
Sakti, Sakriani [3]
Nakamura, Satoshi [3]
Lee, Lin-shan [2]
Affiliations
[1] Acad Sinica, Res Ctr Informat Technol Innovat, Taipei 115, Taiwan
[2] Natl Taiwan Univ, Grad Inst Commun Engn, Taipei 10764, Taiwan
[3] Nara Inst Sci & Technol, Grad Sch Informat Sci, Ikoma 6300192, Japan
Keywords
variable selection; linear regression; MLLR; fMLLR; model space adaptation; feature space adaptation; RAPID SPEAKER ADAPTATION; NOISY ENVIRONMENTS; HMM ADAPTATION; TRANSFORMATION; EIGENSPACE; EIGENVOICE
DOI
10.1587/transinf.E97.D.1477
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
This study proposes a variable selection linear regression (VSLR) adaptation framework to improve the accuracy of automatic speech recognition (ASR) when only limited, unlabeled adaptation data are available. The proposed framework has three phases. The first phase prepares multiple variable subsets by applying a ranking filter to the original regression variable set. The second phase determines the best variable subset according to a pre-determined performance evaluation criterion and computes a linear regression (LR) mapping function from that subset. The third phase performs adaptation in either the model space or the feature space. Together, the three phases select the optimal components and remove redundancies in the LR mapping function, enabling VSLR to provide satisfactory adaptation performance even with a very limited number of adaptation statistics. We formulate model space VSLR and feature space VSLR by integrating the VS techniques into conventional LR adaptation systems. Experimental results on the Aurora-4 task show that model space VSLR and feature space VSLR outperform standard maximum likelihood linear regression (MLLR) and feature space MLLR (fMLLR), respectively, as well as their extensions, with notable word error rate (WER) reductions under per-utterance unsupervised adaptation.
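The three phases described in the abstract can be illustrated with a generic, non-ASR sketch. This is not the authors' VSLR implementation: the ranking filter (absolute correlation with the target), the evaluation criterion (held-out mean squared error), and all function names (`rank_variables`, `fit_lr`, `predict_lr`, `select_and_fit`) are assumptions chosen for illustration, and the paper's actual filter, criterion, and MLLR/fMLLR statistics are not given in this record.

```python
import numpy as np


def rank_variables(X, y):
    """Phase 1 (assumed filter): rank columns by |correlation| with y."""
    Xc = X - X.mean(axis=0)
    yc = y - y.mean()
    denom = np.linalg.norm(Xc, axis=0) * np.linalg.norm(yc) + 1e-12
    scores = np.abs(Xc.T @ yc) / denom
    return np.argsort(-scores)  # best-ranked variable first


def fit_lr(X, y):
    """Least-squares linear regression with a bias term (last weight)."""
    A = np.hstack([X, np.ones((X.shape[0], 1))])
    w, *_ = np.linalg.lstsq(A, y, rcond=None)
    return w


def predict_lr(X, w):
    """Apply the linear mapping (stand-in for phase 3, the adaptation step)."""
    return X @ w[:-1] + w[-1]


def select_and_fit(X, y, X_val, y_val):
    """Phase 2: try nested subsets from the ranking, keep the best one.

    The criterion here is held-out MSE, which is an assumption of this sketch.
    Returns (error, selected column indices, fitted weights).
    """
    order = rank_variables(X, y)
    best = None
    for k in range(1, X.shape[1] + 1):
        subset = order[:k]
        w = fit_lr(X[:, subset], y)
        err = float(np.mean((predict_lr(X_val[:, subset], w) - y_val) ** 2))
        if best is None or err < best[0]:
            best = (err, subset, w)
    return best


# Synthetic usage: only columns 0 and 3 carry signal.
rng = np.random.default_rng(0)
X_all = rng.normal(size=(300, 8))
y_all = 3.0 * X_all[:, 0] - 2.0 * X_all[:, 3] + 0.1 * rng.normal(size=300)
err, subset, w = select_and_fit(X_all[:200], y_all[:200], X_all[200:], y_all[200:])
print("selected columns:", sorted(subset.tolist()), "validation MSE:", err)
```

Restricting the regression to the top-ranked variables reduces the number of parameters that must be estimated, which is the general motivation the abstract gives for using variable selection when adaptation data are scarce.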
Pages: 1477-1487
Page count: 11