Research on monaural speech segregation based on feature selection

Citations: 0
Authors
Xie, Xiaoping [1]
Chen, Yongzhen [1]
Shen, Rufeng [1]
Tian, Dan [1]
Affiliations
[1] Hunan Univ, State Key Lab Adv Design & Mfg Vehicle Body, Changsha 410082, Hunan, Peoples R China
Keywords
Feature selection; Group lasso; Deep Neural Network (DNN); Monaural speech segregation; Complementary feature group; REGRESSION; NOISE
DOI
10.1186/s13636-023-00276-9
Chinese Library Classification (CLC)
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
A speech feature model underlies speech and noise separation, speech expression, and conversion between speech styles. As signal processing methods have developed, the number of feature types and their dimensionality have grown, which makes it difficult to select appropriate features. Here, a feature is a combination of parameters that characterizes speech, and a single feature is a combination with only one parameter. A single feature represents the speech signal incompletely, while multiple features introduce redundancy that degrades speech separation performance. In this paper, a feature selection method is used to select and combine eight widely used speech features and parameters. A Deep Neural Network (DNN) is used to evaluate and analyze the speech separation performance of the different feature groups. The comparison shows that the complementary feature group yields better speech segregation, which verifies its effectiveness in improving DNN-based speech separation.
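The abstract does not spell out the selection procedure, but the keywords list group lasso. As a rough illustration only, the sketch below shows how a group-lasso penalty can keep or discard whole feature groups at once. It assumes a plain least-squares loss solved by proximal gradient descent on synthetic data; the group names `feature_A`, `feature_B`, `feature_C`, the column layout, and the value of `lam` are hypothetical and are not taken from the paper.

```python
import numpy as np


def group_soft_threshold(w, threshold):
    """Block soft-thresholding: shrink a whole group, or zero it out entirely."""
    norm = np.linalg.norm(w)
    if norm <= threshold:
        return np.zeros_like(w)
    return (1.0 - threshold / norm) * w


def group_lasso(X, y, groups, lam=0.1, n_iter=500):
    """Minimize (1/2n)||Xw - y||^2 + lam * sum_g sqrt(p_g) * ||w_g||_2.

    `groups` maps a (hypothetical) feature name to its column indices in X.
    """
    n_samples, n_features = X.shape
    w = np.zeros(n_features)
    # Step size 1/L, where L = sigma_max(X)^2 / n bounds the gradient's Lipschitz constant.
    step = n_samples / (np.linalg.norm(X, 2) ** 2)
    for _ in range(n_iter):
        grad = X.T @ (X @ w - y) / n_samples
        z = w - step * grad
        for cols in groups.values():
            # Penalty weight scales with sqrt(group size), a common convention.
            z[cols] = group_soft_threshold(z[cols], step * lam * np.sqrt(len(cols)))
        w = z
    return w


def selected_groups(w, groups, tol=1e-8):
    """Names of the groups whose weights were not driven to zero."""
    return [name for name, cols in groups.items() if np.linalg.norm(w[cols]) > tol]


# Synthetic stand-in for stacked speech features (assumption: three groups, 8 columns).
rng = np.random.default_rng(0)
groups = {"feature_A": [0, 1, 2], "feature_B": [3, 4], "feature_C": [5, 6, 7]}
X = rng.standard_normal((200, 8))
true_w = np.array([1.0, -2.0, 1.5, 0.0, 0.0, 0.5, 1.0, -1.0])  # feature_B carries no signal
y = X @ true_w + 0.01 * rng.standard_normal(200)

w = group_lasso(X, y, groups, lam=0.1)
chosen = selected_groups(w, groups)
print(chosen)
```

In a setup like this, the surviving groups would then be concatenated and passed to the separation model for evaluation; the DNN stage itself is not sketched here.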
Pages: 10
Related papers
50 in total
  • [1] Research on monaural speech segregation based on feature selection
    Xiaoping Xie
    Yongzhen Chen
    Rufeng Shen
    Dan Tian
    EURASIP Journal on Audio, Speech, and Music Processing, 2023
  • [2] Auditory Feature for Monaural Speech Segregation
    Jiang, Yi
    Liu, Runsheng
    Zu, Yuanyuan
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON INFORMATION, ELECTRONICS AND COMPUTER, 2014, 59 : 69 - 72
  • [3] New research on monaural speech segregation based on quality assessment
    Xie, Xiaoping
    Li, Can
    Tian, Dan
    Shen, Rufeng
    Ding, Fei
    COMPUTER SPEECH AND LANGUAGE, 2024, 85
  • [4] Pitch-based monaural segregation of reverberant speech
    Roman, Nicoleta
    Wang, DeLiang
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 120 (01): : 458 - 469
  • [6] Monaural speech segregation and oscillatory correlation
    Wang, DL
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS 2003, VOLS 1-4, 2003, : 1574 - 1574
  • [7] On amplitude modulation for monaural speech segregation
    Hu, GN
    Wang, DL
    PROCEEDING OF THE 2002 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-3, 2002, : 69 - 74
  • [8] Monaural speech segregation based on pitch tracking and amplitude modulation
    Hu, GN
    Wang, DL
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2004, 15 (05): : 1135 - 1150
  • [9] Monaural Voiced Speech Segregation Based on Dynamic Harmonic Function
    Xueliang Zhang
    Wenju Liu
    Bo Xu
    EURASIP Journal on Audio, Speech, and Music Processing, 2010
  • [10] Monaural speech segregation based on pitch tracking and amplitude modulation
    Hu, GN
    Wang, DL
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 553 - 556