DNN-Based Voice Activity Detection with Multi-Task Learning

被引:31
|
作者
Kang, Tae Gyoon [1 ,2 ]
Kim, Nam Soo [1 ,2 ]
机构
[1] Seoul Natl Univ, Dept Elect & Comp Engn, Seoul 151742, South Korea
[2] Seoul Natl Univ, Inst New Media & Commun, Seoul 151742, South Korea
来源
基金
新加坡国家研究基金会;
关键词
deep neural network; voice activity detection; multi-task learning; NETWORKS;
D O I
10.1587/transinf.2015EDL8168
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, notable improvements in voice activity detection (VAD) problem have been achieved by adopting several machine learning techniques. Among them, the deep neural network (DNN) which learns the mapping between the noisy speech features and the corresponding voice activity status with its deep hidden structure has been one of the most popular techniques. In this letter, we propose a novel approach which enhances the robustness of DNN in mismatched noise conditions with multi-task learning (MTL) framework. In the proposed algorithm, a feature enhancement task for speech features is jointly trained with the conventional VAD task. The experimental results show that the DNN with the proposed framework outperforms the conventional DNN-based VAD algorithm.
引用
收藏
页码:550 / 553
页数:4
相关论文
共 50 条
  • [41] Attacking DNN-based Intrusion Detection Models
    Zhang, Xingwei
    Zheng, Xiaolong
    Wu, Desheng Dash
    IFAC PAPERSONLINE, 2020, 53 (05): : 415 - 419
  • [42] Multi-scene citrus detection based on multi-task deep learning network
    Wen, Chenxin
    Zhang, Hui
    Li, Honghao
    Li, Hongwen
    Chen, Jinhai
    Guo, Hangge
    Cheng, Shihui
    2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 912 - 919
  • [43] Multi-task Learning Based on Multi-type Dataset for Retinal Abnormality Detection
    Zhao, Linna
    Li, Jianqiang
    Ma, Zerui
    Guan, Yu
    Xu, Xi
    Wang, Xiaoxi
    Li, Li
    2021 IEEE INTERNATIONAL CONFERENCE ON DIGITAL HEALTH (ICDH 2021), 2021, : 160 - 165
  • [44] A Multi-Task Scheme for Supervised DNN-Based Single-Channel Speech Enhancement by Using Speech Presence Probability as the Secondary Training Target
    Wang, Lei
    Zhu, Jie
    Sun, Kangbo
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2021, E104D (11) : 1963 - 1970
  • [45] MULTI-TASK LEARNING IMPROVES SYNTHETIC SPEECH DETECTION
    Mo, Yichuan
    Wang, Shilin
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6392 - 6396
  • [46] Multi-Task Learning for Intrusion Detection on web logs
    Li, Bo
    Lin, Ying
    Zhang, Simin
    JOURNAL OF SYSTEMS ARCHITECTURE, 2017, 81 : 92 - 100
  • [47] Multi-task Learning for Stance and Early Rumor Detection
    Chen, Yongheng
    Yin, Chunyan
    Zuo, Wanli
    OPTICAL MEMORY AND NEURAL NETWORKS, 2021, 30 (02) : 131 - 139
  • [48] Multi-task learning for object keypoints detection and classification
    Xu, Jie
    Zhao, Lin
    Zhang, Shanshan
    Gong, Chen
    Yang, Jian
    PATTERN RECOGNITION LETTERS, 2020, 130 : 182 - 188
  • [49] Interdependent Multi-task Learning for Simultaneous Segmentation and Detection
    Reginthala, Mahesh
    Iwahori, Yuji
    Bhuyan, M. K.
    Hayashi, Yoshitsugu
    Achariyaviriya, Witsarut
    Kijsirikul, Boonserm
    ICPRAM: PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS, 2020, : 167 - 174
  • [50] Driver Drowsiness Detection by Multi-task and Transfer Learning
    Chang, Yuan
    Kameyama, Wataru
    INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY (IWAIT) 2022, 2022, 12177