DNN-Based Voice Activity Detection with Multi-Task Learning

Cited by: 31
Authors
Kang, Tae Gyoon [1, 2]
Kim, Nam Soo [1, 2]
Affiliations
[1] Seoul Natl Univ, Dept Elect & Comp Engn, Seoul 151742, South Korea
[2] Seoul Natl Univ, Inst New Media & Commun, Seoul 151742, South Korea
Source
Funding
National Research Foundation of Singapore;
Keywords
deep neural network; voice activity detection; multi-task learning; NETWORKS;
DOI
10.1587/transinf.2015EDL8168
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812 ;
Abstract
Recently, notable improvements in the voice activity detection (VAD) problem have been achieved by adopting several machine learning techniques. Among them, the deep neural network (DNN), which learns the mapping between noisy speech features and the corresponding voice activity status through its deep hidden structure, has been one of the most popular. In this letter, we propose a novel approach that enhances the robustness of the DNN in mismatched noise conditions using a multi-task learning (MTL) framework. In the proposed algorithm, a feature enhancement task for speech features is trained jointly with the conventional VAD task. The experimental results show that the DNN with the proposed framework outperforms the conventional DNN-based VAD algorithm.
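The joint training idea in the abstract can be illustrated with a minimal sketch. The abstract does not specify the network or the loss, so everything here is an assumption made for illustration: a single shared hidden layer, a sigmoid VAD head trained with cross-entropy, a linear feature-enhancement head trained with mean squared error, and the hypothetical weighting parameter `enh_weight`. It is not the authors' implementation.

```python
import math
import random

def dense(x, W, b):
    """Affine layer: W @ x + b, with W stored as a list of rows."""
    return [sum(w * xi for w, xi in zip(row, x)) + bi for row, bi in zip(W, b)]

def relu(v):
    return [max(0.0, a) for a in v]

def sigmoid(a):
    return 1.0 / (1.0 + math.exp(-a))

def init(rows, cols, rng):
    """Small random weights (illustrative initialisation only)."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]

def forward(noisy, params):
    """Shared hidden layer feeding two task-specific output heads."""
    h = relu(dense(noisy, params["W_shared"], params["b_shared"]))
    vad_prob = sigmoid(dense(h, params["W_vad"], params["b_vad"])[0])
    enhanced = dense(h, params["W_enh"], params["b_enh"])
    return vad_prob, enhanced

def joint_loss(vad_prob, enhanced, label, clean, enh_weight=0.5):
    """Cross-entropy for VAD plus a weighted MSE for feature enhancement.

    enh_weight is a hypothetical trade-off parameter, not taken from the paper.
    """
    eps = 1e-12
    bce = -(label * math.log(vad_prob + eps)
            + (1 - label) * math.log(1.0 - vad_prob + eps))
    mse = sum((e - c) ** 2 for e, c in zip(enhanced, clean)) / len(clean)
    return bce + enh_weight * mse

rng = random.Random(0)
n_feat, n_hidden = 4, 8
params = {
    "W_shared": init(n_hidden, n_feat, rng), "b_shared": [0.0] * n_hidden,
    "W_vad": init(1, n_hidden, rng), "b_vad": [0.0],
    "W_enh": init(n_feat, n_hidden, rng), "b_enh": [0.0] * n_feat,
}

noisy = [0.9, 0.2, 0.4, 0.7]   # toy noisy feature vector for one frame
clean = [0.8, 0.1, 0.3, 0.6]   # toy clean target for the enhancement head
vad_prob, enhanced = forward(noisy, params)
loss = joint_loss(vad_prob, enhanced, label=1, clean=clean)
```

Because both heads read the same hidden representation, gradients from the enhancement loss would also shape the features used for the speech/non-speech decision, which is the mechanism the abstract credits for improved robustness. Setting `enh_weight` to zero in this sketch reduces it to a single-task VAD loss.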
Pages: 550-553
Number of pages: 4
Related papers
50 in total
  • [21] A multi-task based deep learning approach for intrusion detection
    Liu, Qigang
    Wang, Deming
    Jia, Yuhang
    Luo, Suyuan
    Wang, Chongren
    KNOWLEDGE-BASED SYSTEMS, 2022, 238
  • [22] Multi-Task Learning U-Net for Single-Channel Speech Enhancement and Mask-Based Voice Activity Detection
    Lee, Geon Woo
    Kim, Hong Kook
    APPLIED SCIENCES-BASEL, 2020, 10 (09):
  • [23] Multi-task self-supervised learning for human activity detection
    Saeed, Aaqib
    Ozcelebi, Tanir
    Lukkien, Johan
    Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 2019, 3 (02)
  • [24] Multi-task learning for video anomaly detection*
    Chang, Xingya
    Zhang, Yuxin
    Xue, Dingyu
    Chen, Dongyue
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 87
  • [25] Multi-task learning for video anomaly detection
    Chang, Xingya
    Zhang, Yuxin
    Xue, Dingyu
    Chen, Dongyue
    Journal of Visual Communication and Image Representation, 2022, 87
  • [26] Automatic Cataract Detection with Multi-Task Learning
    Wu, Hongjie
    Lv, Jiancheng
    Wang, Jian
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [27] Multi-task gradient descent for multi-task learning
    Lu Bai
    Yew-Soon Ong
    Tiantian He
    Abhishek Gupta
    Memetic Computing, 2020, 12 : 355 - 369
  • [28] Multi-task gradient descent for multi-task learning
    Bai, Lu
    Ong, Yew-Soon
    He, Tiantian
    Gupta, Abhishek
    MEMETIC COMPUTING, 2020, 12 (04) : 355 - 369
  • [29] DNN-based Human Activity Recognition by Learning Initial Motion Data for Virtual Multi-Sports
    Kim, Jong-Sung
    2022 24TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT): ARTIFICIAL INTELLIGENCE TECHNOLOGIES TOWARD CYBERSECURITY, 2022, : 373 - 375
  • [30] DNN-based Human Activity Recognition by Learning Initial Motion Data for Virtual Multi-Sports
    Kim, Jong-Sung
    2021 23RD INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT 2021): ON-LINE SECURITY IN PANDEMIC ERA, 2021, : 373 - 375