DNN-Based Voice Activity Detection with Multi-Task Learning

Cited by: 31
Authors
Kang, Tae Gyoon [1, 2]
Kim, Nam Soo [1, 2]
Affiliations
[1] Seoul Natl Univ, Dept Elect & Comp Engn, Seoul 151742, South Korea
[2] Seoul Natl Univ, Inst New Media & Commun, Seoul 151742, South Korea
Source
Funding
National Research Foundation of Singapore
Keywords
deep neural network; voice activity detection; multi-task learning; NETWORKS
DOI
10.1587/transinf.2015EDL8168
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Recently, notable improvements in voice activity detection (VAD) have been achieved by adopting several machine learning techniques. Among them, the deep neural network (DNN), which learns the mapping between noisy speech features and the corresponding voice activity status through its deep hidden structure, has been one of the most popular. In this letter, we propose a novel approach that enhances the robustness of the DNN in mismatched noise conditions using a multi-task learning (MTL) framework. In the proposed algorithm, a feature enhancement task for speech features is trained jointly with the conventional VAD task. The experimental results show that the DNN with the proposed framework outperforms the conventional DNN-based VAD algorithm.
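The joint training described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the network size, the input features, the loss functions, or how the two tasks are weighted, so the single shared ReLU layer, the cross-entropy and mean-squared-error losses, the `weight` parameter, and all function names below are assumptions made purely for illustration.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def init_layer(n_out, n_in, rng):
    # Small random weights and zero biases (assumed initialisation).
    W = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    b = [0.0] * n_out
    return W, b


def dense(x, W, b):
    # Affine layer: one output per row of W.
    return [sum(w * v for w, v in zip(row, x)) + bias for row, bias in zip(W, b)]


def forward(params, noisy):
    # Shared hidden representation used by both tasks.
    h = [max(0.0, v) for v in dense(noisy, *params["shared"])]
    # Task 1: probability that the frame contains speech (VAD).
    vad_prob = sigmoid(dense(h, *params["vad"])[0])
    # Task 2: estimate of the clean feature vector (feature enhancement).
    enhanced = dense(h, *params["enh"])
    return vad_prob, enhanced


def mtl_loss(params, noisy, clean, label, weight=0.5):
    # Joint objective: VAD cross-entropy plus a weighted enhancement MSE.
    # The weighting scheme is an assumption, not taken from the letter.
    vad_prob, enhanced = forward(params, noisy)
    eps = 1e-12
    ce = -(label * math.log(vad_prob + eps)
           + (1 - label) * math.log(1.0 - vad_prob + eps))
    mse = sum((e - c) ** 2 for e, c in zip(enhanced, clean)) / len(clean)
    return ce + weight * mse, ce, mse


# Toy usage with made-up 4-dimensional features.
rng = random.Random(0)
dim, hidden = 4, 8
params = {
    "shared": init_layer(hidden, dim, rng),
    "vad": init_layer(1, hidden, rng),
    "enh": init_layer(dim, hidden, rng),
}
clean = [0.5, -0.2, 0.1, 0.3]
noisy = [c + rng.gauss(0.0, 0.1) for c in clean]
total, ce, mse = mtl_loss(params, noisy, clean, label=1)
print(f"total={total:.4f} vad_ce={ce:.4f} enh_mse={mse:.4f}")
```

Because both heads read the same hidden layer, gradients from the enhancement loss would also shape the representation used for VAD, which is the general mechanism by which multi-task learning is expected to help in mismatched noise conditions. Setting `weight=0` in this sketch reduces it to a single-task VAD objective.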
Pages: 550-553
Number of pages: 4
Related papers
50 in total
  • [31] Multi-task Feature Learning Based Anomaly Detection of Network Dataflow
    Ren Hui-feng
    Yan Feng
    Dong Qing-chao
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 4144 - 4147
  • [32] CURRICULUM BASED MULTI-TASK LEARNING FOR PARKINSON'S DISEASE DETECTION
    Dhinagar, Nikhil J.
    Owens-Walton, Conor
    Laltoo, Emily
    Boyle, Christina P.
    Chen, Yao-Liang
    Cook, Philip
    McMillan, Corey
    Tsai, Chih-Chien
    Wang, J-J
    Wu, Yih-Ru
    Van der Werf, Ysbrand
    Thompson, Paul M.
    2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023
  • [33] Contrastive Learning based Multi-task Network for Image Manipulation Detection
    Yin, Qilin
    Wang, Jinwei
    Lu, Wei
    Luo, Xiangyang
    SIGNAL PROCESSING, 2022, 201
  • [34] Smart Contract Vulnerability Detection Model Based on Multi-Task Learning
    Huang, Jing
    Zhou, Kuo
    Xiong, Ao
    Li, Dongmeng
    SENSORS, 2022, 22 (05)
  • [35] Event Detection via Context Understanding Based on Multi-task Learning
    Xia, Jing
    Li, Xiaolong
    Tan, Yongbin
    Zhang, Wu
    Li, Dajun
    Xiong, Zhengkun
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (01)
  • [36] Multi-task Learning Based on Multiple Data Sources for Cancer Detection
    Hong, Siyi
    2021 3RD INTERNATIONAL CONFERENCE ON MACHINE LEARNING, BIG DATA AND BUSINESS INTELLIGENCE (MLBDBI 2021), 2021, : 486 - 491
  • [37] Multi-Task Learning Based Joint Pulse Detection and Modulation Classification
    Akyon, Fatih Cagatay
    Nuhoglu, Mustafa Atahan
    Alp, Yasar Kemal
    Arikan, Orhan
    2019 27TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2019
  • [38] Image Inpainting Detection Based on Multi-task Deep Learning Network
    Wang, Xinyi
    Niu, Shaozhang
    Wang, He
    IETE TECHNICAL REVIEW, 2021, 38 (01) : 149 - 157
  • [39] DNN-Based Radar Target Detection With OTFS
    Tan, Long
    Yuan, Weijie
    Zhang, Xiaoqi
    Zhang, Kecheng
    Li, Zhongjie
    Li, Yonghui
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (10) : 15786 - 15791
  • [40] On the Training of DNN-based Average Voice Model for Speech Synthesis
    Yang, Shan
    Wu, Zhizheng
    Xie, Lei
    2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016