AUC OPTIMIZATION FOR DEEP LEARNING BASED VOICE ACTIVITY DETECTION

被引:0
|
作者
Fan, Zi-Chen [1 ]
Bai, Zhongxin
Zhang, Xiao-Lei
Rahardja, Susanto
Chen, Jingdong
机构
[1] Northwestern Polytech Univ, Ctr Intelligent Acoust & Immers Commun, Xian, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
AUC; deep neural networks; voice activity detection;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Voice activity detection (VAD) based on deep neural networks (DNN) has demonstrated good performance in adverse acoustic environments. Current DNN based VAD optimizes a surrogate function, e.g. minimum cross-entropy or minimum squared error, at a given decision threshold. However, VAD usually works on-the-fly with a dynamic decision threshold; and ROC curve is a global evaluation metric of VAD that reflects the performance of VAD at all possible decision thresholds. In this paper, we propose to optimize the area under ROC curve (AUC) by DNN, which can maximize the performance of VAD in terms of the ROC curve. Experimental results show that optimizing AUC by DNN results in higher performance than the common method of optimizing the minimum squared error by DNN.
引用
收藏
页码:6760 / 6764
页数:5
相关论文
共 50 条
  • [31] Sub-voice Detection and Recognition based on Hybrid Audio Segmentation and Deep Learning
    Zhao, Xiaolei
    Wang, Chenyin
    Xu, Xibin
    PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON ROBOTICS, INTELLIGENT CONTROL AND ARTIFICIAL INTELLIGENCE (RICAI 2019), 2019, : 143 - 147
  • [32] Comparative study of singing voice detection based on deep neural networks and ensemble learning
    You, Shingchern D.
    Liu, Chien-Hung
    Chen, Woei-Kae
    HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES, 2018, 8
  • [33] Adaptive voice activity detection for wireless communications based on hybrid fuzzy learning
    Beritelli, F
    Casale, S
    Cavallaro, A
    GLOBECOM 98: IEEE GLOBECOM 1998 - CONFERENCE RECORD, VOLS 1-6: THE BRIDGE TO GLOBAL INTEGRATION, 1998, : 1729 - 1734
  • [34] A Study on the Optimization of the Coil Defect Detection Model Based on Deep Learning
    Noh, Chun-Myoung
    Jang, Jun-Gyo
    Kim, Sung-Soo
    Lee, Soon-Sup
    Shin, Sung-Chul
    Lee, Jae-Chul
    APPLIED SCIENCES-BASEL, 2023, 13 (08):
  • [35] Optimization and efficiency analysis of deep learning based brain tumor detection
    Saeed, Maryam
    Halepoto, Irfan Ahmed
    Khaskheli, Sania
    Bushra, Mehak
    MEHRAN UNIVERSITY RESEARCH JOURNAL OF ENGINEERING AND TECHNOLOGY, 2023, 42 (02) : 188 - 196
  • [36] The Optimization of Face Detection Technology Based on Neural Network and Deep Learning
    Zhao, Jian
    INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGIES AND SYSTEMS APPROACH, 2023, 16 (03)
  • [37] Semi-supervised AUC optimization based on positive-unlabeled learning
    Sakai, Tomoya
    Niu, Gang
    Sugiyama, Masashi
    MACHINE LEARNING, 2018, 107 (04) : 767 - 794
  • [38] A Comparison of Boosted Deep Neural Networks for Voice Activity Detection
    Krishnakumar, Harshit
    Williamson, Donald S.
    2019 7TH IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (IEEE GLOBALSIP), 2019,
  • [39] Joint learning for voice based disease detection
    Wu, Kebin
    Zhang, David
    Lu, Guangming
    Guo, Zhenhua
    PATTERN RECOGNITION, 2019, 87 : 130 - 139
  • [40] PARTIAL AUC OPTIMIZATION BASED DEEP SPEAKER EMBEDDINGS WITH CLASS-CENTER LEARNING FOR TEXT-INDEPENDENT SPEAKER VERIFICATION
    Bai, Zhongxin
    Zhang, Xiao-Lei
    Chen, Jingdong
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6819 - 6823