SPEECH OVERLAP DETECTION USING CONVOLUTIVE NON-NEGATIVE SPARSE CODING: NEW IMPROVEMENTS AND INSIGHTS

被引:0
|
作者
Geiger, Juergen T. [1 ]
Vipperla, Ravichander [2 ]
Evans, Nicholas [2 ]
Schuller, Bjoern [1 ]
Rigoll, Gerhard [1 ]
机构
[1] Tech Univ Munich, Inst Human Machine Commun, D-8000 Munich, Germany
[2] EURECOM, Multimedia Commun Dept, Sophia Antipolis, France
关键词
speech overlap detection; convolutive non-negative sparse coding; speaker diarization;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents recent advances in the application of convolutive non-negative sparse coding (CNSC) to the problem of overlap detection in the context of conference meetings and speaker diarization. CNSC is used to project a mixed speaker signal onto separate speaker bases and hence to detect intervals of competing speech. We present new energy ratio and total energy features which give signicant improvements over our previous work. The system is assessed using a subset of the AMI meeting corpus. We report results which are comparable to the state of the art which support the potential of a new approach to overlap detection. An analysis of system performance highlights the importance of further work to addresses weaknesses in detecting particularly short segments of overlapping speech.
引用
收藏
页码:340 / 344
页数:5
相关论文
共 50 条
  • [32] Non-negative Kernel Sparse Coding for the Analysis of Motion Data
    Hosseini, Babak
    Huelsmann, Felix
    Botsch, Mario
    Hammer, Barbara
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2016, PT II, 2016, 9887 : 506 - 514
  • [33] Dispersion Constraint Based Non-negative Sparse Coding Model
    Wang, Xin
    Wang, Can
    Shang, Li
    Sun, Zhan-Li
    NEURAL PROCESSING LETTERS, 2016, 43 (02) : 603 - 609
  • [34] ISAR target recognition based on non-negative sparse coding
    Tang, Ning
    Gao, Xunzhang
    Li, Xiang
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2012, 23 (06) : 849 - 857
  • [35] Dispersion constraint based non-negative sparse coding algorithm
    Shang, Li
    Wang, Xin
    Sun, Zhan-Li
    NEUROCOMPUTING, 2016, 188 : 253 - 261
  • [36] Dispersion Constraint Based Non-negative Sparse Coding Model
    Xin Wang
    Can Wang
    Li Shang
    Zhan-Li Sun
    Neural Processing Letters, 2016, 43 : 603 - 609
  • [37] Improvements in sparse non-negative matrix factorization for hyperspectral unmixing algorithms
    Zhang, Zuoyu
    Liao, Shouyi
    Zhang, Hexin
    Wang, Shicheng
    Hua, Chao
    JOURNAL OF APPLIED REMOTE SENSING, 2018, 12 (04):
  • [38] Local Region Partitioning for Disguised Face Recognition Using Non-negative Sparse Coding
    Khoa Dang Dang
    Thai Hoang Le
    ADVANCED METHODS FOR COMPUTATIONAL COLLECTIVE INTELLIGENCE, 2013, 457 : 197 - 206
  • [39] Non-Negative Sparse Coding based Scalable Access Control using Fingertip ECG
    Raj, Peter Sam
    Sonowal, Sukanya
    Hatzinakos, Dimitrios
    2014 IEEE/IAPR INTERNATIONAL JOINT CONFERENCE ON BIOMETRICS (IJCB 2014), 2014,
  • [40] Sparse image coding using a 3D non-negative tensor factorization
    Hazan, T
    Polak, S
    Shashua, A
    TENTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 50 - 57