Studying Human-Based Speaker Diarization and Comparing to State-of-the-Art Systems

被引:0
|
作者
McKnight, Simon W. [1 ]
Hogg, Aidan O. T. [1 ]
Neo, Vincent W. [1 ]
Naylor, Patrick A. [1 ]
机构
[1] Imperial Coll London, Dept Elect & Elect Engn, London, England
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human-based speaker diarization experiments were carried out on a five-minute extract of a typical AMI corpus meeting to see how much variance there is in human reviews based on hearing only and to compare with state-of-the-art diarization systems on the same extract. There are three distinct experiments: (a) one with no prior information; (b) one with the ground truth speech activity detection (GT-SAD); and (c) one with the blank ground truth labels (GT-labels). The results show that most human reviews tend to be quite similar, albeit with some outliers, but the choice of GT-labels can make a dramatic difference to scored performance. Using the GT-SAD provides a big advantage and improves human review scores substantially, though small differences in the GT-SAD used can have a dramatic effect on results. The use of forgiveness collars is shown to be unhelpful. The results show that state-of-the-art systems can outperform the best human reviews when no prior information is provided. However, the best human reviews still outperform state-of-the-art systems when starting from the GT-SAD.
引用
收藏
页码:394 / 401
页数:8
相关论文
共 50 条
  • [1] On Complementarity of State-of-the-art Speaker Recognition Systems
    Machlica, Lukas
    Zajic, Zbynek
    Mueller, Ludek
    2012 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2012, : 164 - 169
  • [2] Workforce Learning Curves for Human-Based Assembly Operations: A State-of-the-Art Review
    Pena, Carlos
    Romero, David
    Noguez, Julieta
    APPLIED SCIENCES-BASEL, 2022, 12 (19):
  • [3] State-of-the-art in speaker recognition
    Faundez-Zanuy, M
    Monte-Moreno, E
    IEEE AEROSPACE AND ELECTRONIC SYSTEMS MAGAZINE, 2005, 20 (05) : 7 - 12
  • [4] Comparing state-of-the-art collaborative filtering systems
    Candillier, Laurent
    Meyer, Frank
    Boulle, Marc
    MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, PROCEEDINGS, 2007, 4571 : 548 - +
  • [5] An Open-source State-of-the-art Toolbox for Broadcast News Diarization
    Rouvier, Mickael
    Dupuy, Gregor
    Gay, Paul
    Khoury, Elie
    Merlin, Teva
    Meignier, Sylvain
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1476 - 1480
  • [6] STATE-OF-THE-ART SEQUENCE KERNELS FOR SVM SPEAKER VERIFICATION
    Louradour, Jerome
    Daoudi, Khalid
    2008 IEEE WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING, 2008, : 498 - +
  • [7] A new architecture based VAD for speaker diarization/detection systems
    Kenai, Ouassila
    Ouamour, Siham
    Guerti, Mhania
    Asbai, Nassim
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2019, 22 (03) : 827 - 840
  • [8] A new architecture based VAD for speaker diarization/detection systems
    Ouassila Kenai
    Siham Ouamour
    Mhania Guerti
    Nassim Asbai
    International Journal of Speech Technology, 2019, 22 : 827 - 840
  • [9] STATE-OF-THE-ART OF AGROFORESTRY SYSTEMS
    NAIR, PKR
    FOREST ECOLOGY AND MANAGEMENT, 1991, 45 (1-4) : 5 - 29
  • [10] STATE-OF-THE-ART IN SAFETY SYSTEMS
    TINHAM, B
    CONTROL AND INSTRUMENTATION, 1987, 19 (02): : 47 - &