Human spoofing detection performance on degraded speech

被引:3
|
作者
Terblanche, Camryn [1 ]
Harrison, Philip [2 ,3 ]
Gully, Amelia J. [2 ]
机构
[1] Univ Cape Town, Dept Speech & Language Pathol, Rondebosch, South Africa
[2] Univ York, Dept Language & Linguist Sci, York, N Yorkshire, England
[3] JP French Associates, York, N Yorkshire, England
来源
关键词
spoofing detection; degraded speech; human performance;
D O I
10.21437/Interspeech.2021-1225
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Over the past few years attention has been focused on the automatic detection of spoofing in the context of automatic speaker verification (ASV) systems. However, little is known about how well humans perform at detecting spoofed speech, particularly under degraded conditions. Using the latest synthesis technologies from ASVspoof 2019, this paper explores human judgements of speech authenticity by considering three common channel degradations - a GSM network, a VoIP network, and background noise - in conjunction with varying synthesis quality. The results reveal that channel degradation reduces the size of the perceptual difference between genuine and spoofed speech, and overall participants correctly identified human and spoofed speech only 56% of the time. In background noise and GSM transmission, lower-quality synthetic speech was judged as more human, and in VoIP transmission all speech, including genuine recordings, was judged as less human. Under all conditions, state-of-the-art synthetic speech was judged as human, or more human than, genuine recorded speech. The paper also considers the listener factors which may contribute to an individual's spoofing detection performance, and finds that a listener's familiarity with the accents involved, their age, and the audio equipment used for playback, have an effect on their spoofing detection performance.
引用
收藏
页码:1738 / 1742
页数:5
相关论文
共 50 条
  • [31] Voice Activity Detection in Degraded Speech Using Excitation Source Information
    Murty, K. Sri Rama
    Yegnanarayana, B.
    Guruprasad, S.
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 885 - +
  • [32] Anti-spoofing System: An Investigation of measures to Detect Synthetic And Human Speech
    Misra, Abhinav
    Ranjan, Shivesh
    Zhang, Chunlei
    Hansen, John H. L.
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3466 - 3470
  • [33] Performance Evaluation of Multimodal Detection Method for GNSS Intermediate Spoofing
    Li, Jing
    Zhang, Jiantong
    Chang, Shoufeng
    Zhou, Meng
    IEEE ACCESS, 2016, 4 : 9459 - 9468
  • [34] ESTIMATING THE CONFIDENCE OF SPEECH SPOOFING COUNTERMEASURE
    Wang, Xin
    Yamagishi, Junichi
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6372 - 6376
  • [35] End-to-end spoofing speech detection and knowledge distillation under noisy conditions
    Liu, Pengfei
    Zhang, Zhenchuan
    Yang, Yingchun
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [36] One-Class Neural Network With Directed Statistics Pooling for Spoofing Speech Detection
    Lin, Guoyuan
    Luo, Weiqi
    Luo, Da
    Huang, Jiwu
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 2581 - 2593
  • [37] Speech frame selection for spoofing detection with an application to partially spoofed audio-data
    Kumar, A. Kishore
    Paul, Dipjyoti
    Pal, Monisankha
    Sahidullah, Md
    Saha, Goutam
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2021, 24 (01) : 193 - 203
  • [38] Speech frame selection for spoofing detection with an application to partially spoofed audio-data
    A Kishore Kumar
    Dipjyoti Paul
    Monisankha Pal
    Md Sahidullah
    Goutam Saha
    International Journal of Speech Technology, 2021, 24 : 193 - 203
  • [39] Siamese Convolutional Neural Network Using Gaussian Probability Feature for Spoofing Speech Detection
    Lei, Zhenchun
    Yang, Yingen
    Liu, Changhong
    Ye, Jihua
    INTERSPEECH 2020, 2020, : 1116 - 1120
  • [40] Human vs Machine Spoofing Detection on Wideband and Narrowband Data
    Wester, Mirjam
    Wu, Zhizheng
    Yamagishi, Junichi
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2047 - 2051