Human spoofing detection performance on degraded speech

被引:3
|
作者
Terblanche, Camryn [1 ]
Harrison, Philip [2 ,3 ]
Gully, Amelia J. [2 ]
机构
[1] Univ Cape Town, Dept Speech & Language Pathol, Rondebosch, South Africa
[2] Univ York, Dept Language & Linguist Sci, York, N Yorkshire, England
[3] JP French Associates, York, N Yorkshire, England
来源
关键词
spoofing detection; degraded speech; human performance;
D O I
10.21437/Interspeech.2021-1225
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Over the past few years attention has been focused on the automatic detection of spoofing in the context of automatic speaker verification (ASV) systems. However, little is known about how well humans perform at detecting spoofed speech, particularly under degraded conditions. Using the latest synthesis technologies from ASVspoof 2019, this paper explores human judgements of speech authenticity by considering three common channel degradations - a GSM network, a VoIP network, and background noise - in conjunction with varying synthesis quality. The results reveal that channel degradation reduces the size of the perceptual difference between genuine and spoofed speech, and overall participants correctly identified human and spoofed speech only 56% of the time. In background noise and GSM transmission, lower-quality synthetic speech was judged as more human, and in VoIP transmission all speech, including genuine recordings, was judged as less human. Under all conditions, state-of-the-art synthetic speech was judged as human, or more human than, genuine recorded speech. The paper also considers the listener factors which may contribute to an individual's spoofing detection performance, and finds that a listener's familiarity with the accents involved, their age, and the audio equipment used for playback, have an effect on their spoofing detection performance.
引用
收藏
页码:1738 / 1742
页数:5
相关论文
共 50 条
  • [21] GPS Spoofing Detection using Accelerometers and Performance Analysis with Probability of Detection
    Lee, Jung-Hoon
    Kwon, Keum-Cheol
    An, Dae-Sung
    Shim, Duk-Sun
    INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2015, 13 (04) : 951 - 959
  • [22] GPS spoofing detection using accelerometers and performance analysis with probability of detection
    Jung-Hoon Lee
    Keum-Cheol Kwon
    Dae-Sung An
    Duk-Sun Shim
    International Journal of Control, Automation and Systems, 2015, 13 : 951 - 959
  • [23] NEURAL MOS PREDICTION FOR SYNTHESIZED SPEECH USING MULTI-TASK LEARNING WITH SPOOFING DETECTION AND SPOOFING TYPE CLASSIFICATION
    Choi, Yeunju
    Jung, Youngmoon
    Kim, Hoirin
    2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 462 - 469
  • [24] SpoTNet: A spoofing-aware Transformer Network for Effective Synthetic Speech Detection
    Khan, Awais
    Malik, Khalid Mahmood
    PROCEEDINGS OF THE 2ND ACM INTERNATIONAL WORKSHOP ON MULTIMEDIA AI AGAINST DISCRIMINATION, MAD 2023, 2023, : 10 - 18
  • [25] Siamese Network with Wav2vec Feature for Spoofing Speech Detection
    Xie, Yang
    Zhang, Zhenchuan
    Yang, Yingchun
    INTERSPEECH 2021, 2021, : 4269 - 4273
  • [26] AFP-Conformer: Asymptotic feature pyramid conformer for spoofing speech detection
    Huang, Yida
    Shen, Qian
    Ma, Jianfen
    SPEECH COMMUNICATION, 2025, 166
  • [27] Speech Partial Spoofing Detection Using Conformer Blocks and Multiple Pooling Integration
    Liu, Haiyang
    Chen, Yanxiang
    Zheng, Shengyou
    Li, Fan
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VI, ICIC 2024, 2024, 14880 : 227 - 238
  • [28] Effect of Multi-condition Training and Speech Enhancement Methods on Spoofing Detection
    Yu, Hong
    Sarkar, Achintya
    Thomsen, Dennis Alexander Lehmann
    Tan, Zheng-Hua
    Ma, Zhanyu
    Guo, Jun
    2016 FIRST INTERNATIONAL WORKSHOP ON SENSING, PROCESSING AND LEARNING FOR INTELLIGENT MACHINES (SPLINE), 2016,
  • [29] Low-Complexity Speech Spoofing Detection using Instantaneous Spectral Features
    Sankar, M. S. Arun
    De Leon, Phillip L.
    Sandoval, Steven
    Roedig, Utz
    2022 29TH INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING (IWSSIP), 2022,
  • [30] Perception of degraded speech sounds differs in chinchilla and human listeners
    Shofner, William P.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2014, 135 (04): : 2065 - 2077