Human spoofing detection performance on degraded speech

Cited by: 3
Authors
Terblanche, Camryn [1]
Harrison, Philip [2, 3]
Gully, Amelia J. [2]
Affiliations
[1] Univ Cape Town, Dept Speech & Language Pathol, Rondebosch, South Africa
[2] Univ York, Dept Language & Linguist Sci, York, N Yorkshire, England
[3] JP French Associates, York, N Yorkshire, England
Source
Keywords
spoofing detection; degraded speech; human performance
DOI
10.21437/Interspeech.2021-1225
Chinese Library Classification (CLC) number
R36 [Pathology]; R76 [Otorhinolaryngology]
Discipline classification code
100104; 100213
Abstract
Over the past few years, attention has been focused on the automatic detection of spoofing in the context of automatic speaker verification (ASV) systems. However, little is known about how well humans perform at detecting spoofed speech, particularly under degraded conditions. Using the latest synthesis technologies from ASVspoof 2019, this paper explores human judgements of speech authenticity by considering three common channel degradations (a GSM network, a VoIP network, and background noise) in conjunction with varying synthesis quality. The results reveal that channel degradation reduces the size of the perceptual difference between genuine and spoofed speech, and that overall participants correctly identified human and spoofed speech only 56% of the time. In background noise and GSM transmission, lower-quality synthetic speech was judged as more human, and in VoIP transmission all speech, including genuine recordings, was judged as less human. Under all conditions, state-of-the-art synthetic speech was judged as human as, or more human than, genuine recorded speech. The paper also considers the listener factors which may contribute to an individual's spoofing detection performance, and finds that a listener's familiarity with the accents involved, their age, and the audio equipment used for playback all have an effect on their spoofing detection performance.
Pages: 1738-1742
Page count: 5