Effectiveness of Speech Demodulation-Based Features for Replay Detection

Cited by: 40
Authors
Kamble, Madhu R. [1]
Tak, Hemlata [1]
Patil, Hemant A. [1]
Affiliations
[1] DA IICT, Speech Res Lab, Gandhinagar, Gujarat, India
Keywords
Spoofing; Hilbert transform; Teager energy operator; energy separation algorithm; AUTOMATIC SPEAKER VERIFICATION; ENERGY SEPARATION; COUNTERMEASURES; FREQUENCY;
DOI
10.21437/Interspeech.2018-1675
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Replay attacks pose a serious threat to Automatic Speaker Verification (ASV) systems. Speech can be modeled as an amplitude- and frequency-modulated (AM-FM) signal. In this paper, we explore speech demodulation-based features using the Hilbert transform (HT) and the Teager Energy Operator (TEO) for replay detection. In particular, we propose HT-based Instantaneous Amplitude (IA) and Instantaneous Frequency (IF) Cosine Coefficients (i.e., HT-IACC and HT-IFCC) and Energy Separation Algorithm (ESA)-based features (i.e., ESA-IACC and ESA-IFCC). To adapt the instantaneous energy with respect to a given sampling frequency, ESA requires 3 samples, whereas HT requires a relatively large number of samples; thus, ESA gives high time resolution. The experiments were performed on the ASVspoof 2017 Challenge database for replay spoof speech detection (SSD). The experimental results show that ESA-based features gave a lower equal error rate (EER). In addition, a linearly spaced Gabor filterbank gave a lower EER than a Butterworth filterbank. To explore possible complementary information in amplitude and frequency, we used score-level fusion of IA and IF. With the HT-based feature set, the score-level fusion gave an EER of 5.24 % (dev) and 10.03 % (eval), whereas the ESA-based feature set reduced the EER to 2.01 % (dev) and 9.64 % (eval).
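The energy-separation idea mentioned in the abstract can be illustrated with a minimal sketch. It assumes the discrete TEO, Ψ[x(n)] = x(n)² − x(n−1)·x(n+1), and uses DESA-2, one common discrete ESA variant; the record does not say which variant, filterbank settings, or cosine-coefficient stage the authors used, so the function names `teager_energy` and `desa2` and the `eps` guard are illustrative assumptions, not the paper's implementation.

```python
import numpy as np

def teager_energy(x):
    """Discrete TEO: psi[x(n)] = x(n)^2 - x(n-1) * x(n+1), for n = 1..N-2."""
    x = np.asarray(x, dtype=float)
    return x[1:-1] ** 2 - x[:-2] * x[2:]

def desa2(x, eps=1e-12):
    """DESA-2 estimates of instantaneous amplitude and frequency (rad/sample).

    Assumption: DESA-2 is used here only as a representative ESA variant.
    Returns arrays of length N-4, covering samples n = 2..N-3.
    """
    x = np.asarray(x, dtype=float)
    psi_x = teager_energy(x)[1:-1]      # psi[x(n)], aligned to n = 2..N-3
    y = x[2:] - x[:-2]                  # y(n) = x(n+1) - x(n-1)
    psi_y = teager_energy(y)            # psi[y(n)], n = 2..N-3
    # eps guards against division by zero in silent or near-constant regions
    ratio = 1.0 - psi_y / (2.0 * np.maximum(psi_x, eps))
    omega = 0.5 * np.arccos(np.clip(ratio, -1.0, 1.0))
    amp = 2.0 * psi_x / np.sqrt(np.maximum(psi_y, eps))
    return amp, omega

# Usage on a synthetic tone with constant amplitude and frequency
n = np.arange(200)
tone = 0.8 * np.cos(0.3 * n + 0.1)
ia, inst_freq = desa2(tone)
print(ia.mean(), inst_freq.mean())
```

For a pure tone A·cos(Ωn + φ) with 0 < Ω < π/2, these estimates recover A and Ω from a window of only a few neighbouring samples, which is the time-resolution advantage over HT that the abstract refers to.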
Pages: 641-645
Page count: 5
Related papers
50 in total
  • [21] A Robust Method for Speech Replay Attack Detection
    Lin, Lang
    Wang, Rangding
    Yan, Diqun
    Dong, Li
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2020, 14 (01): 168-182
  • [22] Decision feedback demodulation-based adaptive linear equalisers for differentially coherent DPSK systems
    Jang, DW
    Oh, SK
    Lee, YH
    ELECTRONICS LETTERS, 1996, 32 (20): 1851-1852
  • [23] Detection of replay spoof speech using global self-attentive Teager energy features
    Chen, Ming
    Chen, Xueqin
    Shengxue Xuebao/Acta Acustica, 2024, 49 (05): 1122-1130
  • [24] Combining Phase-based Features for Replay Spoof Detection System
    Srinivas, Kantheti
    Das, Rohan Kumar
    Patil, Hemant A.
    2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018: 151-155
  • [25] Energy Separation Based Features for Replay Spoof Detection for Voice Assistant
    Prajapati, Gauri P.
    Kamble, Madhu R.
    Patil, Hemant A.
    28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021: 386-390
  • [26] Replay Speech Detection Based on Dual-Input Hierarchical Fusion Network
    Hu, Chenlei
    Zhou, Ruohua
    Yuan, Qingsheng
    APPLIED SCIENCES-BASEL, 2023, 13 (09):
  • [27] Multiscale cyclic frequency demodulation-based feature fusion framework for multi-sensor driven gearbox intelligent fault detection
    Guo, Junchao
    He, Qingbo
    Zhen, Dong
    Gu, Fengshou
    Ball, Andrew D.
    KNOWLEDGE-BASED SYSTEMS, 2024, 283
  • [28] VOICE QUALITY FEATURES FOR REPLAY ATTACK DETECTION
    Woubie, Abraham
    Backstrom, Tom
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022: 384-388
  • [29] Modulation Dynamic Features for the Detection of Replay Attacks
    Suthokumar, Gajan
    Sethu, Vidhyasaharan
    Wijenayake, Chamith
    Ambikairajah, Eliathamby
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018: 691-695
  • [30] Features and Classifiers for Replay Spoofing Attack Detection
    Hanilci, Cemal
    2017 10TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONICS ENGINEERING (ELECO), 2017: 1187-1191