Effectiveness of Speech Demodulation-Based Features for Replay Detection

被引:40
|
作者
Kamble, Madhu R. [1 ]
Tak, Hemlata [1 ]
Patil, Hemant A. [1 ]
机构
[1] DA IICT, Speech Res Lab, Gandhinagar, Gujarat, India
关键词
Spoofing; Hilbert transform; Teager energy operator; energy separation algorithm; AUTOMATIC SPEAKER VERIFICATION; ENERGY SEPARATION; COUNTERMEASURES; FREQUENCY;
D O I
10.21437/Interspeech.2018-1675
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Replay attack presents a great threat to Automatic Speaker Verification (ASV) system. The speech can be modeled as amplitude and frequency modulated (AM-FM) signals. In this paper, we explore speech demodulation-based features using Hilbert transform (HT) and Teager Energy Operator (TEO) for replay detection. In particular, we propose features, namely, FIT-based Instantaneous Amplitude (IA) and Instantaneous Frequency (IF) Cosine Coefficients (i.e., HT-IACC and HT-IFCC) and Energy Separation Algorithm (ESA)-based features (i.e., ESA-IACC and ESA-IFCC). For adapting instantaneous energy w.r.t given sampling frequency, ESA requires 3 samples whereas FIT requires relatively large number of samples and thus, ESA gives high time resolution.The experiments were performed on ASV spoof 2017 Challenge database for replay spoof speech detection (SSD).The experimental results shows that ESA-based features gave lower EER. In addition, linearly spaced Gabor filterbank gave lower EER than Butterworth filterbank. To explore possible complementary information using amplitude and frequency, we have used score-level fusion of IA and IF. With HT-based feature set, the score-level fusion gave EER of 5.24 % (dev) and 10.03 % (eval), whereas ESA-based feature set reduced the EER to 2.01 % (dev) and 9.64 % (eval).
引用
收藏
页码:641 / 645
页数:5
相关论文
共 50 条
  • [1] Speech Demodulation-based Techniques for Replay and Presentation Attack Detection
    Kamble, Madhu R.
    Sai, Pulikonda Aditya Krishna
    Krishna, Maddala V. Siva
    Patil, Ankur T.
    Acharya, Rajul
    Patil, Hemant A.
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1545 - 1550
  • [2] Novel Demodulation-Based Features using Classifier-level Fusion of GMM and CNN for Replay Detection
    Kamble, Madhu R.
    Tak, Hemlata
    Krishna, Maddala V. Siva
    Patil, Hemant A.
    2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 334 - 338
  • [3] Amplitude and Frequency Modulation-based features for detection of replay Spoof Speech
    Kamble, Madhu R.
    Tak, Hemlata
    Patil, Hemant A.
    SPEECH COMMUNICATION, 2020, 125 : 114 - 127
  • [4] A signal demodulation-based method for the early detection of Cheyne-Stokes respiration
    Guyot, Pauline
    Djermoune, El-Hadi
    Chenuel, Bruno
    Bastogne, Thierry
    PLOS ONE, 2020, 15 (03):
  • [5] Device Features Based on Linear Transformation With Parallel Training Data for Replay Speech Detection
    Xu, Longting
    Yang, Jichen
    You, Chang Huai
    Qian, Xinyuan
    Huang, Daiyu
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 1574 - 1586
  • [6] Amplitude Demodulation-based EM Analysis of different RSA implementations
    Perin, Guilherme
    Torres, Lionel
    Benoit, Pascal
    Maurine, Philippe
    DESIGN, AUTOMATION & TEST IN EUROPE (DATE 2012), 2012, : 1167 - 1172
  • [7] A dual-wavelength demodulation-based sensor for magnetic fields
    Zuo, Yan
    Li, Can
    Zhao, Yi
    Zhang, Yating
    Xia, Li
    SENSORS AND ACTUATORS A-PHYSICAL, 2024, 366
  • [8] An Analysis of Modified Demodulation-Based Grid Voltage Parameter Estimator
    Golestan, Saeed
    Guerrero, Josep M.
    IEEE TRANSACTIONS ON POWER ELECTRONICS, 2015, 30 (12) : 6528 - 6533
  • [9] Speech Replay Detection with x-Vector Attack Embeddings and Spectral Features
    Williams, Jennifer
    Rownicka, Joanna
    INTERSPEECH 2019, 2019, : 1053 - 1057
  • [10] A multi-branch ResNet with discriminative features for detection of replay speech signals
    Cheng, Xingliang
    Xu, Mingxing
    Zheng, Thomas Fang
    APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2020, 9