Significance of Phase-based Features for Person Recognition Using Humming

被引:0
|
作者
Sailor, Hardik B. [1 ]
Madhavi, Maulik C. [1 ]
Patil, Hemant A. [1 ]
机构
[1] DA IICT, Gandhinagar, Gujarat, India
关键词
Humming; person recognition; Modified Group Delay Function (MODGDF); polynomial classifier;
D O I
10.1145/2708463.2709035
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents use of hum of a person as a biometric cue for person recognition task. Mel Frequency Cepstral Coefficients (MFCC) is found to be state-of-the-art in the voice biometrics. However, it is magnitude-based features and ignores the phase information. This paper shows the effectiveness of phase-based information extracted via Modified Group Delay Function (MODGDF). The features developed by Mel filtering of MODGDF spectrum are called Modified Group Delay Cepstral Coefficients (MGDCC). The paper demonstrates two types of fusion strategies, viz., score-level and feature-level. The experimental results show that overall performance is improved by 3 % if a score-level fusion is employed between MFCC and MGDCC and 19.78 % by feature-level fusion in terms of % Equal Error Rate (EER). These experimental results clearly indicate that incorporating phase information along with magnitude-based features can effectively captures person-specific characteristics in humming.
引用
收藏
页码:99 / 103
页数:5
相关论文
共 50 条
  • [41] STATISTICAL NORMALISATION OF PHASE-BASED FEATURE REPRESENTATION FOR ROBUST SPEECH RECOGNITION
    Loweimi, Erfan
    Barker, Jon
    Hain, Thomas
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5310 - 5314
  • [42] Phase-based features for Motor Imagery Brain-Computer Interfaces
    Hamner, Benjamin
    Leeb, Robert
    Tavella, Michele
    Millan, Jose del R.
    2011 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2011, : 2578 - 2581
  • [43] Multitaper MFCC and normalized multitaper phase-based features for speaker verification
    Arash Mansouri
    Eduardo Castillo-Guerra
    SN Applied Sciences, 2019, 1
  • [44] Multitaper MFCC and normalized multitaper phase-based features for speaker verification
    Mansouri, Arash
    Castillo-Guerra, Eduardo
    SN APPLIED SCIENCES, 2019, 1 (04):
  • [45] Conditional Gabor phase-based disparity estimation applied to facial tracking for person-specific facial action recognition: a preliminary study
    Dahmane, Mohamed
    Cossette, Sylvie
    Meunier, Jean
    MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (17) : 7111 - 7130
  • [46] Analysis of EEG Fluctuation Patterns Using Nonlinear Phase-Based Functional Connectivity Measures for Emotion Recognition
    Kumar, Himanshu
    Ganapathy, Nagarajan
    Puthankattil, Subha D.
    Swaminathan, Ramakrishnan
    FLUCTUATION AND NOISE LETTERS, 2024, 23 (05):
  • [47] Phase-Based Neuron Training using Evolutionary Algorithms
    Stanescu, Sorin Laurentiu
    Otanocha, Omonigho Benedict
    2014 INTERNATIONAL SYMPOSIUM ON FUNDAMENTALS OF ELECTRICAL ENGINEERING (ISFEE), 2014,
  • [48] Enhancing Bug Localization Using Phase-Based Approach
    Mohsen, Amr Mansour
    Hassan, Hesham A.
    Wassif, Khaled T.
    Moawad, Ramadan
    Makady, Soha H.
    IEEE ACCESS, 2023, 11 : 35901 - 35913
  • [49] Significance of features in object recognition using depth sensors
    Harasymowicz-Boggio, Bogdan
    Chechlinski, Lukasz
    Siemiatkowska, Barbara
    OPTICA APPLICATA, 2015, 45 (04) : 559 - 571
  • [50] Key Frame Extraction Based on Multi-scale Phase-based Local Features
    Lin Honghua
    Yang Xuan
    Pei Jihong
    ICSP: 2008 9TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-5, PROCEEDINGS, 2008, : 1031 - +