Spectral entropy and spectral shape based pre-quantization for real time speaker identification system

被引:2
|
作者
Sarkar G. [1 ]
Saha G. [1 ]
机构
[1] Department of Electronics and Electrical Communication Engineering, IIT Kharagpur
关键词
Kurtosis; Pre-quantization; Speaker identification; Spectral entropy;
D O I
10.1007/s10772-010-9079-8
中图分类号
学科分类号
摘要
Pre-processing is one of the vital steps for developing robust and efficient recognition system. Better preprocessing not only aid in better data selection but also in significant reduction of computational complexity. Further an efficient frame selection technique can improve the overall performance of the system. Pre-quantization (PQ) is the technique of selecting less number of frames in the pre-processing stage to reduce the computational burden in the post processing stages of speaker identification (SI). In this paper, we develop PQ techniques based on spectral entropy and spectral shape to pick suitable frames containing speaker specific information that varies from frame to frame depending on spoken text and environmental conditions. The attempt is to exploit the statistical properties of distributions of speech frames at the pre-processing stage of speaker recognition. Our aim is not only to reduce the frame rate but also to maintain identification accuracy reasonably high. Further we have also analyzed the robustness of our proposed techniques on noisy utterances. To establish the efficacy of our proposed methods, we used two different databases, POLYCOST (telephone speech) and YOHO (microphone speech). © Springer Science+Business Media, LLC 2010.
引用
收藏
页码:189 / 199
页数:10
相关论文
共 50 条
  • [31] Time-varying Spectral Entropy Based Analysis of Impulse Noises
    Singh, Neelima
    Lall, Brejesh
    2019 IEEE 30TH ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS (PIMRC), 2019, : 1534 - 1539
  • [32] Robust speaker identification system based on multilayer eigen-codebook vector quantization
    Hsieh, CT
    Lai, E
    Chen, WC
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2004, E87D (05): : 1185 - 1193
  • [33] Robust speaker identification system based on multilayer eigen-codebook vector quantization
    Hsieh, Ching-Tang
    Lai, Eugene
    Chen, Wan-Chen
    IEICE Transactions on Information and Systems, 2004, E87-D (05) : 1185 - 1193
  • [34] A real time spectral subtraction based speech enhancement scheme
    Flogeras, D
    Doraiswami, R
    Kaye, ME
    CCECE 2003: CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-3, PROCEEDINGS: TOWARD A CARING AND HUMANE TECHNOLOGY, 2003, : 1071 - 1074
  • [35] Design and Implementation of a Real-Time Speaker Identification System with Improved GMM
    Jiang, Ye
    Tang, Zhen-min
    PROCEEDINGS OF THE 2009 CHINESE CONFERENCE ON PATTERN RECOGNITION AND THE FIRST CJK JOINT WORKSHOP ON PATTERN RECOGNITION, VOLS 1 AND 2, 2009, : 603 - 607
  • [36] A Novel Approach To Enhance The Efficiency Of Real-Time Speaker Identification System
    Priyadarshini, Subhashree
    Sarangi, Susanta Kumar
    Bhuyan, K. C.
    2018 INTERNATIONAL CONFERENCE ON RECENT INNOVATIONS IN ELECTRICAL, ELECTRONICS & COMMUNICATION ENGINEERING (ICRIEECE 2018), 2018, : 52 - 54
  • [37] Improved Text-independent Speaker Identification System For Real Time Applications
    AboElenein, Nagwa M.
    Amin, Khalid M.
    Ibrahim, Mina.
    Hadhoud, Mohiy M.
    2016 FOURTH INTERNATIONAL JAPAN-EGYPT CONFERENCE ON ELECTRONICS, COMMUNICATIONS AND COMPUTERS (JEC-ECC), 2016, : 58 - 62
  • [38] Rolling Bearing Degradation State Identification Based on LCD Relative Spectral Entropy
    Yu H.
    Li H.
    Xu B.
    Journal of Failure Analysis and Prevention, 2016, 16 (4) : 655 - 666
  • [39] Real-time cable force identification based on block recursive Capon spectral estimation method
    Yu, Xuewen
    Dan, Danhui
    MEASUREMENT, 2023, 213
  • [40] Spectral calibration of the real-time data gathering and spectrum rebuilding system based on FPGA
    Zhang Ning
    Zhang Lijun
    Liu Xiaohua
    Cao Weiliang
    2011 INTERNATIONAL CONFERENCE ON OPTICAL INSTRUMENTS AND TECHNOLOGY: OPTOELECTRONIC IMAGING AND PROCESSING TECHNOLOGY, 2011, 8200