Evaluating MFCC-based speaker identification systems with data envelopment analysis

被引:6
|
作者
Ozcan, Zubeyir [1 ]
Kayikcioglu, Temel [1 ]
机构
[1] Karadeniz Tech Univ, Fac Engn, Dept Elect & Elect Engn, Trabzon, Turkey
关键词
Speaker recognition evaluation; Multi-criteria decision making; Data envelopment analysis; Speaker identification; MFCC features; MCDM; EFFICIENCY; CLASSIFICATION; RECOGNITION; SELECTION; MACHINES; RANKING;
D O I
10.1016/j.eswa.2020.114448
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The concept of the efficiency of speaker recognition systems varies in the literature. Although many authors have defined efficiency as recognition accuracy, others have defined it as low energy consumption, memory storage, or computational burden. In our study, for a novel approach, speaker recognition was evaluated following a multi-criteria decision-making approach in two stages. First, speaker identification based on Melfrequency cepstrum coefficients (MFCC) was conducted for various parameters and methods, including number of speakers, number of MFCCs, test speech duration, training utterance length and the various classifiers. Classification metrics, memory storage, testing, and training time of the trials were measured as well, and the performance of the trials was examined for each criterion. Verifying the literature, the study revealed that no parameters or methods achieved the best performance for all criteria. In the second stage, a multi-criteria efficiency analysis, as suggested in the literature, was conducted according to various application scenarios. By using data envelopment analysis, the efficiency of trials according to the scenarios was determined. After ranking the efficiency scores, it was revealed that the best solution was task-dependent. From the perspective of classifiers, artificial neural networks outperformed the others considering benefits to cost; however, some of their costs were high, whereas the other classifiers provided the best solutions in light of cost criteria. Last, the number of MFCCs was the least effective parameter for efficiency. Altogether, the findings indicate that the efficiency of a speaker identification system cannot be defined as recognition accuracy, memory storage, testing time or training time but as a function of those criteria.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] An MFCC-based Speaker Identification System
    Leu, Fang-Yie
    Lin, Guan-Liang
    2017 IEEE 31ST INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS (AINA), 2017, : 1055 - 1062
  • [2] Improved MFCC-Based Feature for Robust Speaker Identification
    吴尊敬
    曹志刚
    Tsinghua Science and Technology, 2005, (02) : 158 - 161
  • [3] An MFCC-based text-independent speaker identification system for access control
    Liu, Jung-Chun
    Leu, Fang-Yie
    Lin, Guan-Liang
    Susanto, Heru
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2018, 30 (02):
  • [4] Hardware Implementation of MFCC-Based Feature Extraction for Speaker Recognition
    Ehkan, P.
    Zakaria, F. F.
    Warip, M. N. M.
    Sauli, Z.
    Elshaikh, M.
    ADVANCED COMPUTER AND COMMUNICATION ENGINEERING TECHNOLOGY, 2015, 315 : 471 - 480
  • [5] Accuracy of MFCC-Based Speaker Recognition in Series 60 Device
    Juhani Saastamoinen
    Evgeny Karpov
    Ville Hautamäki
    Pasi Fränti
    EURASIP Journal on Advances in Signal Processing, 2005
  • [6] Accuracy of MFCC-based speaker recognition in Series 60 device
    Saastamoinen, J
    Karpov, E
    Hautamäki, V
    Fränti, P
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2005, 2005 (17) : 2816 - 2827
  • [7] An MFCC-based Secure Framework for Voice Assistant Systems
    Ahmed, Syed Fahad
    Jaffari, Rabeea
    Ahmed, Syed Saad
    Jawaid, Moazzam
    Talpur, Shahnawaz
    2022 INTERNATIONAL CONFERENCE ON CYBER WARFARE AND SECURITY (ICCWS), 2022, : 57 - 61
  • [8] MFCC and Similarity Measurements for Speaker Identification Systems
    Maazouzi, A.
    Aqili, N.
    Aamoud, A.
    Raji, M.
    Hammouch, A.
    PROCEEDINGS OF 2017 INTERNATIONAL CONFERENCE ON ELECTRICAL AND INFORMATION TECHNOLOGIES (ICEIT 2017), 2017,
  • [9] MFCC-based perceptual hashing for compressed domain of speech content identification
    Zhang, Qiu-Yu
    Liu, Yang-Wei
    Di, Yan-Jun
    Zhang, Qian-Yun
    Xing, Peng-Fei
    Journal of Chemical and Pharmaceutical Research, 2014, 6 (07) : 379 - 386
  • [10] Speaker identification based on combination of MFCC and UMRT based features
    Antony, Anett
    Gopikakumari, R.
    8TH INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING & COMMUNICATIONS (ICACC-2018), 2018, 143 : 250 - 257