Gender Recognition Based on the Stacking of Different Acoustic Features

被引:1
|
作者
Yuecesoy, Erguen [1 ]
机构
[1] Ordu Univ, Vocat Sch Tech Sci, TR-52200 Ordu, Turkiye
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 15期
关键词
gender recognition; hybrid features; MFCC; KNN; LDA; CNN; MLP; machine learning; deep learning;
D O I
10.3390/app14156564
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
A speech signal can provide various information about a speaker, such as their gender, age, accent, and emotional state. The gender of the speaker is the most salient piece of information contained in the speech signal and is directly or indirectly used in many applications. In this study, a new approach is proposed for recognizing the gender of the speaker based on the use of hybrid features created by stacking different types of features. For this purpose, four different features, namely Mel frequency cepstral coefficients (MFCC), Mel scaled power spectrogram (Mel Spectrogram), Chroma, Spectral contrast (Contrast), and Tonal Centroid (Tonnetz), and twelve hybrid features created by stacking these features were used. These features were applied to four different classifiers, two of which were based on traditional machine learning (KNN and LDA) while two were based on the deep learning approach (CNN and MLP), and the performance of each was evaluated separately. In the experiments conducted on the Turkish subset of the Common Voice dataset, it was observed that hybrid features, created by stacking different acoustic features, led to improvements in gender recognition accuracy ranging from 0.3 to 1.73%.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Voice Gender Recognition Using Acoustic Features, MFCCs and SVM
    Abakarim, Fadwa
    Abenaou, Abdenbi
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS, ICCSA 2022, PT I, 2022, 13375 : 634 - 648
  • [2] Gender Recognition Based on Face Geometric Features
    Xiao, Jie
    Guo, Zhaoli
    Cai, Chao
    MIPPR 2013: PATTERN RECOGNITION AND COMPUTER VISION, 2013, 8919
  • [3] Gradient-Based Acoustic Features for Speech Recognition
    Muroi, Takashi
    Takashima, Ryoichi
    Takiguchi, Tetsuya
    Ariki, Yasuo
    2009 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ISPACS 2009), 2009, : 445 - 448
  • [4] DNN-BASED EMOTION RECOGNITION BASED ON BOTTLENECK ACOUSTIC FEATURES AND LEXICAL FEATURES
    Kim, Eesung
    Shin, Jong Won
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6720 - 6724
  • [5] Performance Evaluation of Bangla Word Recognition Using Different Acoustic Features
    Lisa, Nusrat Jahan
    Eity, Qamrun Nahar
    Muhammad, Ghulam
    Huda, Mohammad Nurul
    Rahman, Chowdhury Mofizur
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2010, 10 (09): : 96 - 100
  • [6] Crack Pattern Recognition Based on Acoustic Emission Waveform Features
    Jingjing Dai
    Jianfeng Liu
    Lulin Zhou
    Xin He
    Rock Mechanics and Rock Engineering, 2023, 56 : 1063 - 1076
  • [7] Crack Pattern Recognition Based on Acoustic Emission Waveform Features
    Dai, Jingjing
    Liu, Jianfeng
    Zhou, Lulin
    He, Xin
    ROCK MECHANICS AND ROCK ENGINEERING, 2023, 56 (02) : 1063 - 1076
  • [8] Speech recognition based on a combination of acoustic features with articulatory information
    LU Xugang DANG Jianwu (Japan Advanced Institute of Science and Technology
    ChineseJournalofAcoustics, 2005, (03) : 271 - 279
  • [9] Excavation Equipment Recognition Based on Novel Acoustic Statistical Features
    Cao, Jiuwen
    Wang, Wei
    Wang, Jianzhong
    Wang, Ruirong
    IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (12) : 4392 - 4404
  • [10] Facial Features for Gender Recognition
    Liao Guangjun
    Chen Wei
    Wu Yaoxin
    PROCEEDINGS OF THE 35TH CHINESE CONTROL CONFERENCE 2016, 2016, : 4161 - 4165