Gender Recognition Based on the Stacking of Different Acoustic Features

被引:1
|
作者
Yuecesoy, Erguen [1 ]
机构
[1] Ordu Univ, Vocat Sch Tech Sci, TR-52200 Ordu, Turkiye
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 15期
关键词
gender recognition; hybrid features; MFCC; KNN; LDA; CNN; MLP; machine learning; deep learning;
D O I
10.3390/app14156564
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
A speech signal can provide various information about a speaker, such as their gender, age, accent, and emotional state. The gender of the speaker is the most salient piece of information contained in the speech signal and is directly or indirectly used in many applications. In this study, a new approach is proposed for recognizing the gender of the speaker based on the use of hybrid features created by stacking different types of features. For this purpose, four different features, namely Mel frequency cepstral coefficients (MFCC), Mel scaled power spectrogram (Mel Spectrogram), Chroma, Spectral contrast (Contrast), and Tonal Centroid (Tonnetz), and twelve hybrid features created by stacking these features were used. These features were applied to four different classifiers, two of which were based on traditional machine learning (KNN and LDA) while two were based on the deep learning approach (CNN and MLP), and the performance of each was evaluated separately. In the experiments conducted on the Turkish subset of the Common Voice dataset, it was observed that hybrid features, created by stacking different acoustic features, led to improvements in gender recognition accuracy ranging from 0.3 to 1.73%.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Interpretable features for underwater acoustic target recognition
    Jiang, Junjun
    Wu, Zhenning
    Lu, Junan
    Huang, Min
    Xiao, Zhongzhe
    MEASUREMENT, 2021, 173 (173)
  • [42] Novel acoustic features for speech emotion recognition
    Yong-Wan Roh
    Dong-Ju Kim
    Woo-Seok Lee
    Kwang-Seok Hong
    Science in China Series E: Technological Sciences, 2009, 52 : 1838 - 1848
  • [43] LATE INTEGRATION OF FEATURES FOR ACOUSTIC EMOTION RECOGNITION
    Cullen, Ailbhe
    Harte, Naomi
    2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,
  • [44] Novel acoustic features for speech emotion recognition
    Roh Yong-Wan
    Kim Dong-Ju
    Lee Woo-Seok
    Hong Kwang-Seok
    SCIENCE IN CHINA SERIES E-TECHNOLOGICAL SCIENCES, 2009, 52 (07): : 1838 - 1848
  • [45] AGE AND GENDER RECOGNITION USING EAR FEATURES
    Shahid, Ayesha
    Haider, Khurram Zeeshan
    Awais, Muhammad
    Mohi-yu-din, Burhan
    Kousar, Naila
    Nawaz, Ismat
    INTERNATIONAL JOURNAL ON INFORMATION TECHNOLOGIES AND SECURITY, 2019, 11 (01): : 33 - 40
  • [46] Features combination for gender recognition on Twitter users
    Fernandez, Daniela
    Moctezuma, Daniela
    Siordia, Oscar S.
    2016 IEEE INTERNATIONAL AUTUMN MEETING ON POWER, ELECTRONICS AND COMPUTING (ROPEC), 2016,
  • [47] Improvement of speaker recognition by combining residual and prosodic features with acoustic features
    Chen, SH
    Wang, HC
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 93 - 96
  • [48] ACOUSTIC FEATURES AND ACOUSTIC CHANGE ARE REPRESENTED BY DIFFERENT CENTRAL PATHWAYS
    KING, C
    MCGEE, T
    RUBEL, EW
    NICOL, T
    KRAUS, N
    HEARING RESEARCH, 1995, 85 (1-2) : 45 - 52
  • [49] Improving Mandarin Tone Recognition Based on DNN by Combining Acoustic and Articulatory Features Using Extended Recognition Networks
    Ju Lin
    Wei Li
    Yingming Gao
    Yanlu Xie
    Nancy F. Chen
    Sabato Marco Siniscalchi
    Jinsong Zhang
    Chin-Hui Lee
    Journal of Signal Processing Systems, 2018, 90 : 1077 - 1087
  • [50] Real-Time Underwater Acoustic Homing Weapon Target Recognition Based on a Stacking Technique of Ensemble Learning
    Deng, Jianjing
    Yang, Xiangfeng
    Liu, Liwen
    Shi, Lei
    Li, Yongsheng
    Yang, Yunchuan
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (12)