Gender Recognition Based on the Stacking of Different Acoustic Features

被引：1

作者：

Yuecesoy, Erguen ^{[1
]}

机构：

[1] Ordu Univ, Vocat Sch Tech Sci, TR-52200 Ordu, Turkiye

来源：

APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 15期

关键词：

gender recognition; hybrid features; MFCC; KNN; LDA; CNN; MLP; machine learning; deep learning;

D O I：

10.3390/app14156564

中图分类号：

O6 [化学];

学科分类号：

0703 ;

摘要：

A speech signal can provide various information about a speaker, such as their gender, age, accent, and emotional state. The gender of the speaker is the most salient piece of information contained in the speech signal and is directly or indirectly used in many applications. In this study, a new approach is proposed for recognizing the gender of the speaker based on the use of hybrid features created by stacking different types of features. For this purpose, four different features, namely Mel frequency cepstral coefficients (MFCC), Mel scaled power spectrogram (Mel Spectrogram), Chroma, Spectral contrast (Contrast), and Tonal Centroid (Tonnetz), and twelve hybrid features created by stacking these features were used. These features were applied to four different classifiers, two of which were based on traditional machine learning (KNN and LDA) while two were based on the deep learning approach (CNN and MLP), and the performance of each was evaluated separately. In the experiments conducted on the Turkish subset of the Common Voice dataset, it was observed that hybrid features, created by stacking different acoustic features, led to improvements in gender recognition accuracy ranging from 0.3 to 1.73%.

引用

页数：13

共 50 条

[1] Voice Gender Recognition Using Acoustic Features, MFCCs and SVM
Abakarim, Fadwa
Abenaou, Abdenbi
COMPUTATIONAL SCIENCE AND ITS APPLICATIONS, ICCSA 2022, PT I, 2022, 13375 : 634 - 648
[2] Gender Recognition Based on Face Geometric Features
Xiao, Jie
Guo, Zhaoli
Cai, Chao
MIPPR 2013: PATTERN RECOGNITION AND COMPUTER VISION, 2013, 8919
[3] Gradient-Based Acoustic Features for Speech Recognition
Muroi, Takashi
Takashima, Ryoichi
Takiguchi, Tetsuya
Ariki, Yasuo
2009 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ISPACS 2009), 2009, : 445 - 448
[4] DNN-BASED EMOTION RECOGNITION BASED ON BOTTLENECK ACOUSTIC FEATURES AND LEXICAL FEATURES
Kim, Eesung
Shin, Jong Won
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6720 - 6724
[5] Performance Evaluation of Bangla Word Recognition Using Different Acoustic Features
Lisa, Nusrat Jahan
Eity, Qamrun Nahar
Muhammad, Ghulam
Huda, Mohammad Nurul
Rahman, Chowdhury Mofizur
INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2010, 10 (09): : 96 - 100
[6] Crack Pattern Recognition Based on Acoustic Emission Waveform Features
Jingjing Dai
Jianfeng Liu
Lulin Zhou
Xin He
Rock Mechanics and Rock Engineering, 2023, 56 : 1063 - 1076
[7] Crack Pattern Recognition Based on Acoustic Emission Waveform Features
Dai, Jingjing
Liu, Jianfeng
Zhou, Lulin
He, Xin
ROCK MECHANICS AND ROCK ENGINEERING, 2023, 56 (02) : 1063 - 1076
[8] Speech recognition based on a combination of acoustic features with articulatory information
LU Xugang DANG Jianwu (Japan Advanced Institute of Science and Technology
ChineseJournalofAcoustics, 2005, (03) : 271 - 279
[9] Excavation Equipment Recognition Based on Novel Acoustic Statistical Features
Cao, Jiuwen
Wang, Wei
Wang, Jianzhong
Wang, Ruirong
IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (12) : 4392 - 4404
[10] Facial Features for Gender Recognition
Liao Guangjun
Chen Wei
Wu Yaoxin
PROCEEDINGS OF THE 35TH CHINESE CONTROL CONFERENCE 2016, 2016, : 4161 - 4165

← 1 2 3 4 5 →