Gender Recognition Based on the Stacking of Different Acoustic Features

Cited by: 1

Authors:

Yuecesoy, Erguen [1]

Affiliation:

[1] Ordu Univ, Vocat Sch Tech Sci, TR-52200 Ordu, Turkiye

Source:

APPLIED SCIENCES-BASEL | 2024 / Vol. 14 / Issue 15

Keywords:

gender recognition; hybrid features; MFCC; KNN; LDA; CNN; MLP; machine learning; deep learning;

DOI:

10.3390/app14156564

Chinese Library Classification number:

O6 [Chemistry];

Discipline classification code:

0703;

Abstract:

A speech signal can provide various information about a speaker, such as their gender, age, accent, and emotional state. The gender of the speaker is the most salient piece of information contained in the speech signal and is used directly or indirectly in many applications. This study proposes a new approach to recognizing the gender of the speaker, based on hybrid features created by stacking different types of features. For this purpose, five different features, namely Mel frequency cepstral coefficients (MFCC), Mel scaled power spectrogram (Mel Spectrogram), Chroma, Spectral contrast (Contrast), and Tonal Centroid (Tonnetz), and twelve hybrid features created by stacking these features were used. These features were applied to four different classifiers, two based on traditional machine learning (KNN and LDA) and two based on deep learning (CNN and MLP), and the performance of each was evaluated separately. In experiments conducted on the Turkish subset of the Common Voice dataset, hybrid features created by stacking different acoustic features improved gender recognition accuracy by 0.3% to 1.73%.
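
As a rough illustration of what "stacking" features could look like in practice, the sketch below concatenates time-averaged feature vectors into one hybrid vector. It is not the author's implementation: the feature dimensions, the mean-over-time summary, the random stand-in data, and the helper names `summarize` and `stack_features` are all assumptions made for the example.

```python
import numpy as np

# Hypothetical per-feature dimensions (assumptions, not taken from the paper).
FEATURE_DIMS = {"mfcc": 40, "mel": 128, "chroma": 12, "contrast": 7, "tonnetz": 6}


def summarize(frames):
    """Collapse a (n_dims, n_frames) feature matrix to one vector by averaging over time."""
    return np.asarray(frames).mean(axis=1)


def stack_features(features, names):
    """Concatenate the time-averaged vectors of the selected features, in the given order."""
    return np.hstack([summarize(features[name]) for name in names])


rng = np.random.default_rng(0)
n_frames = 50
# Random stand-ins for features that would normally be extracted from an audio clip.
features = {name: rng.normal(size=(dim, n_frames)) for name, dim in FEATURE_DIMS.items()}

# One possible hybrid feature: MFCC + Mel Spectrogram + Chroma.
hybrid = stack_features(features, ["mfcc", "mel", "chroma"])
print(hybrid.shape)  # (180,) = 40 + 128 + 12
```

Under these assumptions, each hybrid vector could then be passed to any classifier that accepts fixed-length inputs, such as a KNN or an MLP.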


Pages: 13
