Classification of phonation types in singing voice using wavelet scattering network-based features

被引:0
|
作者
Mittapalle, Kiran Reddy [1 ]
Alku, Paavo [1 ]
机构
[1] Aalto Univ, Dept Informat & Commun Engn, FI-00076 Espoo, Finland
来源
JASA EXPRESS LETTERS | 2024年 / 4卷 / 06期
基金
芬兰科学院;
关键词
QUALITY; MODES; EXCITATION; PERCEPTION; AMPLITUDES; QUOTIENT;
D O I
10.1121/10.0026241
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The automatic classification of phonation types in singing voice is essential for tasks such as identification of singing style. In this study, it is proposed to use wavelet scattering network (WSN)-based features for classification of phonation types in singing voice. WSN, which has a close similarity with auditory physiological models, generates acoustic features that greatly characterize the information related to pitch, formants, and timbre. Hence, the WSN-based features can effectively capture the discriminative information across phonation types in singing voice. The experimental results show that the proposed WSN-based features improved phonation classification accuracy by at least 9% compared to state-of-the-art features. (c) C2024Author(s). All article content, except where otherwise noted, is licensed under a Creative Commons Attribution (CC BY) license (https://creative-commons.org/licenses/by/4.0/)
引用
收藏
页数:7
相关论文
共 50 条
  • [21] Voice Disorders Identification Using Discrete Wavelet Based Features
    Eskidere, Omer
    Aktas, Omer
    Unal, Cevat
    2015 MEDICAL TECHNOLOGIES NATIONAL CONFERENCE (TIPTEKNO), 2015,
  • [22] ICA-FX features for classification of singing voice and instrumental sound
    Leung, TW
    Ngo, CW
    Lau, RWH
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, 2004, : 367 - 370
  • [23] An MDL-Based Wavelet Scattering Features Selection for Signal Classification
    Bruni, Vittoria
    Cardinali, Maria Lucia
    Vitulano, Domenico
    AXIOMS, 2022, 11 (08)
  • [24] Emotion in the singing voice—a deeperlook at acoustic features in the light ofautomatic classification
    Florian Eyben
    Gláucia L Salomão
    Johan Sundberg
    Klaus R Scherer
    Björn W Schuller
    EURASIP Journal on Audio, Speech, and Music Processing, 2015
  • [25] Dysphonic voice classification using wavelet packet transform and artificial neural network
    Schuck, A
    Guimaraes, LV
    Wisbeck, JO
    PROCEEDINGS OF THE 25TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-4: A NEW BEGINNING FOR HUMAN HEALTH, 2003, 25 : 2958 - 2961
  • [26] Mel-frequency cepstral coefficients derived using the zero-time windowing spectrum for classification of phonation types in singing
    Kadiri, Sudarsana Reddy
    Alku, Paavo
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2019, 146 (05): : EL418 - EL423
  • [27] An automatic neural network-based cloud detection/classification scheme using multispectral and textural features
    Shaikh, MA
    Tian, B
    AzimiSadjadi, MR
    Eis, KE
    VonderHaar, TH
    ALGORITHMS FOR MULTISPECTRAL AND HYPERSPECTRAL IMAGERY II, 1996, 2758 : 51 - 61
  • [28] Cardiac sarcoidosis classification with deep convolutional neural network-based features using polar maps
    Togo, Ren
    Hirata, Kenji
    Manabe, Osamu
    Ohira, Hiroshi
    Tsujino, Ichizo
    Magota, Keiichi
    Ogawa, Takahiro
    Haseyama, Miki
    Shiga, Tohru
    COMPUTERS IN BIOLOGY AND MEDICINE, 2019, 104 : 81 - 86
  • [29] Convolutional Neural Network for Medical Image Classification using Wavelet Features
    Khatami, Amin
    Nazari, Asef
    Beheshti, Amin
    Thanh Thi Nguyen
    Nahavandi, Saeid
    Ziebal, Jerzy
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [30] Classification Framework of the Bearing Faults of an Induction Motor Using Wavelet Scattering Transform-Based Features
    Toma, Rafia Nishat
    Gao, Yangde
    Piltan, Farzin
    Im, Kichang
    Shon, Dongkoo
    Yoon, Tae Hyun
    Yoo, Dae-Seung
    Kim, Jong-Myon
    SENSORS, 2022, 22 (22)