Classification of phonation types in singing voice using wavelet scattering network-based features

被引:0
|
作者
Mittapalle, Kiran Reddy [1 ]
Alku, Paavo [1 ]
机构
[1] Aalto Univ, Dept Informat & Commun Engn, FI-00076 Espoo, Finland
来源
JASA EXPRESS LETTERS | 2024年 / 4卷 / 06期
基金
芬兰科学院;
关键词
QUALITY; MODES; EXCITATION; PERCEPTION; AMPLITUDES; QUOTIENT;
D O I
10.1121/10.0026241
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The automatic classification of phonation types in singing voice is essential for tasks such as identification of singing style. In this study, it is proposed to use wavelet scattering network (WSN)-based features for classification of phonation types in singing voice. WSN, which has a close similarity with auditory physiological models, generates acoustic features that greatly characterize the information related to pitch, formants, and timbre. Hence, the WSN-based features can effectively capture the discriminative information across phonation types in singing voice. The experimental results show that the proposed WSN-based features improved phonation classification accuracy by at least 9% compared to state-of-the-art features. (c) C2024Author(s). All article content, except where otherwise noted, is licensed under a Creative Commons Attribution (CC BY) license (https://creative-commons.org/licenses/by/4.0/)
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Analysis and classification of phonation types in speech and singing voice
    Kadiri, Sudarsana Reddy
    Alku, Paavo
    Yegnanarayana, B.
    SPEECH COMMUNICATION, 2020, 118 : 33 - 47
  • [2] Automatic classification of neurological voice disorders using wavelet scattering features
    Yagnavajjula, Madhu Keerthana
    Mittapalle, Kiran Reddy
    Alku, Paavo
    Rao, K. Sreenivasa
    Mitra, Pabitra
    SPEECH COMMUNICATION, 2024, 157
  • [3] Classification of Phonation Modes in Classical Singing Using Modulation Power Spectral Features
    Brandner, Manuel
    Bereuter, Paul Armin
    Kadiri, Sudarsana Reddy
    Sontacchi, Alois
    IEEE ACCESS, 2023, 11 : 29149 - 29161
  • [4] Artificial neural network-based classification to screen for dysphonia using psychoacoustic scaling of acoustic voice features
    Linder, Roland
    Albers, Andreas E.
    Hess, Markus
    Poeppl, Siegfried J.
    Schoenweiler, Rainer
    JOURNAL OF VOICE, 2008, 22 (02) : 155 - 163
  • [5] Wavelet network-based detection and classification of transients
    Angrisani, L
    Daponte, P
    D'Apuzzo, M
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2001, 50 (05) : 1425 - 1435
  • [6] Sinsy: A Deep Neural Network-Based Singing Voice Synthesis System
    Hono, Yukiya
    Hashimoto, Kei
    Oura, Keiichiro
    Nankaku, Yoshihiko
    Tokuda, Keiichi
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 2803 - 2815
  • [7] Wavelet network-based classification of transients using dominant frequency signature
    Chatterjee, S.
    Chakravorti, S.
    Roy, C. K.
    Dey, D.
    ELECTRIC POWER SYSTEMS RESEARCH, 2008, 78 (01) : 21 - 29
  • [8] Neural Network Based Terrain Classification Using Wavelet Features
    Gi-Yeul Sung
    Dong-Min Kwak
    Joon Lyou
    Journal of Intelligent & Robotic Systems, 2010, 59 : 269 - 281
  • [9] Neural Network Based Terrain Classification Using Wavelet Features
    Sung, Gi-Yeul
    Kwak, Dong-Min
    Lyou, Joon
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2010, 59 (3-4) : 269 - 281
  • [10] Automatic classification of phonation modes in singing voice: towards singing style characterisation and application to ethnomusicological recordings
    Rouasi, Jean-Luc
    Ioannidis, Leonidas
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 150 - 154