Classification of phonation types in singing voice using wavelet scattering network-based features

被引:0
|
作者
Mittapalle, Kiran Reddy [1 ]
Alku, Paavo [1 ]
机构
[1] Aalto Univ, Dept Informat & Commun Engn, FI-00076 Espoo, Finland
来源
JASA EXPRESS LETTERS | 2024年 / 4卷 / 06期
基金
芬兰科学院;
关键词
QUALITY; MODES; EXCITATION; PERCEPTION; AMPLITUDES; QUOTIENT;
D O I
10.1121/10.0026241
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The automatic classification of phonation types in singing voice is essential for tasks such as identification of singing style. In this study, it is proposed to use wavelet scattering network (WSN)-based features for classification of phonation types in singing voice. WSN, which has a close similarity with auditory physiological models, generates acoustic features that greatly characterize the information related to pitch, formants, and timbre. Hence, the WSN-based features can effectively capture the discriminative information across phonation types in singing voice. The experimental results show that the proposed WSN-based features improved phonation classification accuracy by at least 9% compared to state-of-the-art features. (c) C2024Author(s). All article content, except where otherwise noted, is licensed under a Creative Commons Attribution (CC BY) license (https://creative-commons.org/licenses/by/4.0/)
引用
收藏
页数:7
相关论文
共 50 条
  • [41] Neural network-based segmentation of textures using gabor features
    Ramakrishnan, AG
    Raja, SK
    Ram, HVR
    NEURAL NETWORKS FOR SIGNAL PROCESSING XII, PROCEEDINGS, 2002, : 365 - 374
  • [42] Network-Based Classification Using Cortical Thickness of AD Patients
    Dai, Dai
    He, Huiguang
    Vogelstein, Joshua
    Hou, Zengguang
    MACHINE LEARNING IN MEDICAL IMAGING, 2011, 7009 : 193 - +
  • [43] Network-based Classification of Authentication Attempts using Machine Learning
    Taylor, Curtis R.
    Lanson, Julian P.
    2019 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS (ICNC), 2019, : 669 - 673
  • [44] Identifying regions of non-modal phonation using features of the wavelet transform
    Kane, John
    Gobl, Christer
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 184 - 187
  • [45] Neural network-based leaf classification using machine learning
    Palanisamy, Tamilselvi
    Sadayan, Geetha
    Pathinetampadiyan, Nagasankar
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (08):
  • [46] Sparse network-based models for patient classification using fMRI
    Rosa, Maria J.
    Portugal, Liana
    Hahn, Tim
    Fallgatter, Andreas J.
    Garrido, Marta I.
    Shawe-Taylor, John
    Mourao-Miranda, Janaina
    NEUROIMAGE, 2015, 105 : 493 - 506
  • [47] Sparse network-based models for patient classification using fMRI
    Rosa, Maria J.
    Portugal, Liana
    Shawe-Taylor, John
    Mourao-Miranda, Janaina
    2013 3RD INTERNATIONAL WORKSHOP ON PATTERN RECOGNITION IN NEUROIMAGING (PRNI 2013), 2013, : 66 - 69
  • [48] Heartbeat Classification Using Convolution Neural Network and Wavelet Transform to Extract Features
    Qiu, Lishen
    Li, Wanyue
    Cai, Wenqiang
    Zhang, Miao
    Zhu, Wenliang
    Wang, Lirong
    2018 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI), 2018, : 139 - 143
  • [49] Braking Noise Classification Based on Wavelet Scattering Deep Sequential Neural Network
    Jiang T.
    Jin C.
    Li T.
    Li Y.
    Tongji Daxue Xuebao/Journal of Tongji University, 2022, 50 : 26 - 31
  • [50] LFNN: Lion fuzzy neural network-based evolutionary model for text classification using context and sense based features
    Ranjan, Nihar M.
    Prasad, Rajesh S.
    APPLIED SOFT COMPUTING, 2018, 71 : 994 - 1008