Timbre-Based Portable Musical Instrument Recognition Using LVQ Learning Algorithm

被引:1
|
作者
Sun, Yizhen [1 ]
机构
[1] Zhengzhou SIAS Univ, Sch Mus & Drama, Zhengzhou, Peoples R China
来源
MOBILE NETWORKS & APPLICATIONS | 2023年 / 28卷 / 06期
关键词
Portable musical instrument recognition; Feature extraction; LVQ neural network learning algorithm; Sensor; Internet of Things;
D O I
10.1007/s11036-023-02174-y
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
With the advent of deep learning algorithms, the field of portable musical instrument recognition, i.e., musical recognition using mobile devices, has experienced substantial progress. Manual labeling, which is time-consuming, labor-intensive, and error-prone, has historically been used to classify instruments. Recent research, however, has concentrated on automating the classification process through the extraction of music properties. Nonetheless, due to the complicated interplay between the fundamental wave and harmonics in music, identifying important audio information remains difficult. This article describes the underlying ideas and implementation approach of portable musical instrument identification based on acoustic characteristics in detail. This paper proposes utilizing the Learning Vector Quantization (LVQ) neural network learning technique to extract acoustic components from music sources using the Short-Time Fourier Transform (STFT). In addition, this paper uses a feature selection strategy to pick the most informative features, lowering the dimensionality of the classifier's feature vector and improving training and recognition efficiency. The weighted recognition accuracy is 79.8% when all characteristics are picked, according to the experimental results. However, by decreasing the number of feature dimensions to 24, the system obtains its greatest weighted recognition rate of 81.2%, outperforming the performance with all features enabled by 1.3%. This illustrates how feature dimensionality reduction may increase recognition performance. However, decreasing the feature dimensions beyond 24 resulted in worse recognition accuracy, demonstrating the existence of an ideal feature dimensionality for each portable musical instrument category. A feature vector with 24 dimensions produces the greatest results for piano recognition, whereas a vector with 20 dimensions offers the maximum accuracy for cello recognition. These findings highlight the significance of feature selection in obtaining high accuracy rates for certain instrument types.
引用
收藏
页码:2171 / 2181
页数:11
相关论文
共 50 条
  • [1] Fast recognition of musical sounds based on timbre
    Agus, Trevor R.
    Suied, Clara
    Thorpe, Simon J.
    Pressnitzer, Daniel
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 131 (05): : 4124 - 4133
  • [2] Recognition of Musical Instrument Using Deep Learning Techniques
    Rajesh, Sangeetha
    Nalini, N. J.
    INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2021, 11 (04) : 41 - 60
  • [3] Learning metrics on spectrotemporal modulations reveals the perception of musical instrument timbre
    Thoret, Etienne
    Caramiaux, Baptiste
    Depalle, Philippe
    McAdams, Stephen
    NATURE HUMAN BEHAVIOUR, 2021, 5 (03) : 369 - +
  • [4] Learning metrics on spectrotemporal modulations reveals the perception of musical instrument timbre
    Etienne Thoret
    Baptiste Caramiaux
    Philippe Depalle
    Stephen McAdams
    Nature Human Behaviour, 2021, 5 : 369 - 377
  • [5] PITCH-TIMBRE DISENTANGLEMENT OF MUSICAL INSTRUMENT SOUNDS BASED ON VAE-BASED METRIC LEARNING
    Tanaka, Keitaro
    Nishikimi, Ryo
    Bando, Yoshiaki
    Yoshii, Kazuyoshi
    Morishima, Shigeo
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 111 - 115
  • [6] Timbre Model of Software Musical Instrument Based on Sine Interpolation
    Cao, Xi-zheng
    Meng, Hui-li
    Xu, Jui-cheng
    PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON IMAGE ANALYSIS AND SIGNAL PROCESSING, 2009, : 358 - 361
  • [7] Fractional Fourier Transform Based Features for Musical Instrument Recognition Using Machine Learning Techniques
    Bhalke, D. G.
    Rao, C. B. Rama
    Bormane, D. S.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON FRONTIERS OF INTELLIGENT COMPUTING: THEORY AND APPLICATIONS (FICTA) 2013, 2014, 247 : 155 - 163
  • [8] Hyperbolic Timbre Embedding for Musical Instrument Sound Synthesis Based on Variational Autoencoders
    Nakashima, Futa
    Nakamura, Tomohiko
    Takamune, Norihiro
    Fukayama, Satoru
    Saruwatari, Hiroshi
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 735 - 742
  • [9] A Flame Recognition Algorithm Based on LVQ Neural Network
    Zeng, Shaojun
    Chen, Yanping
    Xu, Mengting
    Chen, Zilu
    Yin, Hao
    ADVANCED OPTICAL IMAGING TECHNOLOGIES, 2018, 10816
  • [10] SEMI-SUPERVISED LEARNING FOR MUSICAL INSTRUMENT RECOGNITION
    Diment, Aleksandr
    Heittola, Toni
    Virtanen, Tuomas
    2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,