Timbre-Based Portable Musical Instrument Recognition Using LVQ Learning Algorithm

被引：1

作者：

Sun, Yizhen ^{[1
]}

机构：

[1] Zhengzhou SIAS Univ, Sch Mus & Drama, Zhengzhou, Peoples R China

来源：

MOBILE NETWORKS & APPLICATIONS | 2023年 / 28卷 / 06期

关键词：

Portable musical instrument recognition; Feature extraction; LVQ neural network learning algorithm; Sensor; Internet of Things;

D O I：

10.1007/s11036-023-02174-y

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

With the advent of deep learning algorithms, the field of portable musical instrument recognition, i.e., musical recognition using mobile devices, has experienced substantial progress. Manual labeling, which is time-consuming, labor-intensive, and error-prone, has historically been used to classify instruments. Recent research, however, has concentrated on automating the classification process through the extraction of music properties. Nonetheless, due to the complicated interplay between the fundamental wave and harmonics in music, identifying important audio information remains difficult. This article describes the underlying ideas and implementation approach of portable musical instrument identification based on acoustic characteristics in detail. This paper proposes utilizing the Learning Vector Quantization (LVQ) neural network learning technique to extract acoustic components from music sources using the Short-Time Fourier Transform (STFT). In addition, this paper uses a feature selection strategy to pick the most informative features, lowering the dimensionality of the classifier's feature vector and improving training and recognition efficiency. The weighted recognition accuracy is 79.8% when all characteristics are picked, according to the experimental results. However, by decreasing the number of feature dimensions to 24, the system obtains its greatest weighted recognition rate of 81.2%, outperforming the performance with all features enabled by 1.3%. This illustrates how feature dimensionality reduction may increase recognition performance. However, decreasing the feature dimensions beyond 24 resulted in worse recognition accuracy, demonstrating the existence of an ideal feature dimensionality for each portable musical instrument category. A feature vector with 24 dimensions produces the greatest results for piano recognition, whereas a vector with 20 dimensions offers the maximum accuracy for cello recognition. These findings highlight the significance of feature selection in obtaining high accuracy rates for certain instrument types.

引用

页码：2171 / 2181

页数：11

共 50 条

[1] Fast recognition of musical sounds based on timbre
Agus, Trevor R.
Suied, Clara
Thorpe, Simon J.
Pressnitzer, Daniel
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 131 (05): : 4124 - 4133
[2] Recognition of Musical Instrument Using Deep Learning Techniques
Rajesh, Sangeetha
Nalini, N. J.
INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2021, 11 (04) : 41 - 60
[3] Learning metrics on spectrotemporal modulations reveals the perception of musical instrument timbre
Thoret, Etienne
Caramiaux, Baptiste
Depalle, Philippe
McAdams, Stephen
NATURE HUMAN BEHAVIOUR, 2021, 5 (03) : 369 - +
[4] Learning metrics on spectrotemporal modulations reveals the perception of musical instrument timbre
Etienne Thoret
Baptiste Caramiaux
Philippe Depalle
Stephen McAdams
Nature Human Behaviour, 2021, 5 : 369 - 377
[5] PITCH-TIMBRE DISENTANGLEMENT OF MUSICAL INSTRUMENT SOUNDS BASED ON VAE-BASED METRIC LEARNING
Tanaka, Keitaro
Nishikimi, Ryo
Bando, Yoshiaki
Yoshii, Kazuyoshi
Morishima, Shigeo
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 111 - 115
[6] Timbre Model of Software Musical Instrument Based on Sine Interpolation
Cao, Xi-zheng
Meng, Hui-li
Xu, Jui-cheng
PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON IMAGE ANALYSIS AND SIGNAL PROCESSING, 2009, : 358 - 361
[7] Fractional Fourier Transform Based Features for Musical Instrument Recognition Using Machine Learning Techniques
Bhalke, D. G.
Rao, C. B. Rama
Bormane, D. S.
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON FRONTIERS OF INTELLIGENT COMPUTING: THEORY AND APPLICATIONS (FICTA) 2013, 2014, 247 : 155 - 163
[8] Hyperbolic Timbre Embedding for Musical Instrument Sound Synthesis Based on Variational Autoencoders
Nakashima, Futa
Nakamura, Tomohiko
Takamune, Norihiro
Fukayama, Satoru
Saruwatari, Hiroshi
PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 735 - 742
[9] A Flame Recognition Algorithm Based on LVQ Neural Network
Zeng, Shaojun
Chen, Yanping
Xu, Mengting
Chen, Zilu
Yin, Hao
ADVANCED OPTICAL IMAGING TECHNOLOGIES, 2018, 10816
[10] SEMI-SUPERVISED LEARNING FOR MUSICAL INSTRUMENT RECOGNITION
Diment, Aleksandr
Heittola, Toni
Virtanen, Tuomas
2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,

← 1 2 3 4 5 →