Study of Automatic Piano Transcription Algorithms based on the Polyphonic Properties of Piano Audio

被引:0
|
作者
Liang Y. [1 ]
Pan F. [2 ]
机构
[1] Department of Educational Sciences and Music, Luoyang Institute of Science and Technology, Henan, Luoyang
[2] Department of Sports Training, Guangzhou Sport University, Guangdong, Guangzhou
关键词
Automatic transcription; Convolutional neural network; Piano audio; Polyphonic characteristics;
D O I
10.5573/IEIESPC.2023.12.5.412
中图分类号
学科分类号
摘要
The polyphonic characteristics of piano audio make automatic transcription particularly challenging. This study briefly analyzed the polyphonic characteristics of piano audio and introduced three piano audio features: short-time Fourier transform (STFT), constant-Q transform (CQT), and variable-Q transform (VQT). An algorithm integrating a convolutional neural network (CNN) with a bidirectional gated recurrent unit (BiGRU) was developed and tested on the MAPS dataset to detect the note start and end points and fundamental tones of polyphone. The results showed that the combined algorithm performed better than STFT and CQT when VQT was used as input, and CNN-BiGRU outperformed CNN and CNN-GRU in terms of the P value, R-value, and F1-measure in the fundamental tone detection of 97.16%, 97.34%, and 97.25%, respectively. The experimental results of this paper confirmed that the designed automatic piano transcription algorithm is reliable and can be further adopted in the practical music field. Copyrights © 2023 The Institute of Electronics and Information Engineers.
引用
收藏
页码:412 / 418
页数:6
相关论文
共 50 条
  • [1] Automatic transcription of piano polyphonic music
    Kobzantsev, A
    Chazan, D
    Zeevi, Y
    ISPA 2005: Proceedings of the 4th International Symposium on Image and Signal Processing and Analysis, 2005, : 414 - 418
  • [3] Multitask Learning for Polyphonic Piano Transcription, a Case Study
    Kelz, Rainer
    Boeck, Sebastian
    Widmer, Gerhard
    2019 INTERNATIONAL WORKSHOP ON MULTILAYER MUSIC REPRESENTATION AND PROCESSING (MMRP 2019), 2019, : 85 - 91
  • [4] Polyphonic piano transcription based on graph convolutional network
    Xiao, Zhe
    Chen, Xin
    Zhou, Li
    SIGNAL PROCESSING, 2023, 212
  • [5] Event based transcription system for polyphonic piano music
    Costantini, Giovanni
    Perfetti, Renzo
    Todisco, Massimiliano
    SIGNAL PROCESSING, 2009, 89 (09) : 1798 - 1811
  • [6] A Discriminative Model for Polyphonic Piano Transcription
    Graham E. Poliner
    Daniel P. W. Ellis
    EURASIP Journal on Advances in Signal Processing, 2007
  • [7] A discriminative model for polyphonic piano transcription
    Poliner, Graham E.
    Ellis, Daniel P. W.
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2007, 2007 (1)
  • [8] On the Effect of Memory Width in Automatic Transcription Systems for Polyphonic Piano Music
    Costantini, Giovanni
    Todisco, Massimiliano
    Saggio, Giovanni
    IMCIC'11: THE 2ND INTERNATIONAL MULTI-CONFERENCE ON COMPLEXITY, INFORMATICS AND CYBERNETICS, VOL I, 2011, : 124 - 127
  • [9] Piano automatic transcription based on transformer
    Wang, Yuan
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (05) : 8441 - 8448
  • [10] Improving generalization for classification-based polyphonic piano transcription
    Poliner, Graham E.
    Ellis, Daniel P. W.
    2007 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, 2007, : 309 - 312