Study of Automatic Piano Transcription Algorithms based on the Polyphonic Properties of Piano Audio

Cited by: 0
Authors
Liang Y. [1]
Pan F. [2]
Affiliations
[1] Department of Educational Sciences and Music, Luoyang Institute of Science and Technology, Henan, Luoyang
[2] Department of Sports Training, Guangzhou Sport University, Guangdong, Guangzhou
Keywords
Automatic transcription; Convolutional neural network; Piano audio; Polyphonic characteristics
DOI
10.5573/IEIESPC.2023.12.5.412
Abstract
The polyphonic characteristics of piano audio make automatic transcription particularly challenging. This study briefly analyzed the polyphonic characteristics of piano audio and introduced three piano audio features: short-time Fourier transform (STFT), constant-Q transform (CQT), and variable-Q transform (VQT). An algorithm integrating a convolutional neural network (CNN) with a bidirectional gated recurrent unit (BiGRU) was developed and tested on the MAPS dataset to detect note start and end points and the fundamental tones of polyphonic audio. The results showed that the combined algorithm performed better with VQT as input than with STFT or CQT, and that CNN-BiGRU outperformed CNN and CNN-GRU in fundamental tone detection, with a P value, R value, and F1-measure of 97.16%, 97.34%, and 97.25%, respectively. The experimental results indicate that the designed automatic piano transcription algorithm is reliable and can be applied in practical music settings. Copyright © 2023 The Institute of Electronics and Information Engineers.
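As a quick consistency check on the reported metrics, the sketch below recomputes the F1-measure from the P and R values quoted in the abstract. It assumes that P and R denote precision and recall and that F1 is their harmonic mean; the abstract does not define these terms, and the function name `f1_measure` is hypothetical.

```python
def f1_measure(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (same units as the inputs).

    Assumption: the abstract's "P value" and "R value" are precision and recall.
    """
    if precision + recall == 0:
        return 0.0  # avoid division by zero when both inputs are zero
    return 2 * precision * recall / (precision + recall)


# Values quoted in the abstract for fundamental tone detection (percent).
p, r = 97.16, 97.34
f1 = f1_measure(p, r)
print(f"F1 = {f1:.2f}%")
```

Under these assumptions the recomputed value rounds to 97.25%, which agrees with the F1-measure stated in the abstract.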
Pages: 412-418
Number of pages: 6