Study of Automatic Piano Transcription Algorithms based on the Polyphonic Properties of Piano Audio

被引:0
|
作者
Liang Y. [1 ]
Pan F. [2 ]
机构
[1] Department of Educational Sciences and Music, Luoyang Institute of Science and Technology, Henan, Luoyang
[2] Department of Sports Training, Guangzhou Sport University, Guangdong, Guangzhou
关键词
Automatic transcription; Convolutional neural network; Piano audio; Polyphonic characteristics;
D O I
10.5573/IEIESPC.2023.12.5.412
中图分类号
学科分类号
摘要
The polyphonic characteristics of piano audio make automatic transcription particularly challenging. This study briefly analyzed the polyphonic characteristics of piano audio and introduced three piano audio features: short-time Fourier transform (STFT), constant-Q transform (CQT), and variable-Q transform (VQT). An algorithm integrating a convolutional neural network (CNN) with a bidirectional gated recurrent unit (BiGRU) was developed and tested on the MAPS dataset to detect the note start and end points and fundamental tones of polyphone. The results showed that the combined algorithm performed better than STFT and CQT when VQT was used as input, and CNN-BiGRU outperformed CNN and CNN-GRU in terms of the P value, R-value, and F1-measure in the fundamental tone detection of 97.16%, 97.34%, and 97.25%, respectively. The experimental results of this paper confirmed that the designed automatic piano transcription algorithm is reliable and can be further adopted in the practical music field. Copyrights © 2023 The Institute of Electronics and Information Engineers.
引用
收藏
页码:412 / 418
页数:6
相关论文
共 50 条
  • [31] Rhythm Transcription of Polyphonic Piano Music Based on Merged-Output HMM for Multiple Voices
    Nakamura, Eita
    Yoshii, Kazuyoshi
    Sagayama, Shigeki
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (04) : 794 - 806
  • [32] Summary of Dissertation: Transcription of polyphonic music for piano based in resolution of groups of notes and finite states
    Gómez-Meire S.
    Inteligencia Artificial, 2010, 14 (45) : 44 - 47
  • [33] A CRNN-GCN PIANO TRANSCRIPTION MODEL BASED ON AUDIO AND SKELETON FEATURES
    Li, Yuqing
    Wang, Xianke
    Wu, Ruimin
    Xu, Wei
    Cheng, Wenqing
    2023 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW, 2023,
  • [34] Randomized Matrix Decompositions and Exemplar Selection in Large Dictionaries for Polyphonic Piano Transcription
    Ari, Ismail
    Simsekli, Umut
    Cemgil, Ali Taylan
    Akarun, Lale
    JOURNAL OF NEW MUSIC RESEARCH, 2014, 43 (03) : 255 - 265
  • [35] JOINT MULTI-PITCH DETECTION AND SCORE TRANSCRIPTION FOR POLYPHONIC PIANO MUSIC
    Liu, Lele
    Morfi, Veronica
    Benetos, Emmanouil
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 281 - 285
  • [36] Polyphonic Piano Music Transcription using Long Short-Term Memory
    Sadekar, Aakash
    Mahajan, Shrinivas P.
    2019 10TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2019,
  • [37] Automatic bass line transcription from streaming polyphonic audio
    Ryynanen, Matti
    Klapuri, Anssi
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 1437 - +
  • [38] Reducing latency of neural automatic piano transcription models
    Lee, Dasol
    Jeong, Dasaem
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2023, 42 (02): : 102 - 111
  • [39] Computational Analysis of Audio Recordings of Piano Performance for Automatic Evaluation
    Kato, Norihiro
    Nakamura, Eita
    Mine, Kyoko
    Doeda, Orie
    Yamada, Masanao
    RESPONSIVE AND SUSTAINABLE EDUCATIONAL FUTURES, EC-TEL 2023, 2023, 14200 : 586 - 592
  • [40] SCORE-INFORMED TRANSCRIPTION FOR AUTOMATIC PIANO TUTORING
    Benetos, Emmanouil
    Klapuri, Anssi
    Dixon, Simon
    2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 2153 - 2157