Full-Scale Piano Score Recognition

被引:0
|
作者
Zhang, Xiang-Yi [1 ]
Hsu, Jia-Lien [1 ]
机构
[1] Fu Jen Catholic Univ, Dept Comp Sci & Informat Engn, New Taipei City 242062, Taiwan
来源
APPLIED SCIENCES-BASEL | 2025年 / 15卷 / 05期
关键词
sheet music; Optical Music Recognition; YOLOv8; CRNN;
D O I
10.3390/app15052857
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Sheet music is one of the most efficient methods for storing music. Meanwhile, a large amount of sheet music-image data is stored in paper form, but not in a computer-readable format. Therefore, digitizing sheet music is an essential task, such that the encoded music object could be effectively utilized for tasks such as editing or playback. Although there have been a few studies focused on recognizing sheet music images with simpler structures-such as monophonic scores or more modern scores with relatively simple structures, only containing clefs, time signatures, key signatures, and notes-in this paper we focus on the issue of classical sheet music containing dynamics symbols and articulation signs, more than only clefs, time signatures, key signatures, and notes. Therefore, this study augments the data from the GrandStaff dataset by concatenating single-line scores into multi-line scores and adding various classical music dynamics symbols not included in the original GrandStaff dataset. Given a full-scale piano score in pages, our approach first applies three YOLOv8 models to perform the three tasks: 1. Converting a full page of sheet music into multiple single-line scores; 2. Recognizing the classes and absolute positions of dynamics symbols in the score; and 3. Finding the relative positions of dynamics symbols in the score. Then, the identified dynamics symbols are removed from the original score, and the remaining score serves as the input into a Convolutional Recurrent Neural Network (CRNN) for the following steps. The CRNN outputs KERN notation (KERN, a core pitch/duration representation for common practice music notation) without dynamics symbols. By combining the CRNN output with the relative and absolute position information of the dynamics symbols, the final output is obtained. The results show that with the assistance of YOLOv8, there is a significant improvement in accuracy.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] FULL-SCALE RESEARCH
    WALSKI, TM
    JOURNAL OF ENVIRONMENTAL ENGINEERING-ASCE, 1991, 117 (05): : 519 - 520
  • [2] Full-scale model
    Nathan, Stuart
    Engineer, 2010, JUNE
  • [3] Full-scale shaking
    不详
    MECHANICAL ENGINEERING, 2006, 128 (07) : 14 - 14
  • [4] Full-scale attack
    Sergeev, K.
    Tselliuloza, Bumaga, Karton/Pulp, Paper, Board, 2006, (05): : 20 - 23
  • [5] Piano concertos one to six in full score
    不详
    CLAVIER, 2006, 45 (04): : 38 - 38
  • [6] FULL-SCALE EXPERIMENTS - DISCUSSION
    HOLMES, JD
    SIERPUTOWSKI, P
    LITTLER, JD
    LEE, BE
    RICHARDSON, GM
    DALLEY, S
    GERHARD, HJ
    SURRY, D
    COOK, NJ
    STATHOPOULOS, T
    LARSEN, A
    HOXEY, R
    CASTRO, I
    HANDA, K
    BREEZE, G
    BIETRY, J
    MATSUMOTO, M
    LAROSE, G
    JOURNAL OF WIND ENGINEERING AND INDUSTRIAL AERODYNAMICS, 1995, 57 (2-3) : 420 - 432
  • [7] FULL-SCALE CURRENCY HEDGING
    Czasonis, Megan
    Kritzman, Mark
    Turkington, David
    JOURNAL OF INVESTMENT MANAGEMENT, 2024, 22 (02): : 25 - 35
  • [8] Full-scale tests for housing
    Söderlind, L
    1ST INTERNATIONAL RILEM SYMPOSIUM ON SELF COMPACTING CONCRETE, 1999, 7 : 723 - 728
  • [9] Full-scale trials for RCC
    de Andrade, MAS
    Traboulsi, MA
    Bittencourt, RM
    de Andrade, WP
    ROLLER COMPACTED CONCRETE DAMS, 2003, : 891 - 895
  • [10] FULL-SCALE COLLISION TESTS
    CARLEBUR, AFC
    SAFETY SCIENCE, 1995, 19 (2-3) : 171 - 178