Full-Scale Piano Score Recognition

被引:0
|
作者
Zhang, Xiang-Yi [1 ]
Hsu, Jia-Lien [1 ]
机构
[1] Fu Jen Catholic Univ, Dept Comp Sci & Informat Engn, New Taipei City 242062, Taiwan
来源
APPLIED SCIENCES-BASEL | 2025年 / 15卷 / 05期
关键词
sheet music; Optical Music Recognition; YOLOv8; CRNN;
D O I
10.3390/app15052857
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Sheet music is one of the most efficient methods for storing music. Meanwhile, a large amount of sheet music-image data is stored in paper form, but not in a computer-readable format. Therefore, digitizing sheet music is an essential task, such that the encoded music object could be effectively utilized for tasks such as editing or playback. Although there have been a few studies focused on recognizing sheet music images with simpler structures-such as monophonic scores or more modern scores with relatively simple structures, only containing clefs, time signatures, key signatures, and notes-in this paper we focus on the issue of classical sheet music containing dynamics symbols and articulation signs, more than only clefs, time signatures, key signatures, and notes. Therefore, this study augments the data from the GrandStaff dataset by concatenating single-line scores into multi-line scores and adding various classical music dynamics symbols not included in the original GrandStaff dataset. Given a full-scale piano score in pages, our approach first applies three YOLOv8 models to perform the three tasks: 1. Converting a full page of sheet music into multiple single-line scores; 2. Recognizing the classes and absolute positions of dynamics symbols in the score; and 3. Finding the relative positions of dynamics symbols in the score. Then, the identified dynamics symbols are removed from the original score, and the remaining score serves as the input into a Convolutional Recurrent Neural Network (CRNN) for the following steps. The CRNN outputs KERN notation (KERN, a core pitch/duration representation for common practice music notation) without dynamics symbols. By combining the CRNN output with the relative and absolute position information of the dynamics symbols, the final output is obtained. The results show that with the assistance of YOLOv8, there is a significant improvement in accuracy.
引用
收藏
页数:20
相关论文
共 50 条
  • [41] MINICOMPUTERS IN FULL-SCALE STRUCTURAL TESTING
    Pinjarkar, Suresh G.
    Guedelhoefer, Otto C.
    Journal of the Technical Councils of ASCE: Proceedings of the ASCE, 1979, 105 (01): : 103 - 111
  • [42] FULL-SCALE CENTRIFUGAL MILL.
    LLOYD, P.J.D.
    BRADLEY, A.A.
    HINDE, A.L.
    STANTON, K.H.
    SCHYMURA, G.K.
    1982, V 82 (N 6): : 149 - 156
  • [43] Predicting full-scale TOC removal
    Department of Civil Engineering, Virginia Polytechnic Institute, State University, 407 NEB, Blacksburg, VA 24061-0246, United States
    J Am Water Works Assoc, 4 (159-170):
  • [44] Testing full-scale composite aircraft
    不详
    AIRCRAFT ENGINEERING AND AEROSPACE TECHNOLOGY, 1996, 68 (05): : 32 - 33
  • [45] RESEARCH AT FULL-SCALE - THE HDR PROGRAM
    SCHOLL, KH
    HOLMAN, GS
    NUCLEAR ENGINEERING INTERNATIONAL, 1983, 28 (336): : 39 - 43
  • [46] Full-scale tests lead to the target
    Sulzer Tech Rev, 2 (28-31):
  • [47] Full-scale testing of fuselage panels
    Bakuckas, JG
    Bigelow, CA
    Tan, PW
    Awerbuch, J
    Lau, AC
    Tan, TM
    IEEE SYSTEMS READINESS TECHNOLOGY CONFERENCE: 2001 IEEE AUTOTESTCON PROCEEDINGS, 2001, : 827 - 846
  • [48] FULL-SCALE INTENSIFICATION AND MENTAL PROCESSES
    ROTHE, B
    STEININGER, H
    DEUTSCHE ZEITSCHRIFT FUR PHILOSOPHIE, 1987, 35 (07): : 605 - 613
  • [49] DYNAMIC TESTS OF FULL-SCALE STRUCTURES
    HUDSON, DE
    JOURNAL OF THE ENGINEERING MECHANICS DIVISION-ASCE, 1977, 103 (06): : 1141 - 1157
  • [50] A CITY SHIFTS INTO FULL-SCALE RECYCLING
    FLESCHNER, E
    CROMBIE, G
    MOREAU, T
    BIOCYCLE, 1992, 33 (01) : 38 - 42