Full-Scale Piano Score Recognition

被引:0
|
作者
Zhang, Xiang-Yi [1 ]
Hsu, Jia-Lien [1 ]
机构
[1] Fu Jen Catholic Univ, Dept Comp Sci & Informat Engn, New Taipei City 242062, Taiwan
来源
APPLIED SCIENCES-BASEL | 2025年 / 15卷 / 05期
关键词
sheet music; Optical Music Recognition; YOLOv8; CRNN;
D O I
10.3390/app15052857
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Sheet music is one of the most efficient methods for storing music. Meanwhile, a large amount of sheet music-image data is stored in paper form, but not in a computer-readable format. Therefore, digitizing sheet music is an essential task, such that the encoded music object could be effectively utilized for tasks such as editing or playback. Although there have been a few studies focused on recognizing sheet music images with simpler structures-such as monophonic scores or more modern scores with relatively simple structures, only containing clefs, time signatures, key signatures, and notes-in this paper we focus on the issue of classical sheet music containing dynamics symbols and articulation signs, more than only clefs, time signatures, key signatures, and notes. Therefore, this study augments the data from the GrandStaff dataset by concatenating single-line scores into multi-line scores and adding various classical music dynamics symbols not included in the original GrandStaff dataset. Given a full-scale piano score in pages, our approach first applies three YOLOv8 models to perform the three tasks: 1. Converting a full page of sheet music into multiple single-line scores; 2. Recognizing the classes and absolute positions of dynamics symbols in the score; and 3. Finding the relative positions of dynamics symbols in the score. Then, the identified dynamics symbols are removed from the original score, and the remaining score serves as the input into a Convolutional Recurrent Neural Network (CRNN) for the following steps. The CRNN outputs KERN notation (KERN, a core pitch/duration representation for common practice music notation) without dynamics symbols. By combining the CRNN output with the relative and absolute position information of the dynamics symbols, the final output is obtained. The results show that with the assistance of YOLOv8, there is a significant improvement in accuracy.
引用
收藏
页数:20
相关论文
共 50 条
  • [21] FULL-SCALE HYDRAULIC MINE PLANNED
    JACKSON, D
    COAL AGE, 1983, 88 (09): : 69 - &
  • [22] Visualizing full-scale ventilation airflows
    Settles, GS
    ASHRAE JOURNAL-AMERICAN SOCIETY OF HEATING REFRIGERATING AND AIR-CONDITIONING ENGINEERS, 1997, 39 (07): : 19 - &
  • [23] Impact on nitrifiers of full-scale bioaugmentation
    Stenstrom, F.
    Jansen, J. la Cour
    WATER SCIENCE AND TECHNOLOGY, 2017, 76 (11) : 3079 - 3085
  • [24] Full-scale ANANOX® system performance
    Garuti, G
    Giordano, A
    Pirozzi, F
    WATER SA, 2001, 27 (02) : 189 - 197
  • [25] Full-Scale Fairing Qualification Tests
    Constantinides, Yiannis
    Liapis, Stergios
    Spencer, Don
    Islam, Mohammed
    Skaugset, Kjetil
    Batra, Apurva
    Baarholm, Rolf
    JOURNAL OF OFFSHORE MECHANICS AND ARCTIC ENGINEERING-TRANSACTIONS OF THE ASME, 2017, 139 (04):
  • [26] Predicting full-scale TOC removal
    Tseng, T
    Edwards, M
    JOURNAL AMERICAN WATER WORKS ASSOCIATION, 1999, 91 (04): : 159 - 170
  • [27] Full-scale Mechanical Vibrations Laboratory
    McDaniel, Cole C.
    Archer, Graham C.
    2013 ASEE ANNUAL CONFERENCE, 2013,
  • [28] VIBRATION TESTING OF FULL-SCALE STRUCTURES
    SMITH, CB
    MATTHIES.RB
    NUCLEAR ENGINEERING AND DESIGN, 1973, 25 (01) : 17 - 29
  • [29] FULL-SCALE FIRE TEST PROGRAM
    ALPERT, RL
    FIRELINE, 1978, 5 (01): : 7 - 7