Full-Scale Piano Score Recognition

被引:0
|
作者
Zhang, Xiang-Yi [1 ]
Hsu, Jia-Lien [1 ]
机构
[1] Fu Jen Catholic Univ, Dept Comp Sci & Informat Engn, New Taipei City 242062, Taiwan
来源
APPLIED SCIENCES-BASEL | 2025年 / 15卷 / 05期
关键词
sheet music; Optical Music Recognition; YOLOv8; CRNN;
D O I
10.3390/app15052857
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Sheet music is one of the most efficient methods for storing music. Meanwhile, a large amount of sheet music-image data is stored in paper form, but not in a computer-readable format. Therefore, digitizing sheet music is an essential task, such that the encoded music object could be effectively utilized for tasks such as editing or playback. Although there have been a few studies focused on recognizing sheet music images with simpler structures-such as monophonic scores or more modern scores with relatively simple structures, only containing clefs, time signatures, key signatures, and notes-in this paper we focus on the issue of classical sheet music containing dynamics symbols and articulation signs, more than only clefs, time signatures, key signatures, and notes. Therefore, this study augments the data from the GrandStaff dataset by concatenating single-line scores into multi-line scores and adding various classical music dynamics symbols not included in the original GrandStaff dataset. Given a full-scale piano score in pages, our approach first applies three YOLOv8 models to perform the three tasks: 1. Converting a full page of sheet music into multiple single-line scores; 2. Recognizing the classes and absolute positions of dynamics symbols in the score; and 3. Finding the relative positions of dynamics symbols in the score. Then, the identified dynamics symbols are removed from the original score, and the remaining score serves as the input into a Convolutional Recurrent Neural Network (CRNN) for the following steps. The CRNN outputs KERN notation (KERN, a core pitch/duration representation for common practice music notation) without dynamics symbols. By combining the CRNN output with the relative and absolute position information of the dynamics symbols, the final output is obtained. The results show that with the assistance of YOLOv8, there is a significant improvement in accuracy.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] MY MAMA AND THE FULL-SCALE INVASION
    Nigh, Katherine Jean
    Denisova, Sasha
    THEATRE JOURNAL, 2024, 76 (03)
  • [32] SUPERPHENIX - FULL-SCALE BREEDER REACTOR
    VENDRYES, GA
    SCIENTIFIC AMERICAN, 1977, 236 (03) : 26 - 35
  • [33] Full-scale tests of deicing with seawater
    Saeterdal, Ane
    Sundso, Per-Arne
    COLD REGIONS SCIENCE AND TECHNOLOGY, 2024, 221
  • [34] GOING FULL-SCALE WITH SOURCE SEPARATION
    GIES, G
    BIOCYCLE, 1994, 35 (05) : 66 - 70
  • [35] Analysis of full-scale fire tests
    Auguin, G
    Forestier, BE
    Giovannelli, G
    Casalé, E
    10TH INTERNATIONAL SYMPOSIUM ON AERODYNAMICS AND VENTILATION OF VEHICLE TUNNELS: PRINCIPLES, ANALYSIS AND DESIGN, 2000, (43): : 659 - 669
  • [36] The response of full-scale piles to tunnelling
    Selemetas, D.
    Standing, J. R.
    Mair, R. J.
    GEOTECHNICAL ASPECTS OF UNDERGROUND CONSTRUCTION IN SOFT GROUND, 2006, : 763 - +
  • [37] Full-scale impact tests on pipelines
    Palmer, A
    Touhey, M
    Holder, S
    Anderson, M
    Booth, S
    INTERNATIONAL JOURNAL OF IMPACT ENGINEERING, 2006, 32 (08) : 1267 - 1283
  • [38] CANLEX full-scale experiment and modelling
    Byrne, PM
    Puebla, H
    Chan, DH
    Soroush, A
    Morgenstern, NR
    Cathro, DC
    Gu, WH
    Phillips, R
    Robertson, PK
    Hofmann, BA
    Wride, CE
    Sego, DC
    Plewes, HD
    List, BR
    Tan, S
    CANADIAN GEOTECHNICAL JOURNAL, 2000, 37 (03) : 543 - 562
  • [39] Full-scale tests of a railway bridge
    Maragakis, EM
    Douglas, BM
    Chen, QB
    Sandirasegaram, U
    STRUCTURAL ANALYSIS AND DESIGN: BRIDGES, CULVERTS, AND PIPES, 1998, (1624): : 140 - 147
  • [40] Full-scale application of the BABE® technology
    Salem, S
    Berends, DHJG
    van der Roest, HF
    van der Kuij, RJ
    van Loosdrecht, MCM
    WATER SCIENCE AND TECHNOLOGY, 2004, 50 (07) : 87 - 96