Deep Learning-Based Optical Music Recognition for Semantic Representation of Non-overlap and Overlap Music Notes

被引:0
|
作者
Abdulazeez, Rana L. [1 ]
Alizadeh, Fattah [2 ]
机构
[1] Salahaddin Univ, Dept Software & Informat Engn, Coll Engn, Erbil, Kurdistan Regio, Iraq
[2] Univ Kurdistan Hewler, Sch Sci & Engn, Dept Comp Engn, Erbil, Kurdistan Regio, Iraq
来源
关键词
Long short-term memory network; Convolutional neural network; Segmentation; Semantic representation; Overlapping;
D O I
10.14500/aro.11402
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
-In the technology era, the process of teaching a computer to interpret musical notation is termed optical music recognition (OMR). It aims to convert musical note sheets presented in an image into a computer -readable format. Recently, the sequence -to -sequence model along with the attention mechanism (which is used in text and handwritten recognition) has been used in music notes recognition. However, due to the gradual disappearance of excessively long sequences of musical sheets, the mentioned OMR models which consist of long short-term memory are facing difficulties in learning the relationships among the musical notations. Consequently, a new framework has been proposed, leveraging the image segmentation technique to break up the procedure into several steps. In addition, an overlap problem in OMR has been addressed in this study. Overlapping can result in misinterpretation of music notations, producing inaccurate findings. Thus, a novel algorithm is being suggested to detect and segment the notations that are extremely close to each other. Our experiments are based on the usage of the Convolutional Neural Network block as a feature extractor from the image of the musical sheet and the sequence -to -sequence model to retrieve the corresponding semantic representation. The proposed approach has been evaluated on The Printed Images of Music Staves dataset. The achieved results confirm that our suggested framework successfully solves the problem of long sequence music sheets, obtaining SER 0% for the non -overlap symbols in the best scenario. Furthermore, our approach has shown promising results in addressing the overlapping problem: 23.12 % SER for overlapping symbols.
引用
收藏
页码:79 / 87
页数:9
相关论文
共 50 条
  • [21] Deep learning-based late fusion of multimodal information for emotion classification of music video
    Yagya Raj Pandeya
    Joonwhoan Lee
    Multimedia Tools and Applications, 2021, 80 : 2887 - 2905
  • [22] Deep Reinforcement Learning-based Music Recommendation with Knowledge Graph Using Acoustic Features
    Sakurai, Keigo
    Togo, Ren
    Ogawa, Takahiro
    Haseyama, Miki
    ITE TRANSACTIONS ON MEDIA TECHNOLOGY AND APPLICATIONS, 2022, 10 (01): : 8 - 17
  • [23] Visual Semantic-Based Representation Learning Using Deep CNNs for Scene Recognition
    Gupta, Shikha
    Sharma, Krishan
    Dinesh, Dileep Aroor
    Thenkanidiyoor, Veena
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2021, 17 (02)
  • [24] A detector for page-level handwritten music object recognition based on deep learning
    Zhang, Yusen
    Huang, Zhiqing
    Zhang, Yanxin
    Ren, Keyan
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (13): : 9773 - 9787
  • [25] Research on Music Emotion Recognition Model of Deep Learning Based on Musical Stage Effect
    Huang, Cuiqing
    Zhang, Qiang
    SCIENTIFIC PROGRAMMING, 2021, 2021
  • [26] A detector for page-level handwritten music object recognition based on deep learning
    Yusen Zhang
    Zhiqing Huang
    Yanxin Zhang
    Keyan Ren
    Neural Computing and Applications, 2023, 35 : 9773 - 9787
  • [27] Deep Learning-Based Plasma Optical Boundary Recognition and Reconstruction on EAST Tokamak
    Hou, Jiancheng
    Hu, Jiahui
    Han, Xiaofeng
    Yang, Jianhua
    Wang, Jichao
    Wang, Teng
    Chen, Jingang
    FUSION SCIENCE AND TECHNOLOGY, 2025,
  • [28] Deep Learning-Based Approaches for Text Recognition in PCB Optical Inspection: A Survey
    Ghosh, Shajib
    Sathiaseelan, Mukhil Azhagan Mallaiyan
    Asadizanjani, Navid
    PROCEEDINGS OF THE 2021 IEEE INTERNATIONAL CONFERENCE ON PHYSICAL ASSURANCE AND INSPECTION ON ELECTRONICS (PAINE), 2021,
  • [29] Deep Learning-Based Approach for Sign Language Gesture Recognition With Efficient Hand Gesture Representation
    Al-Hammadi, Muneer
    Muhammad, Ghulam
    Abdul, Wadood
    Alsulaiman, Mansour
    Bencherif, Mohammed A.
    Alrayes, Tareq S.
    Mathkour, Hassan
    Mekhtiche, Mohamed Amine
    IEEE ACCESS, 2020, 8 (08): : 192527 - 192542
  • [30] RETRACTED: Research on Music Content Recognition and Recommendation Technology Based on Deep Learning (Retracted Article)
    Yang, Gao
    SECURITY AND COMMUNICATION NETWORKS, 2022, 2022