CNN-based Note Onset Detection using Synthetic Data Augmentation

被引:0
|
作者
Mounir, Mina [1 ,3 ]
Karsmakers, Peter [2 ]
van Waterschoot, Toon [1 ,3 ]
机构
[1] Katholieke Univ Leuven, Dept Elect Engn, ESAT STADIUS, Leuven, Belgium
[2] Katholieke Univ Leuven, DTAI ADVISE, Dept Comp Sci, Geel Campus, Geel, Belgium
[3] Katholieke Univ Leuven, ESAT Lab, Leuven, Belgium
基金
欧洲研究理事会;
关键词
CNN; data augmentation; note onset detection;
D O I
10.23919/eusipco47968.2020.9287621
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Detecting the onset of notes in music excerpts is a fundamental problem in many music signal processing tasks, including analysis, synthesis, and information retrieval. When addressing the note onset detection (NOD) problem using a data-driven methodology, a major challenge is the availability and quality of labeled datasets used for both model training/tuning and evaluation. As most of the available datasets are manually annotated, the amount of annotated music excerpts is limited and the annotation strategy and quality varies across data sets. To counter both problems, in this paper we propose to use semi-synthetic datasets where the music excerpts are mixes of isolated note recordings. The advantage resides in the annotations being automatically generated while mixing the notes, as isolated note onsets are straightforward to detect using a simple energy measure. A semi-synthetic dataset is used in this work for augmenting a real piano dataset when training a convolutional Neural Network (CNN) with three novel model training strategies. Training the CNN on a semi-synthetic dataset and retraining only the CNN classification layers on a real dataset results in higher average F-1-score (F-1) scores with lower variance.
引用
收藏
页码:171 / 175
页数:5
相关论文
共 50 条
  • [1] DATA AUGMENTATION FOR CNN-BASED PEOPLE DETECTION IN AERIAL IMAGES
    Chen, Hua-Tsung
    Liu, Che-Han
    Tsai, Wen-Jiin
    2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW 2018), 2018,
  • [2] Effectiveness of Data Augmentation for CNN-based Pupil Center Point Detection
    Kan, Naoyuki
    Kondo, Nagisa
    Chinsatit, Warapon
    Saitoh, Takeshi
    2018 57TH ANNUAL CONFERENCE OF THE SOCIETY OF INSTRUMENT AND CONTROL ENGINEERS OF JAPAN (SICE), 2018, : 41 - 46
  • [3] Data Augmentation in CNN-based Periocular Authentication
    Dellana, Ryan
    Roy, Kaushik
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND MANAGEMENT (ICICM 2016), 2016, : 141 - 145
  • [4] IMPROVING CNN-BASED VISEME RECOGNITION USING SYNTHETIC DATA
    Mattos, Andrea Britto
    Borges Oliveira, Dario Augusto
    Morais, Edmilson da Silva
    2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2018,
  • [5] Towards Robust CNN-based Object Detection through Augmentation with Synthetic Rain Variations
    Volk, Georg
    Mueller, Stefan
    von Bernuth, Alexander
    Hospach, Dennis
    Bringmann, Oliver
    2019 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2019, : 285 - 292
  • [6] CNN-BASED INITIAL LOCALIZATION IMPROVED BY DATA AUGMENTATION
    Mueller, M. S.
    Metzger, A.
    Jutzi, B.
    ISPRS TC I MID-TERM SYMPOSIUM INNOVATIVE SENSING - FROM SENSORS TO METHODS AND APPLICATIONS, 2018, 4-1 : 117 - 124
  • [7] CNN-based data augmentation for handwritten gurumukhi text recognition
    Sareen, Bhavna
    Ahuja, Rakesh
    Singh, Amitoj
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (28) : 71035 - 71053
  • [8] PRATIT: a CNN-based emotion recognition system using histogram equalization and data augmentation
    Dhara Mungra
    Anjali Agrawal
    Priyanka Sharma
    Sudeep Tanwar
    Mohammad S. Obaidat
    Multimedia Tools and Applications, 2020, 79 : 2285 - 2307
  • [9] PRATIT: a CNN-based emotion recognition system using histogram equalization and data augmentation
    Mungra, Dhara
    Agrawal, Anjali
    Sharma, Priyanka
    Tanwar, Sudeep
    Obaidat, Mohammad S.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (3-4) : 2285 - 2307
  • [10] ENHANCING THE QUALITY OF CNN-BASED BURNED AREA DETECTION IN SATELLITE IMAGERY THROUGH DATA AUGMENTATION
    Hnatushenko, Vik.
    Hnatushenko, V.
    Soldatenko, D.
    Heipke, C.
    GEOSPATIAL WEEK 2023, VOL. 48-1, 2023, : 1749 - 1755