Polyphonic Piano Transcription with a Note-Based Music Language Model

Cited by: 6
Authors
Wang, Qi [1, 2]
Zhou, Ruohua [1, 2]
Yan, Yonghong [1, 2, 3]
Affiliations
[1] Chinese Acad Sci, Inst Acoust, Key Lab Speech Acoust & Content Understanding, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100190, Peoples R China
[3] Chinese Acad Sci, Xinjiang Tech Inst Phys & Chem, Xinjiang Lab Minor Speech & Language Informat Pro, Urumqi 830001, Peoples R China
Source
APPLIED SCIENCES-BASEL | 2018 / Vol. 8 / Issue 03
Funding
National Natural Science Foundation of China
Keywords
polyphonic piano transcription; note-based music language model; recurrent neural network; restricted Boltzmann machine
DOI
10.3390/app8030470
Chinese Library Classification number
O6 [Chemistry]
Discipline classification code
0703
Abstract
This paper proposes a note-based music language model (MLM) for improving note-level polyphonic piano transcription. The MLM is built on a recurrent structure, which can model the temporal correlations between notes in music sequences. To combine the outputs of the note-based MLM and the acoustic model directly, an integrated architecture is adopted. We also propose an inference algorithm in which the note-based MLM predicts notes at the blank onsets in the thresholded transcription results. The experimental results show that the proposed inference algorithm improves note-level transcription performance. We also observe that combining the restricted Boltzmann machine (RBM) with a recurrent structure outperforms a single recurrent neural network (RNN) or long short-term memory network (LSTM) in modeling high-dimensional note sequences. Among all the MLMs, LSTM-RBM helps the system yield the best results on all evaluation metrics regardless of the performance of the acoustic models.
Pages: 15