Joint Multi-Pitch Detection Using Harmonic Envelope Estimation for Polyphonic Music Transcription

Cited by: 26
Authors
Benetos, Emmanouil [1]
Dixon, Simon [1]
Affiliations
[1] Queen Mary Univ London, Sch Elect Engn & Comp Sci, Ctr Digital Mus, London E1 4NS, England
Keywords
Automatic music transcription; harmonic envelope estimation; conditional random fields (CRFs); resonator time-frequency image; SEPARATION;
DOI
10.1109/JSTSP.2011.2162394
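Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract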
In this paper, a method for automatic transcription of music signals based on joint multiple-F0 estimation is proposed. As a time-frequency representation, the constant-Q resonator time-frequency image is employed, while a novel noise suppression technique based on pink noise assumption is applied in a preprocessing step. In the multiple-F0 estimation stage, the optimal tuning and inharmonicity parameters are computed and a salience function is proposed in order to select pitch candidates. For each pitch candidate combination, an overlapping partial treatment procedure is used, which is based on a novel spectral envelope estimation procedure for the log-frequency domain, in order to compute the harmonic envelope of candidate pitches. In order to select the optimal pitch combination for each time frame, a score function is proposed which combines spectral and temporal characteristics of the candidate pitches and also aims to suppress harmonic errors. For postprocessing, hidden Markov models (HMMs) and conditional random fields (CRFs) trained on MIDI data are employed, in order to boost transcription accuracy. The system was trained on isolated piano sounds from the MAPS database and was tested on classic and jazz recordings from the RWC database, as well as on recordings from a Disklavier piano. A comparison with several state-of-the-art systems is provided using a variety of error metrics, where encouraging results are indicated.
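The pitch-candidate step mentioned in the abstract can be illustrated with a minimal sketch. This is not the salience function proposed in the paper, which also accounts for tuning and inharmonicity; it is a generic harmonic-sum salience on a log-frequency magnitude spectrum, and the names `harmonic_salience`, `pick_candidates`, `bins_per_octave`, and `num_harmonics` are hypothetical. It relies only on the fact that, on a log-frequency axis, the h-th harmonic of any pitch sits a fixed number of bins above the fundamental.

```python
import math


def harmonic_salience(spectrum, bins_per_octave, num_harmonics=5):
    """Sum log-frequency magnitudes at the expected harmonic positions of each bin."""
    # Harmonic h lies bins_per_octave * log2(h) bins above the fundamental,
    # regardless of which pitch the fundamental is (assumed ideal harmonicity).
    offsets = [round(bins_per_octave * math.log2(h))
               for h in range(1, num_harmonics + 1)]
    salience = []
    for p in range(len(spectrum)):
        total = 0.0
        for off in offsets:
            if p + off < len(spectrum):  # ignore harmonics above the top bin
                total += spectrum[p + off]
        salience.append(total)
    return salience


def pick_candidates(salience, max_candidates=3):
    """Return the indices of the highest-salience bins, strongest first."""
    ranked = sorted(range(len(salience)), key=lambda i: salience[i], reverse=True)
    return ranked[:max_candidates]


# Toy frame: one note at bin 10 with five partials, 12 bins per octave.
frame = [0.0] * 60
for partial_bin in (10, 22, 29, 34, 38):
    frame[partial_bin] = 1.0
frame_salience = harmonic_salience(frame, bins_per_octave=12)
top_candidate = pick_candidates(frame_salience, max_candidates=1)
```

In a joint estimation scheme of the kind the abstract describes, candidates selected this way would then be evaluated in combination rather than accepted individually, since a plain harmonic sum also rewards octave-related bins.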
Pages: 1111-1123
Page count: 13
Related papers
50 in total
  • [21] Pitch detection in polyphonic music using instrument tone models
    Li, Yipeng
    Wang, DeLiang
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PTS 1-3, 2007, : 481 - +
  • [22] JOINT DOA AND MULTI-PITCH ESTIMATION VIA BLOCK SPARSE DICTIONARY LEARNING
    Kronvall, Ted
    Adalbjornsson, Stefan Ingi
    Jakobsson, Andreas
    2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 1053 - 1057
  • [23] Using multi-scale product spectrum for single and multi-pitch estimation
    Messaoud, M. A. B.
    Bouzid, A.
    Ellouze, N.
    IET SIGNAL PROCESSING, 2011, 5 (03) : 344 - 355
  • [24] AN ADAPTIVE PENALTY APPROACH TO MULTI-PITCH ESTIMATION
    Kronvall, Ted
    Elvander, Filip
    Adalbjornsson, Stefan Ingi
    Jakobsson, Andreas
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 31 - 35
  • [25] EXPECTATION-MAXIMIZATION ALGORITHM FOR MULTI-PITCH ESTIMATION AND SEPARATION OF OVERLAPPING HARMONIC SPECTRA
    Badeau, Roland
    Emiya, Valentin
    David, Bertrand
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-8, PROCEEDINGS, 2009, : 3073 - 3076
  • [26] Multi-pitch estimation with polyphony per instrument information for Western classical and electronic music
    Michael Taenzer
    EURASIP Journal on Audio, Speech, and Music Processing, 2025 (1)
  • [27] Multi-pitch estimation exploiting block sparsity
    Adalbjornsson, Stefan I.
    Jakobsson, Andreas
    Christensen, Mads G.
    SIGNAL PROCESSING, 2015, 109 : 236 - 247
  • [28] Predominant vocal pitch detection in polyphonic music
    Shao, Xi
    Xu, Changsheng
    Kankanhalli, Mohan S.
    2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 897 - 900
  • [29] Harmonic and inharmonic Nonnegative Matrix Factorization for polyphonic pitch transcription
    Vincent, Emmanuel
    Bertin, Nancy
    Badeau, Roland
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 109 - +
  • [30] MULTI-PITCH ESTIMATION AND TRACKING USING BAYESIAN INFERENCE IN BLOCK SPARSITY
    Karimian-Azari, Sam
    Jakobsson, Andreas
    Jensen, Jesper R.
    Christensen, Mads G.
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 16 - 20