Joint Multi-Pitch Detection Using Harmonic Envelope Estimation for Polyphonic Music Transcription

Cited by: 26
Authors
Benetos, Emmanouil [1]
Dixon, Simon [1]
Affiliations
[1] Queen Mary Univ London, Sch Elect Engn & Comp Sci, Ctr Digital Mus, London E1 4NS, England
Keywords
Automatic music transcription; harmonic envelope estimation; conditional random fields (CRFs); resonator time-frequency image; SEPARATION;
DOI
10.1109/JSTSP.2011.2162394
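Chinese Library Classification number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract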
In this paper, a method for automatic transcription of music signals based on joint multiple-F0 estimation is proposed. As a time-frequency representation, the constant-Q resonator time-frequency image is employed, while a novel noise suppression technique based on a pink noise assumption is applied in a preprocessing step. In the multiple-F0 estimation stage, the optimal tuning and inharmonicity parameters are computed and a salience function is proposed in order to select pitch candidates. For each pitch candidate combination, an overlapping partial treatment procedure is used, which is based on a novel spectral envelope estimation procedure for the log-frequency domain, in order to compute the harmonic envelope of candidate pitches. In order to select the optimal pitch combination for each time frame, a score function is proposed which combines spectral and temporal characteristics of the candidate pitches and also aims to suppress harmonic errors. For postprocessing, hidden Markov models (HMMs) and conditional random fields (CRFs) trained on MIDI data are employed in order to boost transcription accuracy. The system was trained on isolated piano sounds from the MAPS database and was tested on classical and jazz recordings from the RWC database, as well as on recordings from a Disklavier piano. A comparison with several state-of-the-art systems is provided using a variety of error metrics, where encouraging results are indicated.
Pages: 1111-1123
Page count: 13
Related papers
50 items in total
  • [41] Multi Pitch Estimation of Piano Music using Cartesian Genetic Programming with Spectral Harmonic Mask
    Miragaia, Rolando
    Reis, Gustavo
    Fernandez de Vega, Francisco
    Chavez, Francisco
    2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2020, : 1800 - 1807
  • [42] An iterative subspace-based multi-pitch estimation algorithm
    Zhang, Johan Xi
    Christensen, Mads Graesboll
    Jensen, Soren Holdt
    Moonen, Marc
    SIGNAL PROCESSING, 2011, 91 (01) : 150 - 154
  • [43] Multi-pitch estimation based on partial event and support transfer
    Duan, Zhiyao
    Zhang, Dan
    Zhang, Changshui
    Shi, Zhenwei
    2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 216 - 219
  • [44] Evolving a Multi-Classifier System for Multi-Pitch Estimation of Piano Music and Beyond: An Application of Cartesian Genetic Programming
    Miragaia, Rolando
    Fernandez, Francisco
    Reis, Gustavo
    Inacio, Tiago
    APPLIED SCIENCES-BASEL, 2021, 11 (07):
  • [45] Weighted Initialisation of Evolutionary Instrument and Pitch Detection in Polyphonic Music
    Dettmer, Justin
    Vatolkin, Igor
    Glasmachers, Tobias
    ARTIFICIAL INTELLIGENCE IN MUSIC, SOUND, ART AND DESIGN, EVOMUSART 2024, 2024, 14633 : 114 - 129
  • [46] NOTE ONSET DETECTION FOR THE TRANSCRIPTION OF POLYPHONIC PIANO MUSIC
    Boogaart, C. G. V. D.
    Lienhart, R.
    ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 446 - 449
  • [47] Cochannel Speech Separation Using Multi-pitch Estimation and Model Based Voiced Sequential Grouping
    Li, Ming
    Cao, Chuan
    Wang, Di
    Lu, Ping
    Fu, Qiang
    Yan, Yonghong
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 151 - 154
  • [48] A Genetic Algorithm Approach with Harmonic Structure Evolution for Polyphonic Music Transcription
    Reis, Gustavo
    Fonseca, Nuno
    Fernandez, Francisco
    Ferreira, Anibal
    ISSPIT: 8TH IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, 2008, : 491 - +
  • [49] Multiple comb filters and autocorrelation of the multi-scale product for multi-pitch estimation
    Zeremdini, Jihen
    Ben Messaoud, Mohamed Anouar
    Bouzid, Aicha
    APPLIED ACOUSTICS, 2017, 120 : 45 - 53
  • [50] TIME-RECURSIVE MULTI-PITCH ESTIMATION USING GROUP SPARSE RECURSIVE LEAST SQUARES
    Elvander, Filip
    Sward, Johan
    Jakobsson, Andreas
    2016 50TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, 2016, : 369 - 373