A Music Cognition-Guided Framework for Multi-pitch Estimation

被引:2
|
作者
Li, Xiaoquan [1 ]
Yan, Yijun [2 ]
Soraghan, John [1 ]
Wang, Zheng [3 ]
Ren, Jinchang [2 ]
机构
[1] Univ Strathclyde, Dept Elect & Elect Engn, Glasgow, Lanark, Scotland
[2] Robert Gordon Univ, Natl Subsea Ctr, Aberdeen AB21 0BH, Scotland
[3] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
关键词
Music cognition; Automatic music transcription; Multi-pitch estimation; Harmonic structure detection (HSD); Polyphonic music detection; TRANSCRIPTION; NETWORK;
D O I
10.1007/s12559-022-10031-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As one of the most important subtasks of automatic music transcription (AMT), multi-pitch estimation (MPE) has been studied extensively for predicting the fundamental frequencies in the frames of audio recordings during the past decade. However, how to use music perception and cognition for MPE has not yet been thoroughly investigated. Motivated by this, this demonstrates how to effectively detect the fundamental frequency and the harmonic structure of polyphonic music using a cognitive framework. Inspired by cognitive neuroscience, an integration of the constant Q transform and a state-of-the-art matrix factorization method called shift-invariant probabilistic latent component analysis (SI-PLCA) are proposed to resolve the polyphonic short-time magnitude log-spectra for multiple pitch estimation and source-specific feature extraction. The cognitions of rhythm, harmonic periodicity and instrument timbre are used to guide the analysis of characterizing contiguous notes and the relationship between fundamental frequency and harmonic frequencies for detecting the pitches from the outcomes of SI-PLCA. In the experiment, we compare the performance of proposed MPE system to a number of existing state-of-the-art approaches (seven weak learning methods and four deep learning methods) on three widely used datasets (i.e. MAPS, BACH10 and TRIOS) in terms of F-measure (F-1) values. The experimental results show that the proposed MPE method provides the best overall performance against other existing methods.
引用
收藏
页码:23 / 35
页数:13
相关论文
共 50 条
  • [21] Using multi-scale product spectrum for single and multi-pitch estimation
    Messaoud, M. A. B.
    Bouzid, A.
    Ellouze, N.
    IET SIGNAL PROCESSING, 2011, 5 (03) : 344 - 355
  • [22] Joint DOA and multi-pitch estimation based on subspace techniques
    Zhang, Johan Xi
    Christensen, Mads Graesboll
    Jensen, Soren Holdt
    Moonen, Marc
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2012,
  • [23] Joint DOA and multi-pitch estimation based on subspace techniques
    Johan Xi Zhang
    Mads Græsbøll Christensen
    Søren Holdt Jensen
    Marc Moonen
    EURASIP Journal on Advances in Signal Processing, 2012
  • [24] Multi-pitch estimation based on partial event and support transfer
    Duan, Zhiyao
    Zhang, Dan
    Zhang, Changshui
    Shi, Zhenwei
    2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 216 - 219
  • [25] Evolving a Multi-Classifier System for Multi-Pitch Estimation of Piano Music and Beyond: An Application of Cartesian Genetic Programming
    Miragaia, Rolando
    Fernandez, Francisco
    Reis, Gustavo
    Inacio, Tiago
    APPLIED SCIENCES-BASEL, 2021, 11 (07):
  • [26] JOINT MULTI-PITCH DETECTION AND SCORE TRANSCRIPTION FOR POLYPHONIC PIANO MUSIC
    Liu, Lele
    Morfi, Veronica
    Benetos, Emmanouil
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 281 - 285
  • [27] Multiple comb filters and autocorrelation of the multi-scale product for multi-pitch estimation
    Zeremdini, Jihen
    Ben Messaoud, Mohamed Anouar
    Bouzid, Aicha
    APPLIED ACOUSTICS, 2017, 120 : 45 - 53
  • [28] MULTI-PITCH STREAMING OF INTERWOVEN STREAMS
    Kuan, Chih-Yi
    Su, Li
    Chin, Yu-Hao
    Wang, Jia-Ching
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 311 - 315
  • [29] MULTI-PITCH ESTIMATION AND TRACKING USING BAYESIAN INFERENCE IN BLOCK SPARSITY
    Karimian-Azari, Sam
    Jakobsson, Andreas
    Jensen, Jesper R.
    Christensen, Mads G.
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 16 - 20
  • [30] The multi-band, multi-pitch antenna
    Ooi, S
    Bit-Babik, G
    PROCEEDINGS OF VIITH INTERNATIONAL SEMINAR/WORKSHOP ON DIRECT AND INVERSE PROBLEMS OF ELECTROMAGNETIC AND ACOUSTIC WAVE THEORY, 2002, : 65 - 69