Deep Neural Network for Multi-Pitch Estimation Using Weighted Cross Entropy Loss

被引:1
|
作者
Stone, Samuel [1 ]
Spector, Evan [1 ]
机构
[1] SRC Inc, North Syracuse, NY 13212 USA
关键词
Frequency Estimation; Machine Learning; Harmonic Analysis;
D O I
10.1109/WNYISPW53194.2021.9661285
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Multi-Pitch Estimation, the estimation of multiple overlapping or polyphonic harmonic fundamental frequencies, has a wide range of applications, including automatic music transcription, power systems, and radar signal processing. Multiple fundamental frequencies represent a challenge due to the added complexity of the overlapping signals. This paper presents a Deep Learning approach to estimating multiple fundamental frequencies. The network is trained in a supervised fashion to generate a pseudospectrum representing the fundamental frequencies. Training data is represented by a sparse binary vector the size of the pseudospectrum, indicating the location of fundamental frequencies. A weighted binary cross-entropy loss function is used to correct for class imbalance caused by the sparsity of the signal space relative to the full spectrum. We show comparable performance to existing techniques while requiring fewer operations and samples due to a simpler frequency-domain-only architecture.
引用
收藏
页数:3
相关论文
共 50 条
  • [1] Multi-pitch estimation
    Christensen, Mads Graesboll
    Stoica, Petre
    Jakobsson, Andreas
    Jensen, Soren Holdt
    SIGNAL PROCESSING, 2008, 88 (04) : 972 - 983
  • [2] MULTI-PITCH ESTIMATION USING SEMIDEFINITE PROGRAMMING
    Jensen, Tobias Lindstrom
    Vandenberghe, Lieven
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 4192 - 4196
  • [3] Multi-pitch estimation using harmonic music
    Christensen, Mads Graesboll
    Jakobsson, Andreas
    Jensen, Soren Holdt
    2006 FORTIETH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, VOLS 1-5, 2006, : 521 - +
  • [4] MULTI-PITCH ESTIMATION OF INHARMONIC SIGNALS
    Nilsson, Tommy
    Adalbjornsson, Stefan I.
    Butt, Naveed R.
    Jakobsson, Andreas
    2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,
  • [5] JOINT DOA AND MULTI-PITCH ESTIMATION USING BLOCK SPARSITY
    Kronvall, Ted
    Adalbjornsson, Stefan Ingi
    Jakobsson, Andreas
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [6] Using multi-scale product spectrum for single and multi-pitch estimation
    Messaoud, M. A. B.
    Bouzid, A.
    Ellouze, N.
    IET SIGNAL PROCESSING, 2011, 5 (03) : 344 - 355
  • [7] AN ADAPTIVE PENALTY APPROACH TO MULTI-PITCH ESTIMATION
    Kronvall, Ted
    Elvander, Filip
    Adalbjornsson, Stefan Ingi
    Jakobsson, Andreas
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 31 - 35
  • [8] Multi-pitch estimation for polyphonic musical signals
    Fernandez-Cid, P
    Casajus-Quiros, FJ
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 3565 - 3568
  • [9] Multi-pitch estimation exploiting block sparsity
    Adalbjornsson, Stefan I.
    Jakobsson, Andreas
    Christensen, Mads G.
    SIGNAL PROCESSING, 2015, 109 : 236 - 247
  • [10] Comparing Deep Models and Evaluation Strategies for Multi-Pitch Estimation in Music Recordings
    Weiss, Christof
    Peeters, Geoffroy
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 2814 - 2827