2-D Processing of Speech for Multi-Pitch Analysis

被引:0
|
作者
Wang, Tianyu T. [1 ]
Quatieri, Thomas F. [1 ]
机构
[1] MIT Lincoln Lab, Lincoln, NE USA
关键词
2-D speech processing; Grating Compression Transform; multi-pitch analysis; segmental pitch dynamics;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces a two-dimensional (2-D) processing approach for the analysis of multi-pitch speech sounds. Our framework invokes the short-space 2-D Fourier transform magnitude of a narrowband spectrogram, mapping harmonically-related signal components to multiple concentrated entities in a new 2-D space. First, localized time-frequency regions of the spectrogram are analyzed to extract pitch candidates. These candidates are then combined across multiple regions for obtaining separate pitch estimates of each speech-signal component at a single point in time. We refer to this as multi-region analysis (MRA). By explicitly accounting for pitch dynamics within localized time segments, this separability is distinct from that which can be obtained using short-time autocorrelation methods typically employed in state-of-the-art multi-pitch tracking algorithms. We illustrate the feasibility of MRA for multi-pitch estimation on mixtures of synthetic and real speech.
引用
收藏
页码:2795 / 2798
页数:4
相关论文
共 50 条
  • [1] Multi-Pitch Estimation by a Joint 2-D Representation of Pitch and Pitch Dynamics
    Wang, Tianyu T.
    Quatieri, Thomas F.
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 645 - 648
  • [2] A multi-pitch tracking algorithm for noisy speech
    Wu, MY
    Wang, DL
    Brown, GJ
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 369 - 372
  • [3] Multi-pitch estimation
    Christensen, Mads Graesboll
    Stoica, Petre
    Jakobsson, Andreas
    Jensen, Soren Holdt
    SIGNAL PROCESSING, 2008, 88 (04) : 972 - 983
  • [4] An Algorithm for Multi-Pitch Tracking in Co-Channel Speech
    Vishnubhotla, Srikanth
    Espy-Wilson, Carol
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 143 - +
  • [5] SPECTRAL MULTI-SCALE ANALYSIS FOR MULTI-PITCH TRACKING
    Ben Messaoud, Mohamed Anouar
    Bouzid, Aicha
    Ellouze, Noureddine
    2009 IEEE 13TH DIGITAL SIGNAL PROCESSING WORKSHOP & 5TH IEEE PROCESSING EDUCATION WORKSHOP, VOLS 1 AND 2, PROCEEDINGS, 2009, : 26 - 31
  • [6] VISUALLY INFORMED MULTI-PITCH ANALYSIS OF STRING ENSEMBLES
    Dinesh, Karthik
    Li, Bochen
    Liu, Xinzhao
    Duan, Zhiyao
    Sharma, Gaurav
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 3021 - 3025
  • [7] MULTI-PITCH STREAMING OF INTERWOVEN STREAMS
    Kuan, Chih-Yi
    Su, Li
    Chin, Yu-Hao
    Wang, Jia-Ching
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 311 - 315
  • [8] MPTRACKER: A NEW MULTI-PITCH DETECTION AND SEPARATION ALGORITHM FOR MIXED SPEECH SIGNALS
    Radfar, M. H.
    Dansereau, R. M.
    Chan, W. -Y.
    Wong, W.
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4468 - 4471
  • [9] The multi-band, multi-pitch antenna
    Ooi, S
    Bit-Babik, G
    PROCEEDINGS OF VIITH INTERNATIONAL SEMINAR/WORKSHOP ON DIRECT AND INVERSE PROBLEMS OF ELECTROMAGNETIC AND ACOUSTIC WAVE THEORY, 2002, : 65 - 69
  • [10] MULTI-PITCH ESTIMATION OF INHARMONIC SIGNALS
    Nilsson, Tommy
    Adalbjornsson, Stefan I.
    Butt, Naveed R.
    Jakobsson, Andreas
    2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,