2-D Processing of Speech for Multi-Pitch Analysis

被引:0
|
作者
Wang, Tianyu T. [1 ]
Quatieri, Thomas F. [1 ]
机构
[1] MIT Lincoln Lab, Lincoln, NE USA
关键词
2-D speech processing; Grating Compression Transform; multi-pitch analysis; segmental pitch dynamics;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces a two-dimensional (2-D) processing approach for the analysis of multi-pitch speech sounds. Our framework invokes the short-space 2-D Fourier transform magnitude of a narrowband spectrogram, mapping harmonically-related signal components to multiple concentrated entities in a new 2-D space. First, localized time-frequency regions of the spectrogram are analyzed to extract pitch candidates. These candidates are then combined across multiple regions for obtaining separate pitch estimates of each speech-signal component at a single point in time. We refer to this as multi-region analysis (MRA). By explicitly accounting for pitch dynamics within localized time segments, this separability is distinct from that which can be obtained using short-time autocorrelation methods typically employed in state-of-the-art multi-pitch tracking algorithms. We illustrate the feasibility of MRA for multi-pitch estimation on mixtures of synthetic and real speech.
引用
收藏
页码:2795 / 2798
页数:4
相关论文
共 50 条
  • [21] Evaluation of Zero Frequency Filtering based Method for Multi-pitch Streaming of Concurrent Speech Signals
    Mansali, Mariem Bouafif
    Backstrom, Tom
    Lachiri, Zied
    28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 286 - 290
  • [22] RNN-BLSTM Based Multi-Pitch Estimation
    Zhang, Jianshu
    Tang, Jian
    Dai, Li-Rang
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1785 - 1789
  • [23] Co-channel speaker identification using usable speech extraction based on multi-pitch tracking
    Shao, Y
    Wang, DL
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SPEECH II; INDUSTRY TECHNOLOGY TRACKS; DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS; NEURAL NETWORKS FOR SIGNAL PROCESSING, 2003, : 205 - 208
  • [24] The Harmonic Shift Algorithm for Efficient Multi-Pitch Detection
    Grinewitschus, Lukas
    Jung, Peter
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 548 - 561
  • [25] The multi-pitch estimation problem: Some new solutions
    Christensen, Mads Graeesboll
    Stoica, Petre
    Jakobsson, Andreas
    Jensen, Soren Holdt
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PTS 1-3, PROCEEDINGS, 2007, : 1221 - +
  • [26] A Music Cognition–Guided Framework for Multi-pitch Estimation
    Xiaoquan Li
    Yijun Yan
    John Soraghan
    Zheng Wang
    Jinchang Ren
    Cognitive Computation, 2023, 15 : 23 - 35
  • [27] MULTI-PITCH ESTIMATION VIA FAST GROUP SPARSE LEARNING
    Kronvall, Ted
    Elvander, Filip
    Adalbjornsson, Stefan Ingi
    Jakobsson, Andreas
    2016 24TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2016, : 1093 - 1097
  • [28] Using multi-scale product spectrum for single and multi-pitch estimation
    Messaoud, M. A. B.
    Bouzid, A.
    Ellouze, N.
    IET SIGNAL PROCESSING, 2011, 5 (03) : 344 - 355
  • [29] An iterative subspace-based multi-pitch estimation algorithm
    Zhang, Johan Xi
    Christensen, Mads Graesboll
    Jensen, Soren Holdt
    Moonen, Marc
    SIGNAL PROCESSING, 2011, 91 (01) : 150 - 154
  • [30] JOINT DOA AND MULTI-PITCH ESTIMATION USING BLOCK SPARSITY
    Kronvall, Ted
    Adalbjornsson, Stefan Ingi
    Jakobsson, Andreas
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,