2-D Processing of Speech for Multi-Pitch Analysis

Cited by: 0
Authors
Wang, Tianyu T. [1]
Quatieri, Thomas F. [1]
Affiliations
[1] MIT Lincoln Laboratory, Lexington, MA, USA
Keywords
2-D speech processing; Grating Compression Transform; multi-pitch analysis; segmental pitch dynamics
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
This paper introduces a two-dimensional (2-D) processing approach for the analysis of multi-pitch speech sounds. Our framework invokes the short-space 2-D Fourier transform magnitude of a narrowband spectrogram, mapping harmonically-related signal components to multiple concentrated entities in a new 2-D space. First, localized time-frequency regions of the spectrogram are analyzed to extract pitch candidates. These candidates are then combined across multiple regions for obtaining separate pitch estimates of each speech-signal component at a single point in time. We refer to this as multi-region analysis (MRA). By explicitly accounting for pitch dynamics within localized time segments, this separability is distinct from that which can be obtained using short-time autocorrelation methods typically employed in state-of-the-art multi-pitch tracking algorithms. We illustrate the feasibility of MRA for multi-pitch estimation on mixtures of synthetic and real speech.
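The core idea in the abstract, that harmonic structure in a localized spectrogram region maps to concentrated peaks under a 2-D Fourier transform, can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a toy model in which each constant-pitch source contributes a sinusoidal ripple along the frequency axis with period equal to its pitch, and the names `synthetic_patch` and `pitch_candidates`, the patch size, and the 10 Hz bin spacing are all hypothetical. It covers only the single-region candidate-extraction step; combining candidates across regions (MRA) is not shown.

```python
import numpy as np

def synthetic_patch(pitches_hz, n_bins=128, n_frames=32, bin_hz=10.0):
    """Toy stand-in for a localized narrowband-spectrogram region.

    Assumption: each source adds a ripple along frequency whose period
    equals its pitch, and the pitch is constant across the frames.
    """
    freqs_hz = np.arange(n_bins) * bin_hz
    patch = np.zeros((n_bins, n_frames))
    for f0 in pitches_hz:
        ripple = 1.0 + np.cos(2.0 * np.pi * freqs_hz / f0)
        patch += ripple[:, np.newaxis]  # same ripple in every frame
    return patch

def pitch_candidates(patch, bin_hz, n_candidates):
    """Convert the strongest 2-D Fourier magnitude peaks to pitch values in Hz."""
    n_bins = patch.shape[0]
    # Remove the mean so the DC term does not dominate the magnitude.
    mag = np.abs(np.fft.fft2(patch - patch.mean()))
    # Real input gives a symmetric spectrum, so search one half-plane only
    # (rows 1 .. n_bins/2 - 1, all time-modulation columns).
    half = mag[1 : n_bins // 2, :]
    # Simplification: take the largest values directly. This works here only
    # because the toy pitches fall on exact DFT bins (no leakage).
    strongest = np.argsort(half, axis=None)[::-1][:n_candidates]
    rows, _ = np.unravel_index(strongest, half.shape)
    ripple_cycles = rows + 1  # undo the row offset from slicing
    # A ripple with k cycles over the patch bandwidth has period bandwidth / k.
    return sorted(float(v) for v in n_bins * bin_hz / ripple_cycles)

patch = synthetic_patch([160.0, 256.0])
estimates = pitch_candidates(patch, bin_hz=10.0, n_candidates=2)
print(estimates)
```

In this toy setting the two sources separate because their ripples land at different distances from the origin of the transformed space. With real speech, the patch would come from an actual spectrogram, would typically be windowed, and would need proper peak picking; changing pitch tilts the harmonic lines and moves the peaks off the zero time-modulation column.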
Pages: 2795 - 2798
Page count: 4