2-D Processing of Speech for Multi-Pitch Analysis

被引:0
|
作者
Wang, Tianyu T. [1 ]
Quatieri, Thomas F. [1 ]
机构
[1] MIT Lincoln Lab, Lincoln, NE USA
关键词
2-D speech processing; Grating Compression Transform; multi-pitch analysis; segmental pitch dynamics;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces a two-dimensional (2-D) processing approach for the analysis of multi-pitch speech sounds. Our framework invokes the short-space 2-D Fourier transform magnitude of a narrowband spectrogram, mapping harmonically-related signal components to multiple concentrated entities in a new 2-D space. First, localized time-frequency regions of the spectrogram are analyzed to extract pitch candidates. These candidates are then combined across multiple regions for obtaining separate pitch estimates of each speech-signal component at a single point in time. We refer to this as multi-region analysis (MRA). By explicitly accounting for pitch dynamics within localized time segments, this separability is distinct from that which can be obtained using short-time autocorrelation methods typically employed in state-of-the-art multi-pitch tracking algorithms. We illustrate the feasibility of MRA for multi-pitch estimation on mixtures of synthetic and real speech.
引用
收藏
页码:2795 / 2798
页数:4
相关论文
共 50 条
  • [41] Pitch processing in music and speech
    Tillmann, Barbara, 1600, Australian Acoustical Society, Singapore (42):
  • [42] Multi-pitch estimation based on multi-scale product analysis, improved comb filter and dynamic programming
    Zeremdini J.
    Messaoud M.A.B.
    Bouzid A.
    International Journal of Speech Technology, 2017, 20 (02) : 225 - 237
  • [43] SONG-LEVEL MULTI-PITCH TRACKING BY HEAVILY CONSTRAINED CLUSTERING
    Duan, Zhiyao
    Han, Jinyu
    Pardo, Bryan
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 57 - 60
  • [44] MULTI-PITCH ESTIMATION AND TRACKING USING BAYESIAN INFERENCE IN BLOCK SPARSITY
    Karimian-Azari, Sam
    Jakobsson, Andreas
    Jensen, Jesper R.
    Christensen, Mads G.
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 16 - 20
  • [45] JOINT MULTI-PITCH DETECTION AND SCORE TRANSCRIPTION FOR POLYPHONIC PIANO MUSIC
    Liu, Lele
    Morfi, Veronica
    Benetos, Emmanouil
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 281 - 285
  • [46] Multi-Pitch Liquid Crystal Filters with Single Layer Polymer Template
    Zhu, Zhikang
    Gao, Yao
    Lu, Jiangang
    POLYMERS, 2021, 13 (15)
  • [47] PERMUTATION INVARIANT TRAINING FOR SPEAKER-INDEPENDENT MULTI-PITCH TRACKING
    Liu, Yuzhou
    Wang, DeLiang
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5594 - 5598
  • [48] Multi-Pitch Estimation using NHF with Multi-Dictionary Distinguishing Attack and Reverberation of Sounds
    Fujisawa, Takanori
    Harada, Sora
    Ikehara, Masaaki
    CONFERENCE RECORD OF THE 2019 FIFTY-THIRD ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2019, : 1836 - 1841
  • [49] Neural Coincidence Detection Strategies during Perception of Multi-Pitch Musical Tones
    Bader, Rolf
    APPLIED SCIENCES-BASEL, 2024, 14 (17):
  • [50] Comparing Deep Models and Evaluation Strategies for Multi-Pitch Estimation in Music Recordings
    Weiss, Christof
    Peeters, Geoffroy
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 2814 - 2827