Finite Mixture Spectrogram Modeling for Multipitch Tracking Using A Factorial Hidden Markov Model

被引:0
|
作者
Wohlmayr, Michael [1 ]
Pernkopf, Franz [1 ]
机构
[1] Graz Univ Technol, Signal Proc & Speech Commun Lab, A-8010 Graz, Austria
来源
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 | 2009年
关键词
Factorial hidden Markov model; pitch estimation; multipitch tracking; minimum description length; ALGORITHM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a simple and efficient feature modeling approach for tracking the pitch of two speakers speaking simultaneously. We model the spectrogram features using Gaussian Mixture Models (GMMs) in combination with the Minimum Description Length (MDL) model selection criterion. This enables to automatically determine the number of Gaussian components depending on the available data for a specific pitch pair. A factorial hidden Markov model (FHMM) is applied for tracking. We compare our approach to two methods based on correlogram features [1]. Those methods either uses HMM [1] or a FHMM [7] for tracking. Experimental results on the Mocha-TIMIT database [2] show that our proposed approach significantly outperforms the correlogram-based methods for speech utterances mixed at 0dB. The superior performance even holds when adding white Gaussian noise to the mixed speech utterances during pitch tracking.
引用
收藏
页码:1103 / 1106
页数:4
相关论文
共 50 条
  • [31] A Coupled Factorial Hidden Markov Model (CFHMM) for Diagnosing Coupled Faults
    Kodali, Anuradha
    Pattipati, Krishna
    Singh, Satnam
    2010 IEEE AEROSPACE CONFERENCE PROCEEDINGS, 2010,
  • [32] Indoor Localization and Tracking using Posterior State Distribution of Hidden Markov Model
    El-Khoribi, Reda A.
    Hamza, Haitham S.
    Hammad, M. A.
    2013 8TH INTERNATIONAL ICST CONFERENCE ON COMMUNICATIONS AND NETWORKING IN CHINA (CHINACOM), 2013, : 557 - 562
  • [33] A Factorial Hidden Markov Model for the Analysis of Temporal Change in Choice Models
    Amirali Kani
    Wayne S. DeSarbo
    Duncan K. H. Fong
    Customer Needs and Solutions, 2018, 5 (3-4) : 162 - 177
  • [34] DOMINANT COMPONENT TRACKING FOR EMPIRICAL MODE DECOMPOSITION USING A HIDDEN MARKOV MODEL
    Sandoval, Steven
    Bredin, Matthew
    De Leon, Phillip L.
    2018 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2018), 2018, : 116 - 120
  • [35] Modeling Dinophysis in Western Andalucía using an autoregressive hidden Markov model
    Jordan Aron
    Paul S. Albert
    Matthew O. Gribble
    Environmental and Ecological Statistics, 2022, 29 : 557 - 585
  • [36] Modeling Dinophysis in Western Andalucia using a autoregressive hidden Markov model
    Aron, Jordan
    Albert, Paul S.
    Gribble, Matthew O.
    ENVIRONMENTAL AND ECOLOGICAL STATISTICS, 2022, 29 (03) : 557 - 585
  • [37] Phoneme Modeling for Speech Recognition in Kannada Using Hidden Markov Model
    Kannadaguli, Prashanth
    Thalengala, Ananthakrishna
    2015 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, INFORMATICS, COMMUNICATION AND ENERGY SYSTEMS (SPICES), 2015,
  • [38] Music analysis using hidden Markov mixture models
    Qi, Yuting
    Paisley, John William.
    Carin, Lawrence
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2007, 55 (11) : 5209 - 5224
  • [39] Modeling complex motion by tracking and editing hidden Markov graphs
    Wang, YZ
    Zhu, SC
    PROCEEDINGS OF THE 2004 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, 2004, : 856 - 863
  • [40] Modeling electricity markets with hidden Markov model
    Yu, W
    Sheblé, GB
    ELECTRIC POWER SYSTEMS RESEARCH, 2006, 76 (6-7) : 445 - 451