Finite Mixture Spectrogram Modeling for Multipitch Tracking Using A Factorial Hidden Markov Model

被引:0
|
作者
Wohlmayr, Michael [1 ]
Pernkopf, Franz [1 ]
机构
[1] Graz Univ Technol, Signal Proc & Speech Commun Lab, A-8010 Graz, Austria
来源
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 | 2009年
关键词
Factorial hidden Markov model; pitch estimation; multipitch tracking; minimum description length; ALGORITHM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a simple and efficient feature modeling approach for tracking the pitch of two speakers speaking simultaneously. We model the spectrogram features using Gaussian Mixture Models (GMMs) in combination with the Minimum Description Length (MDL) model selection criterion. This enables to automatically determine the number of Gaussian components depending on the available data for a specific pitch pair. A factorial hidden Markov model (FHMM) is applied for tracking. We compare our approach to two methods based on correlogram features [1]. Those methods either uses HMM [1] or a FHMM [7] for tracking. Experimental results on the Mocha-TIMIT database [2] show that our proposed approach significantly outperforms the correlogram-based methods for speech utterances mixed at 0dB. The superior performance even holds when adding white Gaussian noise to the mixed speech utterances during pitch tracking.
引用
收藏
页码:1103 / 1106
页数:4
相关论文
共 50 条
  • [41] Counting Single Molecules using Infinite Factorial Hidden Markov Models
    Bryan, Shep
    BIOPHYSICAL JOURNAL, 2020, 118 (03) : 614A - 614A
  • [42] Bearings Prognostic using Mixture of Gaussians Hidden Markov Model and Support Vector Machine
    Sloukia, F.
    El Aroussi, M.
    Medromi, H.
    Wahbi, M.
    2013 ACS INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2013,
  • [43] Missing motion data recovery using factorial hidden Markov models
    Lee, Dongheui
    Kulic, Dana
    Nakamura, Yoshihiko
    2008 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-9, 2008, : 1722 - 1728
  • [44] Spatio-temporal modeling based on Hidden Markov Model for Object Tracking in Satellite Imagery
    Essid, Houcine
    Ben Abbes, Ali
    Farah, Imed Riadh
    Barra, Vincent
    2012 6TH INTERNATIONAL CONFERENCE ON SCIENCES OF ELECTRONICS, TECHNOLOGIES OF INFORMATION AND TELECOMMUNICATIONS (SETIT), 2012, : 351 - 358
  • [45] Variable parameter Gaussian mixture hidden Markov modeling for speech recognition
    Cui, XD
    Gong, YF
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 12 - 15
  • [46] An ICA Mixture Hidden Markov Model for Video Content Analysis
    Zhou, Jian
    Zhang, Xiao-Ping
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2008, 18 (11) : 1576 - 1586
  • [47] Shape tracking and production using Hidden Markov Models
    Caelli, T
    McCabe, N
    Briscoe, G
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2001, 15 (01) : 197 - 221
  • [48] A random coefficients mixture hidden Markov model for marketing research
    Kappe, Eelco
    Blank, Ashley Stadler
    DeSarbo, Wayne S.
    INTERNATIONAL JOURNAL OF RESEARCH IN MARKETING, 2018, 35 (03) : 415 - 431
  • [49] A Mixture Hidden Markov Model to Mine Students' University Curricula
    Bacci, Silvia
    Bertaccini, Bruno
    DATA, 2022, 7 (02)
  • [50] Using hidden Markov modeling in DNA sequencing
    Nelson, Ruben
    Foo, Simon
    Weatherspoon, Mark
    PROCEEDINGS OF THE 40TH SOUTHEASTERN SYMPOSIUM ON SYSTEM THEORY, 2008, : 215 - 217