Finite Mixture Spectrogram Modeling for Multipitch Tracking Using A Factorial Hidden Markov Model

被引:0
|
作者
Wohlmayr, Michael [1 ]
Pernkopf, Franz [1 ]
机构
[1] Graz Univ Technol, Signal Proc & Speech Commun Lab, A-8010 Graz, Austria
关键词
Factorial hidden Markov model; pitch estimation; multipitch tracking; minimum description length; ALGORITHM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a simple and efficient feature modeling approach for tracking the pitch of two speakers speaking simultaneously. We model the spectrogram features using Gaussian Mixture Models (GMMs) in combination with the Minimum Description Length (MDL) model selection criterion. This enables to automatically determine the number of Gaussian components depending on the available data for a specific pitch pair. A factorial hidden Markov model (FHMM) is applied for tracking. We compare our approach to two methods based on correlogram features [1]. Those methods either uses HMM [1] or a FHMM [7] for tracking. Experimental results on the Mocha-TIMIT database [2] show that our proposed approach significantly outperforms the correlogram-based methods for speech utterances mixed at 0dB. The superior performance even holds when adding white Gaussian noise to the mixed speech utterances during pitch tracking.
引用
收藏
页码:1103 / 1106
页数:4
相关论文
共 50 条
  • [1] Multipitch Tracking Using A Factorial Hidden Markov Model
    Wohlmayr, Michael
    Pernkopf, Franz
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 147 - 150
  • [2] A MIXTURE MAXIMIZATION APPROACH TO MULTIPITCH TRACKING WITH FACTORIAL HIDDEN MARKOV MODELS
    Wohlmayr, M.
    Stark, M.
    Pernkopf, F.
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5070 - 5073
  • [3] A Probabilistic Interaction Model for Multipitch Tracking With Factorial Hidden Markov Models
    Wohlmayr, Michael
    Stark, Michael
    Pernkopf, Franz
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (04): : 799 - 810
  • [4] BAYESIAN NONPARAMETRIC SPECTROGRAM MODELING BASED ON INFINITE FACTORIAL INFINITE HIDDEN MARKOV MODEL
    Nakano, Masahiro
    Le Roux, Jonathan
    Kameoka, Hirokazu
    Nakamura, Tomohiko
    Ono, Nobutaka
    Sagayama, Shigeki
    2011 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2011, : 325 - 328
  • [5] MODEL ADAPTATION OF FACTORIAL HMMS FOR MULTIPITCH TRACKING
    Wohlmayr, M.
    Pernkopf, F.
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 6792 - 6796
  • [6] Visual tracking using interactive factorial hidden Markov models
    Paeng, Jin Wook
    Kwon, Junseok
    IET SIGNAL PROCESSING, 2021, 15 (06) : 365 - 374
  • [7] Non-intrusive Load Monitoring Using Factorial Hidden Markov Model Based on Gaussian Mixture Model
    Zhang, Lu
    Jing, Zhaoxia
    2020 IEEE POWER & ENERGY SOCIETY GENERAL MEETING (PESGM), 2020,
  • [8] The peak value tracking of time-frequency spectrogram based on Hidden Markov Model
    Liang, YQ
    Qin, SR
    Guo, Y
    PROCEEDINGS OF THE SECOND INTERNATIONAL SYMPOSIUM ON INSTRUMENTATION SCIENCE AND TECHNOLOGY, VOL 3, 2002, : 576 - 580
  • [9] Finite Mixture of the Hidden Markov Model for Driving Style Analysis
    Ding, Lusa
    Zhu, Ting
    Wang, Yanli
    Zou, Yajie
    JOURNAL OF ADVANCED TRANSPORTATION, 2022, 2022
  • [10] Finite mixture model of hidden Markov regression with covariate dependence
    Sarkar, Shuchismita
    Zhu, Xuwen
    STAT, 2022, 11 (01):