PITCH MODIFICATIONS OF SPEECH BASED ON AN ADAPTIVE HARMONIC MODEL

被引:0
|
作者
Kafentzis, George P. [1 ,2 ]
Degottex, Gilles [2 ]
Rosec, Olivier [3 ]
Stylianou, Yannis [2 ]
机构
[1] TECH ACTS MAS, Orange Labs, Lannion, France
[2] Univ Crete, Comp Sci Dept, Multimedia Informat Lab, Rethimnon, Greece
[3] Voxygen SA, Pole Phoenix, Pleumeur, France
关键词
Pitch modification; Speech analysis; Adaptive quasi-harmonic model; Adaptive harmonic model; TRANSFORMATION; DECOMPOSITION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, a simple method for pitch-scale modifications of speech based on a recently suggested model for AM-FM decomposition of speech signals, is presented. This model is referred to as the adaptive Harmonic Model (aHM). The aHM models speech as a sum of harmonically related sinusoids that can adapt to the local characteristics of the signal. It was shown that this model provides high quality reconstruction of speech and thus, it can also provide high quality pitch-scale modifications. For the latter, the amplitude envelope is estimated using the Discrete All-Pole (DAP) method, and the phase envelope estimation is performed by utilizing the concept of relative phase. Formal listening tests on a database of several languages show that the synthetic pitch-scaled waveforms are natural and free of some common artefacts encountered in other state-of-the-art models, such as HNM and STRAIGHT.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Pitch detection algorithm of overlapping speech based on the energy of pitch and its harmonic
    Zhao Jun
    Pan Yong-xiang
    Proceedings of 2005 Chinese Control and Decision Conference, Vols 1 and 2, 2005, : 1439 - 1442
  • [2] TIME-SCALE MODIFICATIONS BASED ON A FULL-BAND ADAPTIVE HARMONIC MODEL
    Kafentzis, George P.
    Degottex, Gilles
    Rosec, Olivier
    Stylianou, Yannis
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8193 - 8197
  • [3] A Method for Pitch Estimation from Noisy Speech Signals Based on a Pitch-Harmonic Extraction
    Shahnaz, C.
    Zhu, W. -P.
    Ahmad, M. O.
    2008 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS AND SIGNAL PROCESSING, VOLS 1 AND 2, 2007, : 120 - 123
  • [4] Pitch Delay Based Adaptive Steganography for AMR Speech Stream
    Gong, Chen
    Yi, Xiaowei
    Zhao, Xianfeng
    DIGITAL FORENSICS AND WATERMARKING, IWDW 2018, 2019, 11378 : 275 - 289
  • [5] ANALYSIS/SYNTHESIS OF SPEECH BASED ON AN ADAPTIVE QUASI-HARMONIC PLUS NOISE MODEL
    Pantazis, Yannis
    Tzedakis, Georgios
    Rosec, Olivier
    Stylianou, Yannis
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4246 - 4249
  • [6] Robust Bayesian Pitch Tracking Based on the Harmonic Model
    Shi, Liming
    Nielsen, Jesper Kjaer
    Jensen, Jesper Rindom
    Little, Max A.
    Christensen, Mads Graesboll
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (11) : 1737 - 1751
  • [7] Speech Analysis and Synthesis with a Computationally Efficient Adaptive Harmonic Model
    Morfi, Veronica
    Degottex, Gilles
    Mouchtaris, Athanasios
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (11) : 1950 - 1962
  • [8] Speech emotion recognition based on statistical pitch model
    WANG Zhiping ZHAO Li ZOU Cairong (Department of Radio Engineering
    Chinese Journal of Acoustics, 2006, (01) : 87 - 96
  • [9] Enhancement of harmonic content of speech based on a dynamic programming pitch tracking algorithm
    Every, Mark R.
    Jackson, Philip J. B.
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 81 - 84
  • [10] PITCH TRACKING FOR MODEL-BASED SPEECH SEPARATION
    Lee, S. W.
    Soong, Frank K.
    Ching, P. C.
    Lee, Tan
    2008 6TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2008, : 145 - 148