Analysis and Synthesis of Speech Using an Adaptive Full-Band Harmonic Model

被引:35
|
作者
Degottex, Gilles [1 ,2 ]
Stylianou, Yannis [1 ,2 ]
机构
[1] Univ Crete, Dept Comp Sci, GR-71003 Iraklion, Greece
[2] FORTH, Inst Comp Sci, GR-70013 Iraklion, Greece
基金
瑞士国家科学基金会;
关键词
Voice model; sinusoidal model; harmonic model; non-stationary;
D O I
10.1109/TASL.2013.2266772
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Voice models often use frequency limits to split the speech spectrum into two or more voiced/unvoiced frequency bands. However, from the voice production, the amplitude spectrum of the voiced source decreases smoothly without any abrupt frequency limit. Accordingly, multiband models struggle to estimate these limits and, as a consequence, artifacts can degrade the perceived quality. Using a linear frequency basis adapted to the non-stationarities of the speech signal, the Fan Chirp Transformation (FChT) have demonstrated harmonicity at frequencies higher than usually observed from the DFT which motivates a full-band modeling. The previously proposed Adaptive Quasi-Harmonic model (aQHM) offers even more flexibility than the FChT by using a non-linear frequency basis. In the current paper, exploiting the properties of aQHM, we describe a full-band Adaptive Harmonic Model (aHM) along with detailed descriptions of its corresponding algorithms for the estimation of harmonics up to the Nyquist frequency. Formal listening tests show that the speech reconstructed using aHM is nearly indistinguishable from the original speech. Experiments with synthetic signals also show that the proposed aHM globally outperforms previous sinusoidal and harmonic models in terms of precision in estimating the sinusoidal parameters. As a perspective, such a precision is interesting for building higher level models upon the sinusoidal parameters, like spectral envelopes for speech synthesis.
引用
收藏
页码:2085 / 2095
页数:11
相关论文
共 50 条
  • [41] PITCH MODIFICATIONS OF SPEECH BASED ON AN ADAPTIVE HARMONIC MODEL
    Kafentzis, George P.
    Degottex, Gilles
    Rosec, Olivier
    Stylianou, Yannis
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [42] A full-band Monte Carlo model for the temperature dependence of electron and hole transport in silicon
    Fischer, B
    Hofmann, KR
    APPLIED PHYSICS LETTERS, 2000, 76 (05) : 583 - 585
  • [43] Analysis of saturation velocity and energy relaxation time of electrons in Si using full-band Monte Carlo simulation
    Yoshihara, Koki
    Hiroki, Akira
    ELECTRICAL ENGINEERING IN JAPAN, 2020, 211 (1-4) : 40 - 46
  • [44] Design of Second Order Digital FIR Full-Band Differentiators Using Weighting Coefficients
    Lekic, Predrag N.
    Micic, Aca D.
    Ristic, Jovan D.
    Lekic, Julijana B.
    IETE JOURNAL OF RESEARCH, 2010, 56 (01) : 22 - 29
  • [45] Full-Band EEG Recordings Using Hybrid AC/DC-Divider Filters
    Nasretdinov, Azat
    Evstifeev, Alexander
    Vinokurova, Daria
    Burkhanova-Zakirova, Gulshat
    Chernova, Kseniya
    Churina, Zoya
    Khazipov, Roustem
    ENEURO, 2021, 8 (04)
  • [46] Hybrid particle-based full-band analysis of ultra-small MOS
    Wigger, SJ
    Goodnick, SM
    Saraniti, M
    VLSI DESIGN, 2001, 13 (1-4) : 125 - +
  • [47] A compressed sensing technique for OFDM channel estimation using full-band training symbols
    Lee, Huang-Chang
    Chen, Pin-Yuan
    Gong, Cihun-Siyong Alex
    2017 NINTH INTERNATIONAL CONFERENCE ON UBIQUITOUS AND FUTURE NETWORKS (ICUFN 2017), 2017, : 562 - 564
  • [48] Geometric deep learning with adaptive full-band spatial diffusion for accurate, efficient, and robust cortical parcellation
    Zhu, Yuanzhuo
    Li, Xianjun
    Niu, Chen
    Wang, Fan
    Ma, Jianhua
    MEDICAL IMAGE ANALYSIS, 2025, 101
  • [49] DEEPFILTERNET2: TOWARDS REAL-TIME SPEECH ENHANCEMENT ON EMBEDDED DEVICES FOR FULL-BAND AUDIO
    Schroeter, H.
    Maier, A.
    Escalante-B, A. N.
    Rosenkranz, T.
    2022 INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC 2022), 2022,
  • [50] Time-domain full-band method using orthogonal edge basis functions
    Gonçalves, MS
    Hernandez-Figueroa, HE
    Bordonalli, AC
    IEEE PHOTONICS TECHNOLOGY LETTERS, 2006, 18 (1-4) : 52 - 54