Long-term flexible 2D cepstral modeling of speech spectral amplitudes

被引:1
|
作者
Firouzmand, Mohammad [1 ]
Girin, Laurent [1 ]
机构
[1] INPG, Grenoble Lab Images Speech Signal & Automat, Grenoble, France
来源
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12 | 2008年
关键词
speech analysis; speech processing; speech coding; speech modeling; speech synthesis;
D O I
10.1109/ICASSP.2008.4518515
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents a method for modeling the envelope of spectral amplitude parameters of speech signals in "two dimensions" (2D). It consists of two cascaded modelings: the first one along the frequency axis is the usual cepstrum technique, which consists of modeling the log-scaled spectral envelope with a Discrete Cosine Model (DCM). The second one, along the time axis, consists of modeling the trajectory of the envelope DCM coefficients by another similar DCM model. An iterative algorithm is proposed to optimally fit this 2D-model to the data according to a perceptual criterion based on frequency masking. This approach is shown to provide an efficient and flexible representation of spectral amplitude parameters in terms of coefficient rates, while providing good signal quality, opening new perspectives in very-low bit-rate sinusoidal speech coding.
引用
收藏
页码:3937 / 3940
页数:4
相关论文
共 50 条
  • [31] Short- and Long-Term Relationship Orientation and 2D:4D Finger-Length Ratio
    Schwarz, Sascha
    Mustafic, Maida
    Hassebrauck, Manfred
    Joerg, Johannes
    ARCHIVES OF SEXUAL BEHAVIOR, 2011, 40 (03) : 565 - 574
  • [32] A temporal warped 2D psychoacoustic modeling for robust speech recognition system
    Dai, Peng
    Soon, Ing Yann
    SPEECH COMMUNICATION, 2011, 53 (02) : 229 - 241
  • [33] Short- and Long-Term Relationship Orientation and 2D:4D Finger-Length Ratio
    Sascha Schwarz
    Maida Mustafić
    Manfred Hassebrauck
    Johannes Jörg
    Archives of Sexual Behavior, 2011, 40 : 565 - 574
  • [34] Self-consistent long-term dynamics of space charge driven resonances in 2D and 3D
    Hofmann, Ingo
    Oeftiger, Adrian
    Boine-Frankenheim, Oliver
    PHYSICAL REVIEW ACCELERATORS AND BEAMS, 2021, 24 (02)
  • [35] Long-term stable and catalytic 2D MXene nanosheets wrapped with dialdehyde xylan for ultrafast gelation
    Li, Nan
    Shao, Lupeng
    Xia, Qiang
    Tan, Shujun
    Zhao, Shuwen
    Li, XuPeng
    Su, Zhenhua
    Hao, Xiang
    Peng, Feng
    GREEN CHEMISTRY, 2023, 25 (11) : 4309 - 4318
  • [36] Efficient 2D LIDAR-Based Map Updating For Long-Term Operations in Dynamic Environments
    Stefanini, Elisa
    Ciancolini, Enrico
    Settimi, Alessandro
    Pallottino, Lucia
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 832 - 839
  • [37] 2D Variations in Coda Amplitudes in the Middle East
    Pasyanos, Michael E.
    Gok, Rengin
    Walter, William R.
    BULLETIN OF THE SEISMOLOGICAL SOCIETY OF AMERICA, 2016, 106 (05) : 1915 - 1925
  • [38] SHORT AND LONG-TERM PROGNOSTIC VALUE OF PREOPERATIVE 2D ECHOCARDIOGRAPHY PRIOR TO VASCULAR-SURGERY
    WHOLEY, RM
    AURIGEMMA, GP
    DAHLBERG, ST
    PENNIMAN, CM
    LEPPO, JA
    CIRCULATION, 1992, 86 (04) : 577 - 577
  • [39] Enhanced performance and long-term stability of 2D photodetectors through hexagonal boron nitride encapsulation
    Zhao, Huijuan
    Zhou, Qiyuan
    Wang, Yufan
    Wang, Jiaxuan
    Ding, Huanlin
    Li, Shuhan
    Guo, Xiaohan
    Wang, Weiqi
    Gao, Li
    APPLIED PHYSICS LETTERS, 2025, 126 (02)
  • [40] Speech Pathologists in Long-Term Care FOREWORD
    Holland, Audrey L.
    SEMINARS IN SPEECH AND LANGUAGE, 2013, 34 (01) : 1 - 1