Long-term flexible 2D cepstral modeling of speech spectral amplitudes

Cited by: 1
Authors
Firouzmand, Mohammad [1]
Girin, Laurent [1]
Affiliations
[1] INPG, Grenoble Lab Images Speech Signal & Automat, Grenoble, France
Source
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12 | 2008
Keywords
speech analysis; speech processing; speech coding; speech modeling; speech synthesis;
DOI
10.1109/ICASSP.2008.4518515
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
This paper presents a method for modeling the envelope of speech spectral amplitude parameters in two dimensions (2D). The method cascades two models. The first, along the frequency axis, is the usual cepstrum technique: the log-scaled spectral envelope is represented by a Discrete Cosine Model (DCM). The second, along the time axis, represents the trajectories of the envelope DCM coefficients with another, similar DCM. An iterative algorithm is proposed to fit this 2D model to the data optimally according to a perceptual criterion based on frequency masking. The approach is shown to provide an efficient and flexible representation of spectral amplitude parameters in terms of coefficient rates while maintaining good signal quality, opening new perspectives for very-low-bit-rate sinusoidal speech coding.
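The cascaded structure described in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: it replaces the iterative, masking-based perceptual fit with plain least squares, assumes the log-amplitudes are sampled on a fixed frequency grid shared by all frames, and uses a cosine basis of the form cos(pi*k*x) on axes normalized to [0, 1]. The function names `dcm_basis`, `fit_2d_dcm` and `eval_2d_dcm` are hypothetical.

```python
import numpy as np

def dcm_basis(x, order):
    """Cosine basis cos(pi*k*x), k = 0..order, at points x in [0, 1].

    Assumed form of the discrete cosine model; the paper may use a
    different normalization.
    """
    x = np.asarray(x, dtype=float)
    k = np.arange(order + 1)
    return np.cos(np.pi * np.outer(x, k))  # shape (len(x), order + 1)

def fit_2d_dcm(log_amp, freqs, times, freq_order, time_order):
    """Two cascaded least-squares fits (a stand-in for the perceptual fit).

    log_amp: array (n_frames, n_freqs) of log-scaled spectral amplitudes.
    Returns coefficients of shape (time_order + 1, freq_order + 1).
    """
    basis_f = dcm_basis(freqs, freq_order)
    basis_t = dcm_basis(times, time_order)
    # Stage 1 (frequency axis): one cepstrum-like coefficient vector per frame.
    ceps, *_ = np.linalg.lstsq(basis_f, log_amp.T, rcond=None)
    # Stage 2 (time axis): model the trajectory of each coefficient.
    coefs, *_ = np.linalg.lstsq(basis_t, ceps.T, rcond=None)
    return coefs

def eval_2d_dcm(coefs, freqs, times):
    """Reconstruct the log-amplitude surface from the 2D coefficients."""
    time_order = coefs.shape[0] - 1
    freq_order = coefs.shape[1] - 1
    return dcm_basis(times, time_order) @ coefs @ dcm_basis(freqs, freq_order).T
```

Under these assumptions, a segment of n_frames by n_freqs amplitudes is summarized by (time_order + 1) * (freq_order + 1) coefficients, which is the sense in which the two model orders trade coefficient rate against fit accuracy.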
Pages: 3937-3940
Page count: 4
Related papers
50 in total
  • [21] Flexible modeling of the hazard rate and treatment effects in long-term survival studies
    Hagar, Yolanda
    Dignam, James J.
    Dukic, Vanja
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2017, 26 (05) : 2455 - 2480
  • [22] 3D MODEL RETRIEVAL USING 2D CEPSTRAL FEATURES
    Lee, Chang-Hsing
    Shih, Jau-Ling
    Chou, Chih-Hsun
    Yu, Kung-Ming
    Hung, Chuan-Yen
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 2365 - 2368
  • [23] FLEXIBLE SIMULATION AND MODELING FOR 2D TOPOLOGY NOC SYSTEM DESIGN
    Gharan, Masoud Oveis
    Khan, Gul N.
    2011 24TH CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2011, : 180 - 185
  • [24] Systematic long-term comparison of spectral UV measurements and UVSPEC modeling results
    Mayer, B
    Seckmeyer, G
    Kylling, A
    JOURNAL OF GEOPHYSICAL RESEARCH-ATMOSPHERES, 1997, 102 (D7) : 8755 - 8767
  • [25] Modeling of spectral dependences for 2D photonic crystal waveguide systems
    A. N. Bogolubov
    G. V. Belokopytov
    Z. O. Dombrovskaya
    Moscow University Physics Bulletin, 2013, 68 : 344 - 350
  • [26] Modeling of spectral dependences for 2D photonic crystal waveguide systems
    Bogolubov, A. N.
    Belokopytov, G. V.
    Dombrovskaya, Z. O.
    MOSCOW UNIVERSITY PHYSICS BULLETIN, 2013, 68 (05) : 344 - 350
  • [27] ADSTOCK MODELING FOR THE LONG-TERM
    BROADBENT, S
    FRY, T
    JOURNAL OF THE MARKET RESEARCH SOCIETY, 1995, 37 (04) : 385 - 403
  • [28] THE ROLE OF SHORT-TERM AND LONG-TERM AUDITORY STORAGE IN PROCESSING SPECTRAL RELATIONS FOR ADULT AND CHILD SPEECH
    OHDE, RN
    PERRY, AH
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1994, 96 (03) : 1303 - 1313
  • [29] Recursive On-Line (2D)2PCA and Its Application to Long-Term Background Subtraction
    Seo, Ja-Won
    Kim, Seong Dae
    IEEE TRANSACTIONS ON MULTIMEDIA, 2014, 16 (08) : 2333 - 2344
  • [30] Long-term variations of the pulsation amplitudes of 16(EN) lacertae
    Jerzykiewicz, M
    Pigulski, A
    PROCEEDINGS OF THE 27TH MEETING OF THE POLISH ASTRONOMICAL SOCIETY, 1996, : 76 - 77