Syllable Matching Algorithm with Spectral Peak Point Feature for Chinese Speech

被引:0
|
作者
Tang Weikang [1 ]
Shao Yubin [1 ]
Long Hua [1 ]
Du Qingzhi [1 ]
Peng Yi [1 ]
Chen Liang [1 ]
机构
[1] Kunming Univ Sci & Technol, Sch Informat Engn & Automat, Kunming 650500, Yunnan, Peoples R China
关键词
signal processing; syllable matching; extreme value point; logarithmic frequency range;
D O I
10.3788/LOP202259.0707001
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Based on the spectral peak point characteristics of Chinese speech, this study proposes a syllable matching algorithm to improve the matching effect of Chinese speech syllables in noisy environments. First, a discrete cosine transform is used to extract the speech signal envelope spectrogram, and the human ear masking effect is used for spectral energy judgment to obtain the extreme value points of spectral energy in each frame. Then, the syllable signal is corresponded to a binary sequence by performing binary quantization in the logarithmic frequency range. Finally, the syllable matching result is determined based on the template comparison of the binary sequence. The results show that the proposed algorithm outperforms the conventional methods for matching syllables in the noiseless Chinese speech. Additionally, it has a high matching accuracy at low signal-to-noise ratios.
引用
收藏
页数:9
相关论文
共 16 条
  • [1] Ai J Q, 2020, APPL RES COMPUTERS, V37, P147
  • [2] Pitch estimation of speech and music sound based on multi-scale product with auditory feature extraction
    Ben Messaoud M.A.
    Bouzid A.
    [J]. International Journal of Speech Technology, 2016, 19 (1) : 65 - 73
  • [3] A novel and efficient 8-point DCT approximation for image compression
    Brahimi, Nabila
    Bouden, Toufik
    Brahimi, Tahar
    Boubchir, Larbi
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (11-12) : 7615 - 7631
  • [4] Comparison of Mel Frequency Ceptrum Coefficient and Perceptual Linear Predictive in Perceptual Measurement of Chinese Initials
    Chen, Sai
    Wang, Hongcui
    Jia, Jia
    An, Yeteng
    Dang, Jianwu
    [J]. INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY II, PTS 1-4, 2013, 411-414 : 291 - +
  • [5] Wavelet Threshold Denoising Algorithm for Impulse Noise Removal
    Fang Bin
    Chen Jiayi
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (22)
  • [6] Gan T, 2015, FAST BROADCAST AUDIO
  • [7] Guo Xing-ji, 2006, J HENAN NORMAL U NAT, V34
  • [8] Improved Non-Local Mean Denoising Algorithm Based on Difference Hash Algorithm
    Hua Chunjian
    Ma Jinke
    Chen Ying
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (14)
  • [9] On the Order of Approximation in Limit Theorems for Negative-Binomial Sums of Strictly Stationary m-Dependent Random Variables
    Hung, Tran Loc
    Kien, Phan Tri
    [J]. ACTA MATHEMATICA VIETNAMICA, 2021, 46 (01) : 203 - 224
  • [10] Lee C Y., 2006, SPEECH SCI, V13, P177