RESEARCH ON ENGLISH SPEECH ENHANCEMENT ALGORITHM BASED ON IMPROVED SPECTRAL SUBTRACTION AND DEEP NEURAL NETWORK

被引:1
|
作者
Zhou, Qiaoling [1 ]
机构
[1] Fujian Agr & Forestry Univ, Int Coll, 15 Shangxiadian Rd, Fuzhou 350002, Peoples R China
关键词
Improved spectrum subtraction; Deep neural network; Speech enhancement; Amplitude spectrum; English communication; NOISE;
D O I
10.24507/ijicic.16.05.1711
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In order to solve the introduced unstructured voiceless problems of conventional spectrum subtraction in English speech signals enhancement, this paper proposes a novel English speech signals enhancement algorithm. This algorithm uses an improved minimal controlled recursive averaging (IMCRA) method to estimate noise spectrum, and tracks the estimated noise spectrum in real time. Then, the deep neural network (DNN) is used to construct the nonlinear mapping function of log amplitude spectrum between speech with noises and ideal pure speech for English speech enhancement. To validate the feasibility and effectiveness of the proposed algorithm, the standard IEEE speech signals and Noise-91 noise signals are used for experiments. Experimental results have shown that the proposed IMCRA method has stronger ability to avoid noises in speech signals, and the DNN method can well recover the speech components and spectrum structure polluted by noises. To enhance English speech in daily international speech communication, the proposed combination method has strong robustness to various real noise environments, and brings significant improvement to interpersonal communication and human computer communication.
引用
收藏
页码:1711 / 1723
页数:13
相关论文
共 50 条
  • [31] Deep neural network and noise classification-based speech enhancement
    Shi, Wenhua
    Zhang, Xiongwei
    Zou, Xia
    Han, Wei
    MODERN PHYSICS LETTERS B, 2017, 31 (19-21):
  • [32] A Deep Neural Network Based Harmonic Noise Model for Speech Enhancement
    Ouyang, Zhiheng
    Yu, Hongjiang
    Zhu, Wei-Ping
    Champagne, Benoit
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3224 - 3228
  • [33] Fractional feature-based speech enhancement with deep neural network
    Xu, Liyun
    Zhang, Tong
    SPEECH COMMUNICATION, 2023, 153
  • [34] Subjective intelligibility of deep neural network-based speech enhancement
    Gelderblom, Femke B.
    Tronstad, Tron V.
    Viggen, Erlend Magnus
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1968 - 1972
  • [35] A Perceptually Motivated Approach for Speech Enhancement Based on Deep Neural Network
    Han, Wei
    Zhang, Xiongwei
    Min, Gang
    Sun, Meng
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2016, E99A (04): : 835 - 838
  • [36] Research on the neural network based on an improved PSO algorithm
    Liu, Jiang
    GREEN BUILDING, ENVIRONMENT, ENERGY AND CIVIL ENGINEERING, 2017, : 49 - 53
  • [37] An improved spectral subtraction method for speech enhancement using a perceptual weighting filter
    Udrea, Radu Mihnea
    Vizireanu, Nicolae D.
    Ciochina, Silviu
    DIGITAL SIGNAL PROCESSING, 2008, 18 (04) : 581 - 587
  • [38] LOCAL TRAJECTORY BASED SPEECH ENHANCEMENT FOR ROBUST SPEECH RECOGNITION WITH DEEP NEURAL NETWORK
    You, Yongbin
    Qian, Yanmin
    Yu, Kai
    2015 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING, 2015, : 5 - 9
  • [39] Research on Dungan speech synthesis based on Deep Neural Network
    Chen, Lijia
    Yang, Hongwu
    Wang, Hui
    2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 46 - 50
  • [40] An Auditory Perception based Improved Multi-Band Spectral Subtraction Algorithm for Enhancement of Speech Degraded by Non-Stationary Noises
    Upadhyay, Navneet
    Karmakar, Abhijit
    4TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN COMPUTER INTERACTION (IHCI 2012), 2012,