IMAGE AND AUDIO-SPEECH DENOISING BASED ON HIGHER-ORDER STATISTICAL MODELING OF WAVELET COEFFICIENTS AND LOCAL VARIANCE ESTIMATION

被引:14
|
作者
Kittisuwan, Pichid [1 ]
Chanwimaluan, Thitiporn
Marukatat, Sanparith
Asdornwised, Widhyakorn [1 ]
机构
[1] Chulalongkorn Univ, Fac Engn, Dept Elect Engn, Bangkok 10330, Thailand
关键词
Pearson Type VII random vectors; image denoising; wavelet transforms; BIVARIATE SHRINKAGE; RANDOM VECTORS; TRANSFORM;
D O I
10.1142/S0219691310003808
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
At first, this paper is concerned with wavelet-based image denoising using Bayesian technique. In conventional denoising process, the parameters of probability density function (PDF) are usually calculated from the first few moments, mean and variance. In the first part of our work, a new image denoising algorithm based on Pearson Type VII random vectors is proposed. This PDF is used because it allows higher-order moments to be incorporated into the noiseless wavelet coefficients' probabilistic model. One of the cruxes of the Bayesian image denoising algorithms is to estimate the variance of the clean image. Here, maximum a posterior (MAP) approach is employed for not only noiseless wavelet-coefficient estimation but also local observed variance acquisition. For the local observed variance estimation, the selection of noisy wavelet-coefficient model, either a Laplacian or a Gaussian distribution, is based upon the corrupted noise power where Gamma distribution is used as a prior for the variance. Evidently, our selection of prior is motivated by analytical and computational tractability. In our experiments, our proposed method gives promising denoising results with moderate complexity. Eventually, our image denoising method can be simply extended to audio/speech processing by forming matrix representation whose rows are formed by time segments of digital speech waveforms. This way, the use of our image denoising methods can be exploited to improve the performance of various audio/speech tasks, e.g., denoised enhancement of voice activity detection to capture voiced speech, significantly needed for speech coding and voice conversion applications. Moreover, one of the voice abnormality detections, called oropharyngeal dysphagia classification, is also required denoising method to improve the signal quality in elderly patients. We provide simple speech examples to demonstrate the prospects of our techniques.
引用
收藏
页码:987 / 1017
页数:31
相关论文
共 34 条
  • [31] A novel method for early diagnosis of Alzheimer’s disease based on higher-order spectral estimation of spontaneous speech signals
    Mahda Nasrolahzadeh
    Zeynab Mohammadpoory
    Javad Haddadnia
    Cognitive Neurodynamics, 2016, 10 : 495 - 503
  • [32] A novel method for early diagnosis of Alzheimer's disease based on higher-order spectral estimation of spontaneous speech signals
    Nasrolahzadeh, Mahda
    Mohammadpoory, Zeynab
    Haddadnia, Javad
    COGNITIVE NEURODYNAMICS, 2016, 10 (06) : 495 - 503
  • [33] Image-based gradient non-linearity characterization to determine higher-order spherical harmonic coefficients for improved spatial position accuracy in magnetic resonance imaging
    Weavers, Paul T.
    Tao, Shengzhen
    Trzasko, Joshua D.
    Shu, Yunhong
    Tryggestad, Erik J.
    Gunter, Jeffrey L.
    McGee, Kiaran P.
    Litwiller, Daniel V.
    Hwang, Ken-Pin
    Bernstein, Matt A.
    MAGNETIC RESONANCE IMAGING, 2017, 38 : 54 - 62
  • [34] Pose-Invariant 3D Proximal Femur Estimation through Bi-planar Image Segmentation with Hierarchical Higher-Order Graph-Based Priors
    Wang, Chaohui
    Boussaid, Haithem
    Simon, Loic
    Lazennec, Jean-Yves
    Paragios, Nikos
    MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, MICCAI 2011, PT III, 2011, 6893 : 346 - +