IMAGE AND AUDIO-SPEECH DENOISING BASED ON HIGHER-ORDER STATISTICAL MODELING OF WAVELET COEFFICIENTS AND LOCAL VARIANCE ESTIMATION

Cited by: 14

Authors:

Kittisuwan, Pichid [1]

Chanwimaluan, Thitiporn

Marukatat, Sanparith

Asdornwised, Widhyakorn [1]

Affiliations:

[1] Chulalongkorn Univ, Fac Engn, Dept Elect Engn, Bangkok 10330, Thailand

Source:

INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING | 2010 / Vol. 8 / No. 06

Keywords:

Pearson Type VII random vectors; image denoising; wavelet transforms; bivariate shrinkage; random vectors; transform

DOI:

10.1142/S0219691310003808

Chinese Library Classification (CLC) number:

TP31 [Computer Software];

Subject classification code:

081202; 0835

Abstract:

This paper first addresses wavelet-based image denoising using Bayesian techniques. In conventional denoising, the parameters of the probability density function (PDF) are usually calculated from only the first few moments, namely the mean and variance. In the first part of our work, we propose a new image denoising algorithm based on Pearson Type VII random vectors. This PDF is used because it allows higher-order moments to be incorporated into the probabilistic model of the noiseless wavelet coefficients. One of the cruxes of Bayesian image denoising algorithms is estimating the variance of the clean image. Here, the maximum a posteriori (MAP) approach is employed not only to estimate the noiseless wavelet coefficients but also to obtain the local observed variance. For the local observed variance estimation, the model of the noisy wavelet coefficients, either a Laplacian or a Gaussian distribution, is selected according to the noise power, and a Gamma distribution is used as the prior for the variance. This choice of prior is motivated by analytical and computational tractability. In our experiments, the proposed method gives promising denoising results with moderate complexity. Finally, our image denoising method can be readily extended to audio/speech processing by forming a matrix whose rows are time segments of the digital speech waveform. In this way, our image denoising methods can be used to improve the performance of various audio/speech tasks, e.g., denoising-enhanced voice activity detection to capture voiced speech, which is much needed in speech coding and voice conversion applications. Moreover, the classification of oropharyngeal dysphagia, a type of voice abnormality detection, also requires denoising to improve signal quality for elderly patients. We provide simple speech examples to demonstrate the prospects of our techniques.
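
The matrix representation mentioned in the abstract, in which each row is a time segment of the speech waveform, might look roughly like the following. This is a minimal sketch and not the authors' implementation: the function names `segments_to_matrix` and `matrix_to_signal`, the use of non-overlapping segments, and the zero padding of the last segment are all assumptions made for illustration.

```python
import numpy as np

def segments_to_matrix(signal, segment_len):
    """Stack consecutive, non-overlapping segments of a 1-D signal as rows.

    Assumption: segments do not overlap and the final, shorter segment is
    zero padded. The paper may use a different segmentation scheme.
    """
    if segment_len <= 0:
        raise ValueError("segment_len must be positive")
    signal = np.asarray(signal, dtype=float).ravel()
    n_rows = -(-signal.size // segment_len)  # ceiling division
    padded = np.zeros(n_rows * segment_len)
    padded[:signal.size] = signal
    return padded.reshape(n_rows, segment_len)

def matrix_to_signal(matrix, original_len):
    """Flatten the row-segment matrix back to a 1-D signal, dropping padding."""
    return np.asarray(matrix, dtype=float).reshape(-1)[:original_len]
```

With such a matrix in hand, any 2-D (image) denoiser could in principle be applied to it and the result flattened back with `matrix_to_signal`; the denoising step itself is outside the scope of this sketch.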

Pages: 987-1017

Page count: 31
