Ensemble Estimators for Multivariate Entropy Estimation

被引:57
作者
Sricharan, Kumar [1 ]
Wei, Dennis [1 ]
Hero, Alfred O., III [1 ]
机构
[1] Univ Michigan, Dept Elect & Comp Engn, Ann Arbor, MI 48109 USA
关键词
Efficient estimation; ensemble estimators; entropy estimation; kernel density estimation; parametric convergence rate; plug-in estimation; INTEGRAL FUNCTIONALS;
D O I
10.1109/TIT.2013.2251456
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The problem of estimation of density functionals like entropy and mutual information has received much attention in the statistics and information theory communities. A large class of estimators of functionals of the probability density suffer from the curse of dimensionality, wherein the mean squared error decays increasingly slowly as a function of the sample size T as the dimension d of the samples increases. In particular, the rate is often glacially slow of order O(T-gamma/d), where gamma > 0 is a rate parameter. Examples of such estimators include kernel density estimators, k-nearest neighbor (k-NN) density estimators, k-NN entropy estimators, intrinsic dimension estimators, and other examples. In this paper, we propose a weighted affine combination of an ensemble of such estimators, where optimal weights can be chosen such that the weighted estimator converges at a much faster dimension invariant rate of O(T-1). Furthermore, we show that these optimal weights can be determined by solving a convex optimization problem which can be performed offline and does not require training data. We illustrate the superior performance of our weighted estimator for two important applications: 1) estimating the Panter-Dite distortion-rate factor; and 2) estimating the Shannon entropy for testing the probability distribution of a random sample.
引用
收藏
页码:4374 / 4388
页数:15
相关论文
共 19 条
[1]  
[Anonymous], THESIS U MICHIGAN AN
[2]  
[Anonymous], 1989, Limit Theorems in Probability and Statistics, Eds
[3]  
Beirlant J, 1997, International Journal of Mathematical and Statistical Sciences, V6, P17
[4]   ESTIMATION OF INTEGRAL FUNCTIONALS OF A DENSITY [J].
BIRGE, L ;
MASSART, P .
ANNALS OF STATISTICS, 1995, 23 (01) :11-29
[5]  
Corona F., 2009, IEEE MLSP INT WORKSH
[6]  
Costa J., 2003, 334 U MICH COMM SIGN
[7]   Geodesic entropic graphs for dimension and entropy estimation in manifold learning [J].
Costa, JA ;
Hero, AO .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2004, 52 (08) :2210-2221
[8]   OPTIMIZATION OF K NEAREST-NEIGHBOR DENSITY ESTIMATES [J].
FUKUNAGA, K ;
HOSTETLER, LD .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1973, 19 (03) :320-326
[9]   Uniform in Bandwidth Estimation of Integral Functionals of the Density Function [J].
Gine, Evarist ;
Mason, David M. .
SCANDINAVIAN JOURNAL OF STATISTICS, 2008, 35 (04) :739-761
[10]   DENSITY-FREE CONVERGENCE PROPERTIES OF VARIOUS ESTIMATORS OF ENTROPY [J].
GYORFI, L ;
VANDERMEULEN, EC .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 1987, 5 (04) :425-436