Complexity-Regularized Tree-Structured Partition for Mutual Information Estimation

被引:7
|
作者
Silva, Jorge F. [1 ]
Narayanan, Shrikanth [2 ]
机构
[1] Univ Chile, Dept Elect Engn, Santiago 1058, Chile
[2] Univ So Calif, Viterbi Sch Engn, Dept Elect Engn, Los Angeles, CA 90089 USA
基金
美国国家科学基金会;
关键词
Complexity regularization; data-dependent partitions; histogram-based estimates; minimum cost tree pruning; mutual information (MI); strong consistency; tree-structured partitions (TSPs); Vapnik and Chervonenkis inequality; DIVERGENCE ESTIMATION; CONSISTENT; CLASSIFICATION; OPTIMIZATION; CONVERGENCE; PROBABILITY;
D O I
10.1109/TIT.2011.2177771
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A new histogram-based mutual information estimator using data-driven tree-structured partitions (TSP) is presented in this paper. The derived TSP is a solution to a complexity regularized empirical information maximization, with the objective of finding a good tradeoff between the known estimation and approximation errors. A distribution-free concentration inequality for this tree-structured learning problem as well as finite sample performance bounds for the proposed histogram-based solution is derived. It is shown that this solution is density-free strongly consistent and that it provides, with an arbitrary high probability, an optimal balance between the mentioned estimation and approximation errors. Finally, for the emblematic scenario of independence, I(X;Y), it is shown that the TSP estimate converges to zero with O(e(-n1/3+log log n)).
引用
收藏
页码:1940 / 1952
页数:13
相关论文
共 44 条