Nonparametric k-nearest-neighbor entropy estimator

被引:48
|
作者
Lombardi, Damiano [1 ]
Pant, Sanjay
机构
[1] Inria Paris Rocquencourt, Boite Postale 105, F-78153 Le Chesnay, France
关键词
D O I
10.1103/PhysRevE.93.013310
中图分类号
O35 [流体力学]; O53 [等离子体物理学];
学科分类号
070204 ; 080103 ; 080704 ;
摘要
A nonparametric k-nearest-neighbor-based entropy estimator is proposed. It improves on the classical Kozachenko-Leonenko estimator by considering nonuniform probability densities in the region of k-nearest neighbors around each sample point. It aims to improve the classical estimators in three situations: first, when the dimensionality of the random variable is large; second, when near-functional relationships leading to high correlation between components of the random variable are present; and third, when the marginal variances of random variable components vary significantly with respect to each other. Heuristics on the error of the proposed and classical estimators are presented. Finally, the proposed estimator is tested for a variety of distributions in successively increasing dimensions and in the presence of a near-functional relationship. Its performance is compared with a classical estimator, and a significant improvement is demonstrated.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Distributed processing of moving K-nearest-neighbor query on moving objects
    Wu, Wei
    Guo, Wenyuan
    Tan, Kian-Lee
    2007 IEEE 23RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2007, : 1091 - +
  • [42] An Improved Algorithm for k-Nearest-Neighbor Finding and Surface Normals Estimation
    赵灿
    孟祥林
    TsinghuaScienceandTechnology, 2009, 14(S1) (S1) : 77 - 81
  • [43] Optimal construction of k-nearest-neighbor graphs for identifying noisy clusters
    Maier, Markus
    Hein, Matthias
    von Luxburg, Ulrike
    THEORETICAL COMPUTER SCIENCE, 2009, 410 (19) : 1749 - 1764
  • [44] Divergence Estimation for Multidimensional Densities Via k-Nearest-Neighbor Distances
    Wang, Qing
    Kulkarni, Sanjeev R.
    Verdu, Sergio
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2009, 55 (05) : 2392 - 2405
  • [45] K-Nearest-Neighbor Local Sampling Based Conditional Independence Testing
    Li, Shuai
    Zhang, Yingjie
    Zhu, Hongtu
    Wang, Christina Dan
    Shu, Hai
    Chen, Ziqi
    Sun, Zhuoran
    Yang, Yanfeng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [46] GRkNN: Group reverse k-nearest-neighbor query in spatial databases
    Song X.-Y.
    Yu C.-C.
    Sun H.-L.
    Xu J.-K.
    Jisuanji Xuebao/Chinese Journal of Computers, 2010, 33 (12): : 2229 - 2238
  • [47] FINCH: Evaluating Reverse k-Nearest-Neighbor Queries on Location Data
    Wu, Wei
    Yang, Fei
    Chan, Chee-Yong
    Tan, Kian-Lee
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2008, 1 (01): : 1056 - 1067
  • [48] ARKGraph: All-Range Approximate K-Nearest-Neighbor Graph
    Zuo, Chaoji
    Deng, Dong
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2023, 16 (10): : 2645 - 2658
  • [49] Efficient Cluster-Based k-Nearest-Neighbor Machine Translation
    Wang, Dexin
    Fan, Kai
    Chen, Boxing
    Xiong, Deyi
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 2175 - 2187
  • [50] K-NEAREST-NEIGHBOR DECISION RULE PERFORMANCE IN A SPEECH RECOGNITION SYSTEM
    WHITE, GM
    FONG, PJ
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1975, SMC5 (03): : 389 - 389