Refined Lower Bounds for Nearest Neighbor Condensation

被引:0
|
作者
Chitnis, Rajesh [1 ]
机构
[1] Univ Birmingham, Sch Comp Sci, Birmingham, W Midlands, England
来源
INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 167 | 2022年 / 167卷
关键词
nearest neighbor condensation; parameterized complexity; exponential time hypothesis;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of the most commonly used classification techniques is the nearest neighbor rule: given a training set T of labeled points in a metric space (X, rho), a new unlabeled point x is an element of chi is assigned the label of its nearest neighbor in T. To improve both the space & time complexity of this classification, it is desirable to reduce the size of the training set without compromising too much on the accuracy of the classification. Hart (1968) formalized this as the NEAREST NEIGHBOR CONDENSATION (NNC) problem: find a subset C subset of T of minimum size which is consistent with T, i.e., each point t is an element of T has the same label as that of its nearest neighbor in C. This problem is known to be NP-hard (Wilfong, 1991), and the heuristics used in practice often have weak or no theoretical guarantees. We analyze this problem via the refined lens of parameterized complexity, and obtain strong lower bounds for the k-NNC-(Z(d), l(p)) problem which asks if there is a consistent subset of size <= k for a given training set of size n in the metric space (Z(d), l(p)) for any 1 <= p <= infinity: The k-NNC-(Z(d), l(p)) problem is W[1]-hard parameterized by k + d, i.e., unless FPT = W[1], there is no f(k, d) center dot n(O(1)) time algorithm for any computable function f. Under the Exponential Time Hypothesis (ETH), there is no d >= 2 and computable function f such that the k-NNC-(Z(d), l(p)) problem can be solved in f(k, d) center dot n(o(k1-1/d)) time. The second lower bound shows that there is a so-called (Marx and Sidiropoulos, 2014) "limited blessing of low-dimensionality": for small d some improvement might be possible over the brute-force n(O(k)) time algorithm, but as d becomes large the brute-force algorithm becomes asymptotically optimal. It also shows that the is the n(O(root k)) time algorithm of Biniaz et al. (2019) for k-NNC-(R-2, l(2)) is asymptotically tight. Our lower bounds on the fine-grained complexity of NEAREST NEIGHBOR CONDENSATION in a sense justify the use of heuristics in practice, even though they have weak or no theoretical guarantees.
引用
收藏
页数:20
相关论文
共 50 条
  • [21] Nearest neighbor and reverse nearest neighbor queries for moving objects
    Benetis, R
    Jensen, CS
    Karciauskas, G
    Saltenis, S
    IDEAS 2002: INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM, PROCEEDINGS, 2002, : 44 - 53
  • [22] A novel version of k nearest neighbor: Dependent nearest neighbor
    Ertugrul, Omer Faruk
    Tagluk, Mehmet Emin
    APPLIED SOFT COMPUTING, 2017, 55 : 480 - 490
  • [23] DUAL VECTORS AND LOWER BOUNDS FOR THE NEAREST LATTICE POINT PROBLEM
    HASTAD, J
    COMBINATORICA, 1988, 8 (01) : 75 - 81
  • [24] Fast algorithm for nearest neighbor search based on a lower bound tree
    Chen, YS
    Hung, YP
    Fuh, CS
    EIGHTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOL I, PROCEEDINGS, 2001, : 446 - 453
  • [25] Lower Bounds on Near Neighbor Search via Metric Expansion
    Panigrahy, Rina
    Talwar, Kunal
    Wieder, Udi
    2010 IEEE 51ST ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE, 2010, : 805 - 814
  • [26] Nearest Neighbor and Kernel Survival Analysis: Nonasymptotic Error Bounds and Strong Consistency Rates
    Chen, George H.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [27] COUNTERION CONDENSATION - EFFECTS OF SITE BINDING, FLUCTUATIONS IN NEAREST-NEIGHBOR INTERACTIONS, AND BENDING
    HEATH, PJ
    SCHURR, JM
    MACROMOLECULES, 1992, 25 (16) : 4149 - 4159
  • [28] Nearest neighbor ensemble
    Domeniconi, C
    Yan, B
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, 2004, : 228 - 231
  • [29] NEAREST NEIGHBOR PROBLEMS
    Wilfong, Gordon
    INTERNATIONAL JOURNAL OF COMPUTATIONAL GEOMETRY & APPLICATIONS, 1992, 2 (04) : 383 - 416
  • [30] NEAREST NEIGHBOR ALGORITHM
    KIRKPATRICK, RC
    LECTURE NOTES IN PHYSICS, 1985, 238 : 302 - 311