Refined Lower Bounds for Nearest Neighbor Condensation

被引:0
|
作者
Chitnis, Rajesh [1 ]
机构
[1] Univ Birmingham, Sch Comp Sci, Birmingham, W Midlands, England
来源
INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 167 | 2022年 / 167卷
关键词
nearest neighbor condensation; parameterized complexity; exponential time hypothesis;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One of the most commonly used classification techniques is the nearest neighbor rule: given a training set T of labeled points in a metric space (X, rho), a new unlabeled point x is an element of chi is assigned the label of its nearest neighbor in T. To improve both the space & time complexity of this classification, it is desirable to reduce the size of the training set without compromising too much on the accuracy of the classification. Hart (1968) formalized this as the NEAREST NEIGHBOR CONDENSATION (NNC) problem: find a subset C subset of T of minimum size which is consistent with T, i.e., each point t is an element of T has the same label as that of its nearest neighbor in C. This problem is known to be NP-hard (Wilfong, 1991), and the heuristics used in practice often have weak or no theoretical guarantees. We analyze this problem via the refined lens of parameterized complexity, and obtain strong lower bounds for the k-NNC-(Z(d), l(p)) problem which asks if there is a consistent subset of size <= k for a given training set of size n in the metric space (Z(d), l(p)) for any 1 <= p <= infinity: The k-NNC-(Z(d), l(p)) problem is W[1]-hard parameterized by k + d, i.e., unless FPT = W[1], there is no f(k, d) center dot n(O(1)) time algorithm for any computable function f. Under the Exponential Time Hypothesis (ETH), there is no d >= 2 and computable function f such that the k-NNC-(Z(d), l(p)) problem can be solved in f(k, d) center dot n(o(k1-1/d)) time. The second lower bound shows that there is a so-called (Marx and Sidiropoulos, 2014) "limited blessing of low-dimensionality": for small d some improvement might be possible over the brute-force n(O(k)) time algorithm, but as d becomes large the brute-force algorithm becomes asymptotically optimal. It also shows that the is the n(O(root k)) time algorithm of Biniaz et al. (2019) for k-NNC-(R-2, l(2)) is asymptotically tight. Our lower bounds on the fine-grained complexity of NEAREST NEIGHBOR CONDENSATION in a sense justify the use of heuristics in practice, even though they have weak or no theoretical guarantees.
引用
收藏
页数:20
相关论文
共 50 条
  • [41] Vertex Cover Kernelization Revisited: Upper and Lower Bounds for a Refined Parameter
    Jansen, Bart M. P.
    Bodlaender, Hans L.
    28TH INTERNATIONAL SYMPOSIUM ON THEORETICAL ASPECTS OF COMPUTER SCIENCE (STACS 2011), 2011, 9 : 177 - 188
  • [42] Fast k-nearest Neighbor Search for Face Identification Using Bounds of Residual Score
    Ishii, Masato
    Imaoka, Hitoshi
    Sato, Atsushi
    2017 12TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2017), 2017, : 194 - 199
  • [43] Cluster variation theory of the condensation of atoms in a honeycomb lattice gas with first nearest neighbor exclusion
    Asada, H
    Sinya, T
    Yano, M
    SURFACE SCIENCE, 2003, 527 (1-3) : 173 - 182
  • [44] Spin waves of anisotropic ferrimagnets with nearest neighbor and next-nearest neighbor interactions
    Xue, DS
    Gao, MZ
    Yang, JB
    Kong, Y
    Li, FS
    PHYSICA STATUS SOLIDI B-BASIC RESEARCH, 1996, 193 (01): : 161 - 166
  • [45] Approximate direct and reverse nearest neighbor queries, and the k-nearest neighbor graph
    Figueroa, Karina
    Paredes, Rodrigo
    SISAP 2009: 2009 SECOND INTERNATIONAL WORKSHOP ON SIMILARITY SEARCH AND APPLICATIONS, PROCEEDINGS, 2009, : 91 - +
  • [47] Nearest and reverse nearest neighbor queries for moving objects
    Rimantas Benetis
    Christian S. Jensen
    Gytis Karĉiauskas
    Simonas Ŝaltenis
    The VLDB Journal, 2006, 15 : 229 - 249
  • [48] Nearest and reverse nearest neighbor queries for moving objects
    Benetis, Rimantas
    Jensen, Christian S.
    Karciauskas, Gytis
    Saltenis, Simonas
    VLDB JOURNAL, 2006, 15 (03): : 229 - U1
  • [49] Nearest Neighbor Classifier Based on Nearest Feature Decisions
    James, Alex Pappachen
    Dimitrijev, Sima
    COMPUTER JOURNAL, 2012, 55 (09): : 1072 - 1087
  • [50] Coevolution of nearest neighbor classifiers
    Gagne, Christian
    Parizeau, Marc
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2007, 21 (05) : 921 - 946