Refined Lower Bounds for Nearest Neighbor Condensation

被引：0

作者：

Chitnis, Rajesh ^{[1
]}

机构：

[1] Univ Birmingham, Sch Comp Sci, Birmingham, W Midlands, England

来源：

INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 167 | 2022年 / 167卷

关键词：

nearest neighbor condensation; parameterized complexity; exponential time hypothesis;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

One of the most commonly used classification techniques is the nearest neighbor rule: given a training set T of labeled points in a metric space (X, rho), a new unlabeled point x is an element of chi is assigned the label of its nearest neighbor in T. To improve both the space & time complexity of this classification, it is desirable to reduce the size of the training set without compromising too much on the accuracy of the classification. Hart (1968) formalized this as the NEAREST NEIGHBOR CONDENSATION (NNC) problem: find a subset C subset of T of minimum size which is consistent with T, i.e., each point t is an element of T has the same label as that of its nearest neighbor in C. This problem is known to be NP-hard (Wilfong, 1991), and the heuristics used in practice often have weak or no theoretical guarantees. We analyze this problem via the refined lens of parameterized complexity, and obtain strong lower bounds for the k-NNC-(Z(d), l(p)) problem which asks if there is a consistent subset of size <= k for a given training set of size n in the metric space (Z(d), l(p)) for any 1 <= p <= infinity: The k-NNC-(Z(d), l(p)) problem is W[1]-hard parameterized by k + d, i.e., unless FPT = W[1], there is no f(k, d) center dot n(O(1)) time algorithm for any computable function f. Under the Exponential Time Hypothesis (ETH), there is no d >= 2 and computable function f such that the k-NNC-(Z(d), l(p)) problem can be solved in f(k, d) center dot n(o(k1-1/d)) time. The second lower bound shows that there is a so-called (Marx and Sidiropoulos, 2014) "limited blessing of low-dimensionality": for small d some improvement might be possible over the brute-force n(O(k)) time algorithm, but as d becomes large the brute-force algorithm becomes asymptotically optimal. It also shows that the is the n(O(root k)) time algorithm of Biniaz et al. (2019) for k-NNC-(R-2, l(2)) is asymptotically tight. Our lower bounds on the fine-grained complexity of NEAREST NEIGHBOR CONDENSATION in a sense justify the use of heuristics in practice, even though they have weak or no theoretical guarantees.

引用

页数：20

共 50 条

[41] Vertex Cover Kernelization Revisited: Upper and Lower Bounds for a Refined Parameter
Jansen, Bart M. P.
Bodlaender, Hans L.
28TH INTERNATIONAL SYMPOSIUM ON THEORETICAL ASPECTS OF COMPUTER SCIENCE (STACS 2011), 2011, 9 : 177 - 188
[42] Fast k-nearest Neighbor Search for Face Identification Using Bounds of Residual Score
Ishii, Masato
Imaoka, Hitoshi
Sato, Atsushi
2017 12TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2017), 2017, : 194 - 199
[43] Cluster variation theory of the condensation of atoms in a honeycomb lattice gas with first nearest neighbor exclusion
Asada, H
Sinya, T
Yano, M
SURFACE SCIENCE, 2003, 527 (1-3) : 173 - 182
[44] Spin waves of anisotropic ferrimagnets with nearest neighbor and next-nearest neighbor interactions
Xue, DS
Gao, MZ
Yang, JB
Kong, Y
Li, FS
PHYSICA STATUS SOLIDI B-BASIC RESEARCH, 1996, 193 (01): : 161 - 166
[45] Approximate direct and reverse nearest neighbor queries, and the k-nearest neighbor graph
Figueroa, Karina
Paredes, Rodrigo
SISAP 2009: 2009 SECOND INTERNATIONAL WORKSHOP ON SIMILARITY SEARCH AND APPLICATIONS, PROCEEDINGS, 2009, : 91 - +
[46] ON THE PROBABILITY THAT A RANDOM POINT IS THE JTH NEAREST NEIGHBOR TO ITS OWN KTH NEAREST NEIGHBOR
HENZE, N
JOURNAL OF APPLIED PROBABILITY, 1986, 23 (01) : 221 - 226
[47] Nearest and reverse nearest neighbor queries for moving objects
Rimantas Benetis
Christian S. Jensen
Gytis Karĉiauskas
Simonas Ŝaltenis
The VLDB Journal, 2006, 15 : 229 - 249
[48] Nearest and reverse nearest neighbor queries for moving objects
Benetis, Rimantas
Jensen, Christian S.
Karciauskas, Gytis
Saltenis, Simonas
VLDB JOURNAL, 2006, 15 (03): : 229 - U1
[49] Nearest Neighbor Classifier Based on Nearest Feature Decisions
James, Alex Pappachen
Dimitrijev, Sima
COMPUTER JOURNAL, 2012, 55 (09): : 1072 - 1087
[50] Coevolution of nearest neighbor classifiers
Gagne, Christian
Parizeau, Marc
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2007, 21 (05) : 921 - 946

← 1 2 3 4 5 →