A Brief Index for Proximity Searching

Citations: 0
Authors
Tellez, Eric Sadit [1]
Chavez, Edgar [1]
Camarena-Ibarrola, Antonio [1]
Affiliation
[1] Univ Michoacana, Hidalgo, Mexico
Source
PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, PROCEEDINGS | 2009 / Vol. 5856
Keywords
SPACES;
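DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract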
Many pattern recognition tasks can be modeled as proximity searching. Here the common task is to quickly find all the elements close to a given query without sequentially scanning a very large database. A recent shift in the searching paradigm has been established by using permutations instead of distances to predict proximity. Every object in the database records how the set of reference objects (the permutants) is seen, i.e. only the relative positions are used. When a query arrives, the relative displacements of the permutants between the query and a particular object are measured. This approach turned out to be the most efficient and scalable, at the expense of losing recall in the answers. The permutation of every object is represented with κ short integers in practice, producing bulky indexes of 16κn bits. In this paper we show how to represent the permutation as a binary vector, using just one bit for each permutant (instead of log κ in the plain representation). The Hamming distance in the binary signature is then used to predict proximity between objects in the database. We tested this approach with many real-life metric databases, obtaining faster queries with a recall close to that of the Spearman rho using 16 times less space.
Pages: 529 - 536
Page count: 8
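The idea summarized in the abstract (one bit per permutant, compared with the Hamming distance) can be illustrated with a minimal Python sketch. The abstract does not say how each bit is derived, so the rule used here (set the bit when a permutant's rank differs from its own index by more than a threshold) is an assumption made for illustration, as are all function names and the threshold parameter.

```python
from typing import Callable, List, Sequence


def permutation(obj, permutants: Sequence, dist: Callable) -> List[int]:
    """Permutant indices ordered by increasing distance to obj (ties by index)."""
    return sorted(range(len(permutants)),
                  key=lambda i: (dist(obj, permutants[i]), i))


def brief_signature(perm: Sequence[int], threshold: int) -> int:
    """Pack one bit per permutant into an int.

    ASSUMPTION: bit p is set when permutant p sits more than `threshold`
    positions away from its own index. The abstract does not state the rule.
    """
    sig = 0
    for rank, p in enumerate(perm):
        if abs(rank - p) > threshold:
            sig |= 1 << p
    return sig


def hamming(a: int, b: int) -> int:
    """Number of bit positions in which the two signatures differ."""
    return bin(a ^ b).count("1")


def index_bits(kappa: int, n: int, bits_per_permutant: int) -> int:
    """Index size in bits for n objects with kappa permutants each."""
    return kappa * n * bits_per_permutant


# Toy 1-D metric space with four permutants.
permutants = [0.0, 1.0, 2.0, 3.0]
dist = lambda a, b: abs(a - b)

sig_near = brief_signature(permutation(0.1, permutants, dist), threshold=1)
sig_far = brief_signature(permutation(2.9, permutants, dist), threshold=1)
```

With these toy values, `hamming(sig_near, sig_far)` is 2, and `index_bits(4, 10, 16) // index_bits(4, 10, 1)` gives the factor of 16 between the 16κn-bit plain representation and the one-bit-per-permutant signature.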
Related papers
50 in total
  • [1] Brief communication - Adjacency and proximity searching in the Science Citation Index and Google
    Kostoff, Ronald N.
    Rigsby, John T.
    Barth, Ryan B.
    JOURNAL OF INFORMATION SCIENCE, 2006, 32 (06) : 581 - 587
  • [2] Dynamic Permutation Based Index for Proximity Searching
    Figueroa, Karina
    Paredes, Rodrigo
    SIMILARITY SEARCH AND APPLICATIONS, SISAP 2015, 2015, 9371 : 97 - 102
  • [3] Boosting the Permutation Based Index for Proximity Searching
    Figueroa, Karina
    Paredes, Rodrigo
    PATTERN RECOGNITION (MCPR 2015), 2015, 9116 : 103 - 112
  • [4] Fixed Height Queries Tree Permutation Index for Proximity Searching
    Figueroa, Karina
    Paredes, Rodrigo
    Antonio Camarena-Ibarrola, J.
    Reyes, Nora
    PATTERN RECOGNITION (MCPR 2017), 2017, 10267 : 74 - 83
  • [5] Proximity searching in high dimensional spaces with a proximity preserving order
    Chávez, E
    Figueroa, K
    Navarro, G
    MICAI 2005: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2005, 3789 : 405 - 414
  • [6] A Unified Approach to Approximate Proximity Searching
    Arya, Sunil
    da Fonseca, Guilherme D.
    Mount, David M.
    ALGORITHMS-ESA 2010, 2010, 6346 : 374 - +
  • [7] VEHICLE SCHEDULING - PROXIMITY PRIORITY SEARCHING
    WILLIAMS, BW
    JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 1982, 33 (10) : 961 - 966
  • [8] Faster proximity searching with the distal SAT
    Chavez, Edgar
    Luduena, Veronica
    Reyes, Nora
    Roggero, Patricia
    INFORMATION SYSTEMS, 2016, 59 : 15 - 47
  • [9] Faster proximity searching in metric data
    Chávez, E
    Figueroa, K
    MICAI 2004: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2004, 2972 : 222 - 231
  • [10] List of Clustered Permutations for Proximity Searching
    Figueroa, Karina
    Paredes, Rodrigo
    SIMILARITY SEARCH AND APPLICATIONS (SISAP), 2013, 8199 : 50 - 58