Efficient Indexing of Similarity Models with Inequality Symbolic Regression

Authors
Bartos, Tomas [1]
Skopal, Tomas [1]
Mosko, Juraj [1]
Affiliation
[1] Charles Univ Prague, Fac Math & Phys, SIRET Grp, Prague, Czech Republic
Keywords
Genetic programming; Similarity research; Content based retrieval;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The increasing amount of available unstructured content has introduced a new concept of searching for information: content-based retrieval. The underlying principle is that objects are compared based on their content, which is far more complex than simple text- or metadata-based searching. Many indexing techniques have arisen to provide efficient and effective similarity searching. However, these methods are restricted to a specific domain, such as the metric space model. If this prerequisite is not fulfilled, indexing cannot be used, and each similarity search query degrades to sequential scanning, which is unacceptable for large datasets. Inspired by previous successful results, we decided to apply the principles of genetic programming to the area of database indexing. We developed GP-SIMDEX, a universal framework capable of finding precise and efficient indexing methods for similarity searching for any given similarity data. For this purpose, we introduce the inequality symbolic regression principle and show how it helps the GP-SIMDEX framework find appropriate results that in most cases outperform the best-known indexing methods.
Pages: 901-908
Page count: 8