Loosely-specified query processing in large-scale information systems

被引:0
|
作者
Nica, A
Rundensteiner, EA
机构
[1] Univ Michigan, Dept Elect Engn & Comp Sci, Ann Arbor, MI 48109 USA
[2] Worcester Polytech Inst, Dept Comp Sci, Worcester, MA 01609 USA
关键词
heterogeneous information systems; loosely-specified queries; query templates; query planning; distributed system;
D O I
10.1142/S0218843097000124
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Challenging issues for processing queries specified over large-scale information spaces (for example, Digital Libraries or the World Wide Web) include the diversity of the information sources in terms of their structures, query interfaces and search capabilities, as well as the dynamics of sources continuously being added, removed or upgraded. In this paper, we give an innovative solution for query planning in such environments. The foundation of our solution is the Dynamic Information Integration Model (DIIM) which supports the specification of not only content but also capabilities of resources without requiring the establishment of a uniform integration schema. Besides the development of the DIIM model, contributions of this paper include: (1) the introduction of the notion of fully specified queries that are semantically equivalent to a loosely-specified query; (2) a translation algorithm of a loosely-specified query into a set of semantically equivalent feasible query plans that are consistent with the binding patterns of query templates of the individual sources (capability descriptions in DIIM) and with interrelationships between information sources (expressed as join constraints in DIIM); and (3) a search restriction algorithm for optimizing query processing by pruning the search space into the relevant subspace of a query. The plans obtained by the proposed query planning process which is composed of the search restriction and translation algorithms can be shown to correspond to query plans semantically equivalent to the initial loosely-specified input query.
引用
收藏
页码:241 / 268
页数:28
相关论文
共 50 条
  • [41] Query by Example in Large-Scale Code Repositories
    Balachandran, Vipin
    2015 31ST INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME) PROCEEDINGS, 2015, : 467 - 476
  • [42] Large Scale Hamming Distance Query Processing
    Liu, Alex X.
    Shen, Ke
    Torng, Eric
    IEEE 27TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2011), 2011, : 553 - 564
  • [43] Supporting Program Comprehension through Fast Query response in Large-Scale Systems
    Lin, Jinfeng
    Liu, Yalin
    Cleland-Huang, Jane
    2020 IEEE/ACM 28TH INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION, ICPC, 2020, : 285 - 295
  • [44] Implementing large-scale autonomic server monitoring using process query systems
    Roblee, C
    Berk, V
    Cybenko, G
    ICAC 2005: Second International Conference on Autonomic Computing, Proceedings, 2005, : 123 - 133
  • [45] Evolutionary approach for semantic-based query sampling in large-scale information sources
    Jung, Jason J.
    INFORMATION SCIENCES, 2012, 182 (01) : 30 - 39
  • [46] Research on massive information query and intelligent analysis method in a complex large-scale system
    Wang, Dailin
    Lv, Yunlei
    Ren, Danting
    Li, Linhui
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2019, 16 (04) : 2906 - 2926
  • [47] On the Throughput Optimization in Large-scale Batch-processing Systems
    Kar, Sounak
    Rehrmann, Robin
    Mukhopadhyay, Arpan
    Alt, Bastian
    Ciucu, Florin
    Koeppl, Heinz
    Binnig, Carsten
    Rizk, Amr
    PERFORMANCE EVALUATION, 2020, 144
  • [48] On the Throughput Optimization in Large-Scale Batch-Processing Systems
    Kar S.
    Rehrmann R.
    Mukhopadhyay A.
    Alt B.
    Ciucu F.
    Koeppl H.
    Binnig C.
    Rizk A.
    Performance Evaluation Review, 2021, 48 (03): : 128 - 129
  • [49] Large-scale processing of coals
    Procycat, F
    ZEITSCHRIFT DES VEREINES DEUTSCHER INGENIEURE, 1933, 77 : 893 - 897