SmallClient for big data: an indexing framework towards fast data retrieval

被引:15
|
作者
Siddiqa, Aisha [1 ]
Karim, Ahmad [2 ]
Chang, Victor [3 ]
机构
[1] Univ Malaya, Fac Comp Sci & Informat Technol, Kuala Lumpur 50603, Malaysia
[2] Bahauddin Zakariya Univ, Dept Informat Technol, Multan 60000, Pakistan
[3] Xian Jiaotong Liverpool Univ, IBSS, Suzhou 100044, Peoples R China
关键词
Big data; Big data indexing; Big data retrieval; Big data analytics; Query execution; Data search performance; CLOUD; EFFICIENT; PERFORMANCE; TAXONOMY; STORAGE;
D O I
10.1007/s10586-016-0712-4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Numerous applications are continuously generating massive amount of data and it has become critical to extract useful information while maintaining acceptable computing performance. The objective of this work is to design an indexing framework which minimizes indexing overhead and improves query execution and data search performance with optimum aggregation of computing performance. We propose SmallClient, an indexing framework to speed up query execution. SmallClient has three modules: block creation, index creation and query execution. Block creation module supports improving data retrieval performance with minimum data uploading overhead. Index creation module allows maximum indexes on a dataset to increase index hit ratio with minimized indexing overhead. Finally, query execution module offers incoming queries to utilize these indexes. The evaluation shows that SmallClient outperforms Hadoop full scan with more than 90% search performance. Meanwhile, indexing overhead of SmallClient is reduced to approximately 50 and 80% for index size and indexing time respectively.
引用
收藏
页码:1193 / 1208
页数:16
相关论文
共 50 条
  • [41] On the analysis of big data indexing execution strategies
    Siddiqa, Aisha
    Karim, Ahmad
    Saba, Tanzila
    Chang, Victor
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2017, 32 (05) : 3259 - 3271
  • [42] Challenges and recommendations in big data indexing strategies
    Mohamed M.A.
    Abdel-Fattah M.A.
    Khedr A.E.
    International Journal of e-Collaboration, 2021, 17 (02) : 22 - 39
  • [43] Storage and Query Indexing Methods on Big Data
    QingE Wu
    Yao Yu
    Lintao Zhou
    Yingbo Lu
    Hu Chen
    Xiaoliang Qian
    Arabian Journal for Science and Engineering, 2024, 49 : 7359 - 7374
  • [44] BIND: An Indexing Strategy for Big Data Processing
    Habbal, Adib
    Adamu, Fatima Binta
    Hassan, Suhaidi
    Cottrell, R. Les
    White, Bebo
    Kaiiali, Mustafa
    Wazan, Ahmad Samer
    TENCON 2017 - 2017 IEEE REGION 10 CONFERENCE, 2017, : 645 - 650
  • [45] Storage and Query Indexing Methods on Big Data
    Wu, QingE
    Yu, Yao
    Zhou, Lintao
    Lu, Yingbo
    Chen, Hu
    Qian, Xiaoliang
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2024, 49 (05) : 7359 - 7374
  • [46] A Big Data Approach for Health Data Information Retrieval
    Ciampi, Mario
    Masciari, Elio
    De Pietro, Giuseppe
    Silvestri, Stefano
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 2533 - 2540
  • [47] Toward a big data approach for indexing encrypted data in Cloud Computing
    Kaci, Abdellah
    Bouabana-Tebibel, Thouraya
    Rachedi, Abderrezak
    Yahiaoui, Chafia
    SECURITY AND PRIVACY, 2019, 2 (03)
  • [48] FENCE: Fast, ExteNsible, and ConsolidatEd Framework for Intelligent Big Data Processing
    Ramneek
    Cha, Seung-Jun
    Pack, Sangheon
    Jeon, Seung Hyub
    Jeong, Yeon Jeong
    Kim, Jin Mee
    Jung, Sungin
    IEEE ACCESS, 2020, 8 : 125423 - 125437
  • [49] Towards Real-time Collaborative Filtering for Big Fast Data
    Diaz-Aviles, Ernesto
    Nejdl, Wolfgang
    Drumond, Lucas
    Schmidt-Thieme, Lars
    PROCEEDINGS OF THE 22ND INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'13 COMPANION), 2013, : 779 - 780
  • [50] A systematic data characteristic understanding framework towards physical-sensor big data challenges
    Ma, Zhipeng
    Jorgensen, Bo Norregaard
    Ma, Zheng Grace
    JOURNAL OF BIG DATA, 2024, 11 (01)