Research on Data Query Optimization Based on SparkSQL and MongoDB

被引:3
|
作者
Chen, Yujun [1 ]
Lou, Yuansheng [1 ]
Ye, Feng [1 ]
机构
[1] Hohai Univ, Coll Comp & Informat, Nanjing, Jiangsu, Peoples R China
关键词
component; massive data analysis; apache spark; mongodb; query optimization;
D O I
10.1109/DCABES.2018.00046
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
With the arrival of the era of big data, the analysis and processing of massive data has become a very critical computing problem. This paper proposes a query optimization method based on SparkSQL and MongoDB. It analyzes the principle and compares it with other literature in order to draw the conclusion. The conclusion shows that when dealing with problems such as interactive SQL queries, the Apache Spark engine can reasonably decompose the tasks based on the dependencies between the massive data, thereby reducing the data query processing time and improving the operating efficiency. Also it is very suitable for storing some simple data with large amount due to flexible query and index of MongoDB. Obviously, the combination of the two can significantly improve the query speed of massive data.
引用
收藏
页码:144 / 147
页数:4
相关论文
共 50 条
  • [1] Conceptual Graphs Based Modeling of MongoDB Data Structure and Query
    Varga, Viorica
    Andor, Camelia-Florina
    Sacarea, Christian
    GRAPH-BASED REPRESENTATION AND REASONING (ICCS 2019), 2019, 11530 : 262 - 270
  • [2] First Past the Post: Evaluating Query Optimization in MongoDB
    Tao, Dawei
    Liu, Enqi
    Kadupitige, Sidath Randeni
    Cahill, Michael
    Fekete, Alan
    Rohm, Uwe
    DATABASES THEORY AND APPLICATIONS, ADC 2024, 2025, 15449 : 99 - 113
  • [3] Evaluation of storage and query performance of sensor based Internet of Things data with MongoDB
    Yilmaz, Nazim
    Alatli, Oylum
    Ciloglugil, Birol
    Erdur, Riza Cenk
    2018 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND DATA PROCESSING (IDAP), 2018,
  • [4] A Graph Based Knowledge and Reasoning Representation Approach for Modeling MongoDB Data Structure and Query
    Andor, Camelia-Florina
    Varga, Viorica
    Sacarea, Christian
    2019 27TH INTERNATIONAL CONFERENCE ON SOFTWARE, TELECOMMUNICATIONS AND COMPUTER NETWORKS (SOFTCOM), 2019, : 263 - 268
  • [5] Query Optimization Based on Data Provenance
    Huang Li
    Cheng Hongbing
    NEW TRENDS AND APPLICATIONS OF COMPUTER-AIDED MATERIAL AND ENGINEERING, 2011, 186 : 586 - 590
  • [6] Research on big data processing and analysis architecture based on MongoDB
    1600, Academy of Sciences of the Czech Republic, Dolejskova 5, Praha 8, 182 00, Czech Republic (61):
  • [7] Research on classification query optimization algorithm in data stream
    Zhou Hong
    Wang Bin
    Fu Chunyan
    Zhi Yuan
    Xue Jiamei
    PROCEEDINGS OF THE 2015 2ND INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER ENGINEERING AND ELECTRONICS (ICECEE 2015), 2015, 24 : 1219 - 1223
  • [8] Research on Big Data Storage Structure and Query Optimization
    Zhang, Jinhai
    2017 INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS, ELECTRONICS AND CONTROL (ICCSEC), 2017, : 1508 - 1511
  • [9] Research on Query Analysis and Optimization Based on Spark
    Li, Yan
    Wang, Hongbo
    Li, Yangyang
    PROCEEDINGS OF 2017 6TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2017), 2017, : 251 - 255
  • [10] A Mapping-Based Method to Query MongoDB Documents with SPARQL
    Michel, Franck
    Faron-Zucker, Catherine
    Montagnat, Johan
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2016, PT II, 2016, 9828 : 52 - 67