In-Memory Big Data Management and Processing: A Survey

被引:235
|
作者
Zhang, Hao [1 ]
Chen, Gang [2 ]
Ooi, Beng Chin [1 ]
Tan, Kian-Lee [1 ]
Zhang, Meihui [3 ]
机构
[1] Natl Univ Singapore, Sch Comp, Singapore 117417, Singapore
[2] Zhejiang Univ, Coll Comp Sci, Hangzhou 310027, Zhejiang, Peoples R China
[3] Singapore Univ Technol & Design, Informat Syst Technol & Design Pillar, Singapore 487372, Singapore
关键词
Primary memory; DRAM; relational databases; distributed databases; query processing; PHASE-CHANGE MEMORY; HIGH-PERFORMANCE; SCALABLE SYSTEM; MULTI-CORE; COLD DATA; B+-TREES; SAP HANA; MAIN; TRANSACTION; JOINS;
D O I
10.1109/TKDE.2015.2427795
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Growing main memory capacity has fueled the development of in-memory big data management and processing. By eliminating disk I/O bottleneck, it is now possible to support interactive data analytics. However, in-memory systems are much more sensitive to other sources of overhead that do not matter in traditional I/O-bounded disk-based systems. Some issues such as fault-tolerance and consistency are also more challenging to handle in in-memory environment. We are witnessing a revolution in the design of database systems that exploits main memory as its data storage layer. Many of these researches have focused along several dimensions: modern CPU and memory hierarchy utilization, time/space efficiency, parallelism, and concurrency control. In this survey, we aim to provide a thorough review of a wide range of in-memory data management and processing proposals and systems, including both data storage systems and data processing frameworks. We also give a comprehensive presentation of important technology in memory management, and some key factors that need to be considered in order to achieve efficient in-memory data management and processing.
引用
收藏
页码:1920 / 1948
页数:29
相关论文
共 50 条
  • [41] A Many-core Architecture for In-Memory Data Processing
    Agrawal, Sandeep R.
    Idicula, Sam
    Raghavan, Arun
    Vlachos, Evangelos
    Govindaraju, Venkatraman
    Varadarajan, Venkatanathan
    Balkesen, Cagri
    Giannikis, Georgios
    Roth, Charlie
    Agarwal, Nipun
    Sedlar, Eric
    50TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO), 2017, : 245 - 258
  • [42] Deca: A Garbage Collection Optimizer for In-Memory Data Processing
    Shi, Xuanhua
    Ke, Zhixiang
    Zhou, Yongluan
    Jin, Hai
    Lu, Lu
    Zhang, Xiong
    He, Ligang
    Hu, Zhenyu
    Wang, Fei
    ACM TRANSACTIONS ON COMPUTER SYSTEMS, 2019, 36 (01):
  • [43] Memory-Disaggregated In-Memory Object Store Framework for Big Data Applications
    Abrahamse, Robin
    Hadnagy, Akos
    Al-Ars, Zaid
    Proceedings - 2022 IEEE 36th International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2022, 2022, : 1228 - 1234
  • [44] Memory-Disaggregated In-Memory Object Store Framework for Big Data Applications
    Abrahamse, Robin
    Hadnagy, Akos
    Al-Ars, Zaid
    2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2022), 2022, : 1228 - 1234
  • [45] A hybrid memory built by SSD and DRAM to support in-memory Big Data analytics
    Zhiguang Chen
    Yutong Lu
    Nong Xiao
    Fang Liu
    Knowledge and Information Systems, 2014, 41 : 335 - 354
  • [46] Employing In-Memory Data Grids for Distributed Graph Processing
    Tasci, Serafettin
    Demirbas, Murat
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 1856 - 1864
  • [47] Data Processing and Information Classification-An In-Memory Approach
    Andrighetti, Milena
    Turvani, Giovanna
    Santoro, Giulia
    Vacca, Marco
    Marchesin, Andrea
    Ottati, Fabrizio
    Roch, Massimo Ruo
    Graziano, Mariagrazia
    Zamboni, Maurizio
    SENSORS, 2020, 20 (06)
  • [48] MEMTUNE: Dynamic Memory Management for In-memory Data Analytic Platforms
    Xu, Luna
    Li, Min
    Zhang, Li
    Butt, Ali R.
    Wang, Yandong
    Hu, Zane Zhenhua
    2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2016), 2016, : 383 - 392
  • [49] SparkNN: A distributed in-memory data partitioning for KNN queries on big spatial data
    Al Aghbari Z.
    Ismail T.
    Kamel I.
    Data Science Journal, 2020, 19 (01) : 1 - 14
  • [50] Dynamic Data Migration in Hybrid Main Memories for In-Memory Big Data Storage
    Mai, Hai Thanh
    Park, Kyoung Hyun
    Lee, Hun Soon
    Kim, Chang Soo
    Lee, Miyoung
    Hur, Sung Jin
    ETRI JOURNAL, 2014, 36 (06) : 988 - 998