In-Memory Big Data Management and Processing: A Survey

被引:235
|
作者
Zhang, Hao [1 ]
Chen, Gang [2 ]
Ooi, Beng Chin [1 ]
Tan, Kian-Lee [1 ]
Zhang, Meihui [3 ]
机构
[1] Natl Univ Singapore, Sch Comp, Singapore 117417, Singapore
[2] Zhejiang Univ, Coll Comp Sci, Hangzhou 310027, Zhejiang, Peoples R China
[3] Singapore Univ Technol & Design, Informat Syst Technol & Design Pillar, Singapore 487372, Singapore
关键词
Primary memory; DRAM; relational databases; distributed databases; query processing; PHASE-CHANGE MEMORY; HIGH-PERFORMANCE; SCALABLE SYSTEM; MULTI-CORE; COLD DATA; B+-TREES; SAP HANA; MAIN; TRANSACTION; JOINS;
D O I
10.1109/TKDE.2015.2427795
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Growing main memory capacity has fueled the development of in-memory big data management and processing. By eliminating disk I/O bottleneck, it is now possible to support interactive data analytics. However, in-memory systems are much more sensitive to other sources of overhead that do not matter in traditional I/O-bounded disk-based systems. Some issues such as fault-tolerance and consistency are also more challenging to handle in in-memory environment. We are witnessing a revolution in the design of database systems that exploits main memory as its data storage layer. Many of these researches have focused along several dimensions: modern CPU and memory hierarchy utilization, time/space efficiency, parallelism, and concurrency control. In this survey, we aim to provide a thorough review of a wide range of in-memory data management and processing proposals and systems, including both data storage systems and data processing frameworks. We also give a comprehensive presentation of important technology in memory management, and some key factors that need to be considered in order to achieve efficient in-memory data management and processing.
引用
收藏
页码:1920 / 1948
页数:29
相关论文
共 50 条
  • [31] Online Data Deduplication for In-Memory Big-Data Analytic Systems
    Sun, Yushi
    Zeng, Catherine Y.
    Chung, Jaeyoon
    Huang, Zhe
    2017 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2017,
  • [32] Demystifying Intel Data Streaming Accelerator for In-Memory Data Processing
    Berthold, Andre
    Fuerst, Constantin
    Obersteiner, Antonia
    Schmidt, Lennart
    Habich, Dirk
    Lehner, Wolfgang
    Schirmeier, Horst
    PROCEEDINGS OF THE 2ND WORKSHOP ON DISRUPTIVE MEMORY SYSTEMS, DIMES 2024, 2024, : 9 - 16
  • [33] Management and Analytic of Biomedical Big Data with Cloud-based In-Memory Database and Dynamic Querying
    Feng, Mengling
    Ghassemi, Mohammad
    Brennan, Thomas
    Ellenberger, John
    Hussain, Ishrar
    Mark, Roger
    PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, : 1970 - 1970
  • [34] An Energy Efficient In-Memory Computing Architecture Using Reconfigurable Magnetic Logic Circuits for Big Data Processing
    Gargari, Milad Ashtari
    Eslami, Nima
    Moaiyeri, Mohammad Hossein
    IEEE TRANSACTIONS ON MAGNETICS, 2023, 59 (12) : 1 - 10
  • [35] Load Balancing Scheme for Supporting Real-time Processing of Big Data in Distributed In-Memory Systems
    Bok, Kyoungsoo
    Choi, Kitae
    Lim, Jongtae
    Yoo, Jaesoo
    PROCEEDINGS OF THE 2018 CONFERENCE ON RESEARCH IN ADAPTIVE AND CONVERGENT SYSTEMS (RACS 2018), 2018, : 170 - 174
  • [36] In-memory Spatial-Aware Framework for Processing Proximity-Alike Queries in Big Spatial Data
    Al Jawarneh, Isam Mashhour
    Bellavista, Paolo
    Corradi, Antonio
    Foschini, Luca
    Montanari, Rebecca
    Zanotti, Andrea
    2018 IEEE 23RD INTERNATIONAL WORKSHOP ON COMPUTER AIDED MODELING AND DESIGN OF COMMUNICATION LINKS AND NETWORKS (CAMAD), 2018, : 86 - 91
  • [37] In-Memory Computing Architectures for Big Data and Machine Learning Applications
    Snasel, Vaclav
    Tran Khanh Dang
    Pham, Phuong N. H.
    Kueng, Josef
    Kong, Lingping
    FUTURE DATA AND SECURITY ENGINEERING. BIG DATA, SECURITY AND PRIVACY, SMART CITY AND INDUSTRY 4.0 APPLICATIONS, FDSE 2022, 2022, 1688 : 19 - 33
  • [38] Distributed In-memory Data Management for Workflow Executions
    Souza, Renan
    Silva, Vitor
    Lima, Alexandre A. B.
    de Oliveira, Daniel
    Valduriez, Patrick
    Mattoso, Marta
    PeerJ Computer Science, 2021, 7 : 1 - 30
  • [39] Distributed in-memory data management for workflow executions
    Souza, Renan
    Silva, Vitor
    Lima, Alexandre A. B.
    de Oliveira, Daniel
    Valduriez, Patrick
    Mattoso, Marta
    PEERJ COMPUTER SCIENCE, 2021,
  • [40] A hybrid memory built by SSD and DRAM to support in-memory Big Data analytics
    Chen, Zhiguang
    Lu, Yutong
    Xiao, Nong
    Liu, Fang
    KNOWLEDGE AND INFORMATION SYSTEMS, 2014, 41 (02) : 335 - 354