In-Memory Big Data Management and Processing: A Survey

被引:235
|
作者
Zhang, Hao [1 ]
Chen, Gang [2 ]
Ooi, Beng Chin [1 ]
Tan, Kian-Lee [1 ]
Zhang, Meihui [3 ]
机构
[1] Natl Univ Singapore, Sch Comp, Singapore 117417, Singapore
[2] Zhejiang Univ, Coll Comp Sci, Hangzhou 310027, Zhejiang, Peoples R China
[3] Singapore Univ Technol & Design, Informat Syst Technol & Design Pillar, Singapore 487372, Singapore
关键词
Primary memory; DRAM; relational databases; distributed databases; query processing; PHASE-CHANGE MEMORY; HIGH-PERFORMANCE; SCALABLE SYSTEM; MULTI-CORE; COLD DATA; B+-TREES; SAP HANA; MAIN; TRANSACTION; JOINS;
D O I
10.1109/TKDE.2015.2427795
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Growing main memory capacity has fueled the development of in-memory big data management and processing. By eliminating disk I/O bottleneck, it is now possible to support interactive data analytics. However, in-memory systems are much more sensitive to other sources of overhead that do not matter in traditional I/O-bounded disk-based systems. Some issues such as fault-tolerance and consistency are also more challenging to handle in in-memory environment. We are witnessing a revolution in the design of database systems that exploits main memory as its data storage layer. Many of these researches have focused along several dimensions: modern CPU and memory hierarchy utilization, time/space efficiency, parallelism, and concurrency control. In this survey, we aim to provide a thorough review of a wide range of in-memory data management and processing proposals and systems, including both data storage systems and data processing frameworks. We also give a comprehensive presentation of important technology in memory management, and some key factors that need to be considered in order to achieve efficient in-memory data management and processing.
引用
收藏
页码:1920 / 1948
页数:29
相关论文
共 50 条
  • [1] Fast and Efficient In-Memory Big Data Processing
    Malik, Babur Hayat
    Maryam, Maliha
    Khalid, Myda
    Khlaid, Javaria
    Rehman, Naj Am Ur
    Sajjad, Syeda Iqra
    Islam, Tanveer
    Butt, Umair Ahmed
    Raza, Ali
    Nasr, M. Saad
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (05) : 517 - 524
  • [2] Timo: In-Memory Temporal Query Processing for Big Temporal Data
    Zheng, Xiao
    Liu, Hou-kai
    Wei, Lin-na
    Wu, Xuan-gou
    Zhang, Zhen
    2019 SEVENTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD), 2019, : 121 - 126
  • [3] Survey of In-memory Big Data Analytics and Latest Research Opportunities
    Gangarde, Rupali
    Pawar, Ambika
    Dani, Ajay
    2016 FOURTH INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (PDGC), 2016, : 197 - 201
  • [4] Timo: In-memory temporal query processing for big temporal data
    Zheng, Xiao
    Liu, Houkai
    Wang, Xiujun
    Wu, Xuangou
    Yu, Feng
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2023, 35 (13):
  • [5] In-Memory Performance for Big Data
    Graefe, Goetz
    Volos, Haris
    Kimura, Hideaki
    Kuno, Harumi
    Tucek, Joseph
    Lillibridge, Mark
    Veitch, Alistair
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2014, 8 (01): : 37 - 48
  • [6] LocationSpark: A Distributed In-Memory Data Management System for Big Spatial Data
    Tang, Mingjie
    Yu, Yongyang
    Malluhi, Qutaibah M.
    Ouzzani, Mourad
    Aref, Walid G.
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2016, 9 (13): : 1565 - 1568
  • [7] MemepiC: Towards a Unified In-Memory Big Data Management System
    Cai, Qingchao
    Zhang, Hao
    Guo, Wentian
    Chen, Gang
    Ooi, Beng Chin
    Tan, Kian-Lee
    Wong, Weng-Fai
    IEEE TRANSACTIONS ON BIG DATA, 2019, 5 (01) : 4 - 17
  • [8] Representing a Model for the Anonymization of Big Data Stream Using In-Memory Processing
    Shamsinejad E.
    Banirostam T.
    Pedram M.M.
    Rahmani A.M.
    Annals of Data Science, 2025, 12 (1) : 223 - 252
  • [9] Massively Parallel Big Data Classification on a Programmable Processing In-Memory Architecture
    Kim, Yeseong
    Imani, Mohsen
    Gupta, Saransh
    Zhou, Minxuan
    Rosing, Tajana S.
    2021 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN (ICCAD), 2021,
  • [10] DigitalPIM: Digital-based Processing In-Memory for Big Data Acceleration
    Imani, Mohsen
    Gupta, Saransh
    Kim, Yeseong
    Zhou, Minxuan
    Rosing, Tajana
    GLSVLSI '19 - PROCEEDINGS OF THE 2019 ON GREAT LAKES SYMPOSIUM ON VLSI, 2019, : 429 - 434