CedCom: A High-Performance Architecture for Big Data Applications

被引:0
|
作者
Raynaud, Tanguy [1 ]
Haque, Rafiqul [1 ]
Ait-kaci, Hassan [1 ]
机构
[1] Claude Bernard Univ Lyon 1, Lab Informat Image & Syst Informat LIRIS, F-69100 Lyon, France
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Distributed architecture is widely used for storing and processing Big Data. Operations on Big Data need first, locating the required data blocks and then, reading them. Data can be located in different types of memories in particular, cache memory, main memory, and secondary memory. Reading data from secondary memory to process Big Data jobs is not an ideal approach especially for high performance applications because, accessing data in secondary devices can be slow for processors. In addition, fetching data from main memory is time consuming due to limited I/O bandwidth. These system level issues are barriers for optimizing performance of Big Data applications. Simply put, for optimizing the application performance, it is not sufficient to have efficient algorithms only, an efficient architecture is needed to provide faster data access by the processors. The need for such an architecture has been documented in the literature, however, the state of the art is still missing an efficient architecture. This paper develops a promising architecture which caches data in main memory. It essentially transforms a main memory into a attraction memory which enables high-speed data access. Also, it enables automatic migration of data blocks and computations across the nodes contained in the clusters. It offers an exchange protocol for fast transfer of data blocks between the different physical nodes and speeds up job processing. The proposed architecture combines the power of Cache-Only Memory Architecture (COMA) and the structural principle of Hadoop.
引用
收藏
页码:621 / 632
页数:12
相关论文
共 50 条
  • [1] High-performance modelling and simulation for big data applications
    Kolodziej, Joanna
    Gonzalez-Velez, Horacio
    Karatza, Helen D.
    SIMULATION MODELLING PRACTICE AND THEORY, 2017, 76 : 1 - 2
  • [2] FAST: A High-Performance Architecture for Heterogeneous Big Data Forensics
    Pungila, Ciprian
    Negru, Viorel
    INTERNATIONAL JOINT CONFERENCE SOCO'17- CISIS'17-ICEUTE'17 PROCEEDINGS, 2018, 649 : 618 - 627
  • [3] Rethinking High Performance Computing System Architecture for Scientific Big Data Applications
    Chen, Yong
    Chen, Chao
    Yin, Yanlong
    Sun, Xian-He
    Thakur, Rajeev
    Gropp, William D.
    2016 IEEE TRUSTCOM/BIGDATASE/ISPA, 2016, : 1605 - 1612
  • [4] Designing a novel high-performance FPGA architecture for data intensive applications
    Siozios, Kostas
    Soudris, Dimitrios
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2009, 4 (02) : 155 - 166
  • [5] Designing a novel high-performance FPGA architecture for data intensive applications
    Kostas Siozios
    Dimitrios Soudris
    Journal of Real-Time Image Processing, 2009, 4 : 155 - 166
  • [6] High-Performance Extreme Learning Machines: A Complete Toolbox for Big Data Applications
    Akusok, Anton
    Bjork, Kaj-Mikael
    Miche, Yoan
    Lendasse, Amaury
    IEEE ACCESS, 2015, 3 : 1011 - 1025
  • [7] High-Performance Computing for Big Data Processing
    Wu, Yulei
    Xiang, Yang
    Ge, Jingguo
    Muller, Peter
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 88 : 693 - 695
  • [8] Contributions to High-Performance Big Data Computing
    Fox, Geoffrey
    Qiu, Judy
    Crandall, David
    Von Laszewski, Gregor
    Beckstein, Oliver
    Paden, John
    Paraskevakos, Ioannis
    Jha, Shantenu
    Wang, Fusheng
    Marathe, Madhav
    Vullikanti, Anil
    Cheatham, Thomas
    FUTURE TRENDS OF HPC IN A DISRUPTIVE SCENARIO, 2019, 34 : 34 - 81
  • [9] Performance prediction of data streams on high-performance architecture
    Gautam, Bhaskar
    Basava, Annappa
    HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES, 2019, 9 (01)
  • [10] Sigma: a Scalable High Performance Big Data Architecture
    Cassavia, Nunziato
    Masciari, Elio
    2021 29TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING (PDP 2021), 2021, : 236 - 239