High-performance data mining with intelligent SSD

被引:0
|
作者
Yong-Yeon Jo
Sang-Wook Kim
Sung-Woo Cho
Duck-Ho Bae
Hyunok Oh
机构
[1] Hanyang University,Department of Computer and Software
[2] Hanyang University,Department of Information Systems
来源
Cluster Computing | 2017年 / 20卷
关键词
Intelligent SSD; Simulator-based evaluation; Collaborative processing; Heterogeneous scheduling;
D O I
暂无
中图分类号
学科分类号
摘要
An intuitive way to process the big data efficiently is to reduce the volume of data transferred over the storage interface to a host system. This is the reason that the notion of intelligent SSD (iSSD) was proposed to give processing power to SSD. There is rich literature on iSSD, however, its real implementation has not been provided to the public yet. Most prior work aims to quantify the benefits of iSSD with analytical modeling. In this paper, we first develop on iSSD simulator and present the potential of iSSD in data mining through the iSSD simulator. Our iSSD simulator performs on top of the gem 5 simulator and fully simulates all the processes of data mining algorithms running in iSSD with cycle-level accuracy. Then, we further addresse how to exploit all the computing resources for efficient processing of data mining algorithms. These days, CPU, GPU, and SSD are recently equipped together in most computing environment. If SSD is replaced with iSSD later on, we have a new computing environment where the three computing resources collaborate one another to process big data quite effectively. For this, scheduling is required to decide which computing resource is going to run for which function at which time. In our heterogeneous scheduling, types of computing resources, memory sizes in computing resources, and inter-processor communication times including IO time in SSD are considered. Our scheduling results show that processing in the collaborative environment outperforms that in the traditional one by up to about 10 times.
引用
收藏
页码:1155 / 1166
页数:11
相关论文
共 50 条
  • [41] High-Performance Biomedical Association Mining with MapReduce
    Ji, Yanqing
    Tian, Yun
    Shen, Fangyang
    Tran, John
    2015 12TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY - NEW GENERATIONS, 2015, : 465 - 470
  • [42] An open multi-tier architecture for high-performance data mining using SOA
    Rahman, Muhammad Mushfiqur
    Maksud-Ul-Alam
    Rahman, S. M. Monzurur
    INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2015, 7 (01) : 60 - 82
  • [43] iTransformer: Using SSD to Improve Disk Scheduling for High-performance I/O
    Zhang, Xuechen
    Davis, Kei
    Jiang, Song
    2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2012, : 715 - 726
  • [44] Network-on-SSD: A Scalable and High-Performance Communication Design Paradigm for SSDs
    Tavakkol, Arash
    Arjomand, Mohammad
    Sarbazi-Azad, Hamid
    IEEE COMPUTER ARCHITECTURE LETTERS, 2013, 12 (01) : 5 - 8
  • [45] Delayed Partial Parity Scheme for Reliable and High-Performance Flash Memory SSD
    Im, Soojun
    Shin, Dongkun
    2010 IEEE 26TH SYMPOSIUM ON MASS STORAGE SYSTEMS AND TECHNOLOGIES (MSST), 2010,
  • [46] Pageserver: High-Performance SSD-Based Checkpointing of Transactional Distributed Memory
    Gerhold, Steffen
    Kaemmer, Nico
    Weggerle, Alexander
    Himpel, Christian
    Schulthess, Peter
    2010 SECOND INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND APPLICATIONS: ICCEA 2010, PROCEEDINGS, VOL 1, 2010, : 235 - 239
  • [47] High-Performance Breaking and Intelligent of Miniature Circuit Breakers
    Yin, Jianning
    Lang, Xiaojian
    Xu, Haotian
    Duan, Jiandong
    SENSORS, 2022, 22 (16)
  • [48] A high-performance distributed algorithm for mining association rules
    Assaf Schuster
    Ran Wolff
    Dan Trock
    Knowledge and Information Systems, 2005, 7 : 458 - 475
  • [49] A high-performance distributed algorithm for mining association rules
    Schuster, A
    Wolff, R
    Trock, D
    KNOWLEDGE AND INFORMATION SYSTEMS, 2005, 7 (04) : 458 - 475
  • [50] A high-performance distributed algorithm for mining association rules
    Schuster, A
    Wolff, R
    Trock, D
    THIRD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2003, : 291 - 298