A heterogeneous computing system for data mining workflows

被引:0
|
作者
Luo, Ping
Lu, Kevin
He, Qing
Shi, Zhongzhi
机构
[1] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100080, Peoples R China
[2] Brunel Univ, Uxbridge UB8 3PH, Middx, England
[3] Chinese Acad Sci, Grad Sch, Beijing, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The computing-intensive Data Mining (DM) process calls for the support of a Heterogeneous Computing (HC) system, which consists of multiple computers with different configurations, connected by a high-speed LAN, for increased computational power and resources. DM process can be described as a multi-phase pipeline process, and in each phase there could be many optional methods. This makes the workflow of DM very complex and can be modelled only by a Directed Acyclic Graph (DAG). An HC system needs an effective and efficient scheduling framework, which orchestrates all the computing hardware to perform multiple competitive DM workflows. Motivated by the need of a practical solution of the scheduling problem for the DM workflow, this paper proposes a dynamic DAG scheduling algorithm according to the characteristics of execution time estimation model for DM jobs. Based on an approximate estimation of job execution time, this algorithm first maps DM jobs to machines in a decentralized and diligent (defined in this paper) manner. Then the performance of this initial mapping can be improved through job migrations when necessary. The scheduling heuristic used in it considers the factors of both the minimal completion time criterion and the critical path in a DAG. We implement this system in an established Multi-Agent System (MAS) environment, in which the reuse of existing DM algorithms is achieved by encapsulating them into agents. Practical classification problems are used to test and measure the system performance. The detailed experiment procedure and result analysis are also discussed in this paper.
引用
收藏
页码:177 / 189
页数:13
相关论文
共 50 条
  • [1] A heterogeneous computing system for data mining workflows in multi-agent environments
    Luo, Ping
    Lu, Kevin
    Huang, Rui
    He, Qing
    Shi, Zhongzhi
    EXPERT SYSTEMS, 2006, 23 (05) : 258 - 272
  • [2] Scheduling of Big Data Workflows in the Hadoop Framework with Heterogeneous Computing Cluster
    Rahmani, Amir Masoud
    Chamzini, Ehsan Yazdani
    Pourshaban, Mohsen
    Hosseinzadeh, Mehdi
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2024,
  • [3] Heterogeneous Computing Systems for Complex Scientific Discovery Workflows
    Hagleitner, Christoph
    Diamantopoulos, Dionysios
    Ringlein, Burkhard
    Evangelinos, Constantinos
    Johns, Charles
    Chang, Rong N.
    D'Amora, Bruce
    Kahle, James A.
    Sexton, James
    Johnston, Michael
    Pyzer-Knapp, Edward
    Ward, Chris
    PROCEEDINGS OF THE 2021 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2021), 2021, : 13 - 18
  • [4] Supporting Distributed Application Workflows in Heterogeneous Computing Environments
    Wu, Qishi
    Gu, Yi
    PROCEEDINGS OF THE 2008 14TH IEEE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, 2008, : 3 - 10
  • [5] Mining Workflows for Anomalous Data Transfers
    Tu, Huy
    Papadimitriou, George
    Kiran, Mariam
    Wang, Cong
    Mandal, Anirban
    Deelman, Ewa
    Menzies, Tim
    2021 IEEE/ACM 18TH INTERNATIONAL CONFERENCE ON MINING SOFTWARE REPOSITORIES (MSR 2021), 2021, : 1 - 12
  • [6] Supporting agricultural communities with workflows on heterogeneous computing resources
    Balasko, Akos
    Lovas, Robert
    Gergely, Mark
    Manolis, Nikos
    2014 6TH INTERNATIONAL WORKSHOP ON SCIENCE GATEWAYS (IWSG), 2014, : 18 - 23
  • [7] Time and Energy Optimization Algorithms for the Static Scheduling of Multiple Workflows in Heterogeneous Computing System
    Jiang, Junqiang
    Lin, Yaping
    Xie, Guoqi
    Fu, Li
    Yang, Junfeng
    JOURNAL OF GRID COMPUTING, 2017, 15 (04) : 435 - 456
  • [8] Time and Energy Optimization Algorithms for the Static Scheduling of Multiple Workflows in Heterogeneous Computing System
    Junqiang Jiang
    Yaping Lin
    Guoqi Xie
    Li Fu
    Junfeng Yang
    Journal of Grid Computing, 2017, 15 : 435 - 456
  • [9] Optimizing Distributed Computing Workflows in Heterogeneous Network Environments
    Gu, Yi
    Wu, Qishi
    DISTRIBUTED COMPUTING AND NETWORKING, PROCEEDINGS, 2010, 5935 : 142 - 154
  • [10] Dynamic Mapping of Application Workflows in Heterogeneous Computing Environments
    Qasim, Muhammad
    Iqbal, Touseef
    Munir, Ehsan Ullah
    Tziritas, Nikos
    Khan, Samee U.
    Yang, Laurence T.
    2017 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2017, : 462 - 471