A heterogeneous computing system for data mining workflows in multi-agent environments

被引:6
|
作者
Luo, Ping [1 ]
Lu, Kevin
Huang, Rui
He, Qing
Shi, Zhongzhi
机构
[1] Chinese Acad Sci, Key Lab Intelligent Informat Proc, Inst Comp Technol, Beijing, Peoples R China
[2] Brunel Univ, Uxbridge UB8 3PH, Middx, England
关键词
data mining; heterogeneous computing; directed acyclic graph; multi-agent system environment;
D O I
10.1111/j.1468-0394.2006.00408.x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The computing-intensive data mining (DM) process calls for the support of a heterogeneous computing system, which consists of multiple computers with different configurations connected by a high-speed large-area network for increased computational power and resources. The DM process can be described as a multi-phase pipeline process, and in each phase there could be many optional methods. This makes the workflow for DM very complex and it can be modeled only by a directed acyclic graph (DAG). A heterogeneous computing system needs an effective and efficient scheduling framework, which orchestrates all the computing hardware to perform multiple competitive DM workflows. Motivated by the need for a practical solution of the scheduling problem for the DM workflow, this paper proposes a dynamic DAG scheduling algorithm according to the characteristics of an execution time estimation model for DM jobs. Based on an approximate estimation of job execution time, this algorithm first maps DM jobs to machines in a decentralized and diligent (defined in this paper) manner. Then the performance of this initial mapping can be improved through job migrations when necessary. The scheduling heuristic used considers the factors of both the minimal completion time criterion and the critical path in a DAG. We implement this system in an established multi-agent system environment, in which the reuse of existing DM algorithms is achieved by encapsulating them into agents. The system evaluation and its usage in oil well logging analysis are also discussed.
引用
收藏
页码:258 / 272
页数:15
相关论文
共 50 条
  • [1] A heterogeneous computing system for data mining workflows
    Luo, Ping
    Lu, Kevin
    He, Qing
    Shi, Zhongzhi
    FLEXIBLE AND EFFICIENT INFORMATION HANDLING, 2006, 4042 : 177 - 189
  • [2] Multi-agent computing system in a heterogeneous network
    Uhruski, P
    Grochowski, M
    Schaefer, R
    PAR ELEC 2002: INTERNATIONAL CONFERENCE ON PARALLEL COMPUTING IN ELECTRICAL ENGINEERING, 2002, : 233 - 238
  • [3] Multi-agent computing system in a heterogeneous network
    Uhruski, P.
    Grochowski, M.
    Schaefer, R.
    Proceedings - International Conference on Parallel Computing in Electrical Engineering, PARELEC 2002, 2002, : 233 - 238
  • [4] A Customizable Multi-Agent System for Distributed Data Mining
    Di Fatta, Giuseppe
    Fortino, Giancarlo
    APPLIED COMPUTING 2007, VOL 1 AND 2, 2007, : 42 - +
  • [5] Multi-Agent System Using G-XMDR for Data Synchronization in Pervasive Computing Environments
    Kook, Youn-Gyou
    Kim, R. Young-Chul
    Choi, Young-Keun
    MULTI-AGENT SYSTEMS FOR SOCIETY, 2009, 4078 : 297 - +
  • [6] A Multi-Agent system for data collection from power generators and weather stations in heterogeneous environments
    Awan, Shahid M.
    Khan, Zubair A.
    Rehman, Abdur
    Mahmood, Waqar
    38TH ANNUAL CONFERENCE ON IEEE INDUSTRIAL ELECTRONICS SOCIETY (IECON 2012), 2012, : 4704 - 4708
  • [7] Multi-agent cluster system for optimal performance in heterogeneous computer environments
    Ikeda, T
    Hara, A
    Ichimura, T
    Takahama, T
    Taniguchi, Y
    Yamada, H
    Hakozaki, R
    Sakuda, H
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 1, PROCEEDINGS, 2005, 3681 : 932 - 937
  • [8] A Distributed Data Mining System Based on Multi-agent Technology
    郭黎明
    张艳珍
    Journal of DongHua University, 2006, (06) : 80 - 83
  • [9] Distributed data mining system based on multi-agent technology
    Guo, Li-Ming
    Zhang, Yan-Zhen
    Journal of Donghua University (English Edition), 2006, 23 (06) : 80 - 83
  • [10] A multi-agent conversational system with heterogeneous data sources access
    Eisman, Eduardo M.
    Navarro, Maria
    Luis Castro, Juan
    EXPERT SYSTEMS WITH APPLICATIONS, 2016, 53 : 172 - 191