Data Distribution and Scheduling for Distributed Analytics Tasks

Authors
Pasteris, Stephen [1]
Wang, Shiqiang [2]
Makaya, Christian [2]
Chan, Kevin [3]
Herbster, Mark [1]
Affiliations
[1] UCL, Dept Comp Sci, London, England
[2] IBM TJ Watson Res Ctr, Yorktown Hts, NY USA
[3] US Army, Res Lab, Adelphi, MD USA
Keywords
Data placement; Internet of Things (IoT); maximum flow problem; mobile edge computing; optimization; FLOW; ALGORITHM;
DOI
Not available
Chinese Library Classification number
TP301 [Theory, methods];
Discipline classification code
081202;
Abstract
We consider a distributed edge computing system consisting of a number of interconnected machines with limited communication bandwidth and storage capacity. Analytics tasks run on the machines: each task runs on a single machine but may require data from multiple other machines. Every task requires a given amount of data to run and must receive all of its data within a specific deadline. Because each machine has limited storage, we usually cannot place all the data for a specific task on the single machine that executes it. We assume that task execution is sparse in time, so that at most one task is executed in the system at any time. The problem we study in this paper is how to distribute the data across the machines in the system without violating the bandwidth and storage constraints, while ensuring that the data transfer deadlines are met. We prove that the optimal solution to this problem is equivalent to that of a max-flow problem on a specifically constructed graph. We show how to construct this graph so that the problem can be solved using standard max-flow algorithms, and we also provide numerical results and further discussion.
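The reduction to max-flow described in the abstract can be illustrated with a small sketch. The abstract does not give the paper's graph construction, so the graph below is an assumption made for illustration only: a single task, direct links from each machine to the executing machine, a source-to-machine edge capped by storage, and a machine-to-task edge capped by bandwidth times deadline. The names `max_flow` and `placement_flow` and all numbers are hypothetical. The max-flow routine is a plain Edmonds-Karp implementation, one of the standard algorithms the abstract alludes to.

```python
from collections import defaultdict, deque


def max_flow(capacity, source, sink):
    """Edmonds-Karp: augment along shortest residual paths until none remain."""
    # residual[u][v] is the remaining capacity on edge u -> v.
    residual = defaultdict(lambda: defaultdict(int))
    for u, edges in capacity.items():
        for v, c in edges.items():
            residual[u][v] += c
            residual[v][u] += 0  # ensure the reverse edge exists
    total = 0
    while True:
        # Breadth-first search for a path with spare capacity.
        parent = {source: None}
        queue = deque([source])
        while queue and sink not in parent:
            u = queue.popleft()
            for v, c in residual[u].items():
                if c > 0 and v not in parent:
                    parent[v] = u
                    queue.append(v)
        if sink not in parent:
            return total
        # Find the bottleneck capacity along the path found.
        bottleneck = float("inf")
        v = sink
        while parent[v] is not None:
            bottleneck = min(bottleneck, residual[parent[v]][v])
            v = parent[v]
        # Push the bottleneck amount along the path.
        v = sink
        while parent[v] is not None:
            u = parent[v]
            residual[u][v] -= bottleneck
            residual[v][u] += bottleneck
            v = u
        total += bottleneck


def placement_flow(storage, bandwidth, deadline, demand, executor):
    """Hypothetical single-task graph; NOT the construction from the paper."""
    capacity = {"source": {}, "task": {"sink": demand}}
    for machine, stored in storage.items():
        capacity["source"][machine] = stored  # storage constraint
        # Local data needs no transfer; remote data is limited by the link.
        limit = stored if machine == executor else bandwidth[machine] * deadline
        capacity[machine] = {"task": limit}
    return max_flow(capacity, "source", "sink")


# Illustrative numbers only: storage per machine, link rate to executor "A".
storage = {"A": 4, "B": 5, "C": 6}
bandwidth = {"B": 2, "C": 1}
flow = placement_flow(storage, bandwidth, deadline=2, demand=10, executor="A")
print("feasible" if flow == 10 else "infeasible")
```

In this toy setting the demand can be met exactly when the max-flow value equals the task's data demand; raising the demand to 12 with the same numbers leaves the flow at 10, so no placement would satisfy the deadline.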
Pages: 6