A throughput optimal algorithm for map task scheduling in MapReduce with data locality

被引:3
作者
Wang, Weina [1 ]
Zhu, Kai [1 ]
Ying, Lei [1 ]
Tan, Jian [2 ]
Zhang, Li [2 ]
机构
[1] Electrical, Computer and Energy Engineering, Arizona State University, Tempe
[2] IBM T. J. Watson Research Center, Yorktown Heights, NY
来源
Performance Evaluation Review | 2013年 / 40卷 / 04期
关键词
D O I
10.1145/2479942.2479947
中图分类号
学科分类号
摘要
MapReduce/Hadoop framework has been widely used to process large-scale datasets on computing clusters. Scheduling map tasks to improve data locality is crucial to the performance of MapReduce. Many works have been devoted to increasing data locality for better efficiency. However, to the best of our knowledge, fundamental limits of MapReduce computing clusters with data locality, including the capacity region and throughput optimal algorithms, have not been studied. In this paper, we address these problems from a stochastic network perspective. Our focus is to strike the right balance between data-locality and load-balancing to maximize throughput. We present a new queueing architecture and propose a map task scheduling algorithm constituted by the Join the Shortest Queue policy together with the MaxWeight policy. We identify an outer bound on the capacity region, and then prove that the proposed algorithm can stabilize any arrival rate vector strictly within this outer bound. It shows that the algorithm is throughput optimal and the outer bound coincides with the actual capacity region. The proofs in this paper deal with random processing time with different parameters and nonpreemptive tasks, which differentiate our work from many other works, so the proof technique itself is also a contribution of this paper. Copyright is held by author/owner(s).
引用
收藏
页码:33 / 42
页数:9
相关论文
共 12 条
[1]  
Ananthanarayanan G., Agarwal S., Kandula S., Greenberg A., Stoica I., Harlan D., Harris E., Scarlett: Coping with skewed content popularity in mapreduce clusters, Proc. European Conf Computer Systems (EuroSys), pp. 287-300, (2011)
[2]  
Ananthanarayanan G., Ghodsi A., Wang A., Borthakur D., Kandula S., Shenker S., Stoica I., Pacman: Coordinated memory caching for parallel jobs, Proc. Conf. Networked Systems Design and Implementations (USENIX), pp. 20-20, (2012)
[3]  
Dean J., Ghemawat S., Mapreduce: Simplified data processing on large clusters, ACM Commun, 51, 1, pp. 107-113, (2008)
[4]  
Ghemawat S., Gobioff H., Leung S.-T., The google file system, Proc ACM Symp. Operating Systems Principles (SOSP), pp. 29-43, (2003)
[5]  
Isard M., Prabhakaran V., Currey J., Wieder U., Talwar K., Goldberg A., Quincy: Fair scheduling for distributed computing clusters, Proc. ACM Symp. Operating Systems Principles (SOSP), pp. 261-276, (2009)
[6]  
Kavulya S., Tan J., Gandhi R., Narasimhan P., An analysis of traces from a production mapreduce cluster, Proc. IEEE/ACM Int. Conf. Cluster, Cloud and Grid Computing (CCGRID), pp. 94-103, (2010)
[7]  
Maguluri S.T., Srikant R., Scheduling jobs with unknown duration in clouds, Proc. IEEE Int. Conf. Computer Communications (INFOCOM), (2013)
[8]  
Shvachko K., Kuang H., Radia S., Chansler R., The hadoop distributed file system, IEEE Symp. Mass Storage Systems and Technologies (MSST), pp. 1-10, (2010)
[9]  
Tassiulas L., Ephremides A., Stability properties of constrained queueing systems and scheduling policies for maximum throughput in multihop radio networks, IEEE Trans. Autom. Control, 4, pp. 1936-1948, (1992)
[10]  
Tassiulas L., Ephremides A., Dynamic server allocation to parallel queues with randomly varying connectivity, IEEE Trans. Inf. Theory, 39, pp. 466-478, (1993)