Towards Self-Tuning Parameter Servers

被引:0
|
作者
Liu, Chris [1 ]
Zhang, Pengfei [1 ]
Tang, Bo [2 ]
Shen, Hang [3 ]
Lai, Ziliang [1 ]
Lo, Eric [1 ]
Chung, Korris [4 ]
机构
[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[2] Southern Univ Sci & Technol, Peng Cheng Lab, Shenzhen, Peoples R China
[3] Sichuan Univ, Chengdu, Peoples R China
[4] Hong Kong Polytech Univ, Hong Kong, Peoples R China
关键词
D O I
10.1109/BigData50022.2020.9378141
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent years, manypplications have been driven advances by the use of Machine Learning (ML). Nowadays, it is common to see industrial-strength machine learning jobs that involve millions of model parameters, terabytes of training data, and weeks of training Good efficiency, i.e., fast completion time of running a specific ML training job, therefore, is a key feature of a successful ML system. While the completion time of a long running ML job is determined by the time required to reach model convergence, that is also largely influenced by the values of various system settings. In this paper, we contribute techniques towards building self-tuning parameter servers. Parameter Server (PS) is a popular system architecture for large-scale machine learning systems; and by self-tuning we mean while a long running ML job is iteratively training the expert-suggested model, the system is also iteratively learning which system setting is more efficient for that job and applies it online. Our techniques are general enough to various PS-style ML systems. Experiments on TensorFlow show that our techniques can reduce the completion times of a variety of long-running TensorFlow jobs from 14x to 18x.
引用
收藏
页码:310 / 319
页数:10
相关论文
共 50 条
  • [1] SELF-TUNING OPTIMIZATION ON STORAGE SERVERS IN PARALLEL FILE SYSTEMS
    Liao, Jianwei
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2014, 23 (04)
  • [2] Design weighting parameter tuning of multivariable self-tuning controllers
    Cho, WC
    Lee, IS
    Kim, KY
    COMPUTERS & ELECTRICAL ENGINEERING, 2002, 28 (06) : 465 - 480
  • [3] Self-tuning Batching with DVFS for Improving Performance and Energy Efficiency in Servers
    Cheng, Dazhao
    Guo, Yanfei
    Zhou, Xiaobo
    2013 IEEE 21ST INTERNATIONAL SYMPOSIUM ON MODELING, ANALYSIS & SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS (MASCOTS 2013), 2013, : 40 - 49
  • [4] Towards self-tuning of dynamic resources for workloads
    Duan, Fu
    Han, Yongjie
    Zhao, Qiuyong
    Me, Keming
    FIRST INTERNATIONAL WORKSHOP ON KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2007, : 297 - 300
  • [5] Self-tuning controller in consideration of the precision in parameter estimation
    Zheng, Tonghai
    Tian, Shubao
    Lin, Junqi
    Tianjin Daxue Xuebao (Ziran Kexue yu Gongcheng Jishu Ban)/Journal of Tianjin University Science and Technology, 1988, (03): : 108 - 114
  • [6] SELF-TUNING REGULATOR FOR DISTRIBUTED PARAMETER-SYSTEMS
    HAMZA, MH
    SHEIRAH, MA
    AUTOMATICA, 1978, 14 (05) : 453 - 463
  • [7] ENHANCED PARAMETER-ESTIMATION FOR SELF-TUNING CONTROL
    SMITH, CA
    BURNHAM, KJ
    JAMES, DJG
    ELECTRONICS LETTERS, 1995, 31 (05) : 412 - 414
  • [8] Self-Tuning Batching with DVFS for Performance Improvement and Energy Efficiency in Internet Servers
    Cheng, Dazhao
    Guo, Yanfei
    Jiang, Changjun
    Zhou, Xiaobo
    ACM TRANSACTIONS ON AUTONOMOUS AND ADAPTIVE SYSTEMS, 2015, 10 (01)
  • [9] Towards Random Access Channel Self-Tuning in LTE
    Amirijoo, Mehdi
    Frenger, Pal
    Gunnarsson, Fredrik
    Moe, Johan
    Zetterberg, Kristina
    2009 IEEE VEHICULAR TECHNOLOGY CONFERENCE, VOLS 1-5, 2009, : 2309 - 2313
  • [10] Towards Dynamic Self-Tuning for Intrusion Detection Systems
    Kim, Sun-il
    Nwanze, Nnamdi
    Kintner, Jasen
    2010 IEEE 29TH INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCCC), 2010, : 17 - 24