Device Placement Optimization with Reinforcement Learning

被引:0
|
作者
Mirhoseini, Azalia [1 ]
Pham, Hieu [1 ]
Le, Quoc, V [1 ]
Steiner, Benoit [1 ]
Larsen, Rasmus [1 ]
Zhou, Yuefeng [1 ]
Kumar, Naveen [2 ]
Norouzi, Mohammad [1 ]
Bengio, Samy [1 ]
Dean, Jeff [1 ]
机构
[1] Google Brain, Mountain View, CA 94043 USA
[2] Google, Mountain View, CA USA
来源
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70 | 2017年 / 70卷
关键词
ALGORITHMS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The past few years have witnessed a growth in size and computational requirements for training and inference with neural networks. Currently, a common approach to address these requirements is to use a heterogeneous distributed environment with a mixture of hardware devices such as CPUs and GPUs. Importantly, the decision of placing parts of the neural models on devices is often made by human experts based on simple heuristics and intuitions. In this paper, we propose a method which learns to optimize device placement for TensorFlow computational graphs. Key to our method is the use of a sequence-tosequence model to predict which subsets of operations in a TensorFlow graph should run on which of the available devices. The execution time of the predicted placements is then used as the reward signal to optimize the parameters of the sequence-to-sequence model. Our main result is that on Inception-V3 for ImageNet classification, and on RNN LSTM, for language modeling and neural machine translation, our model finds non-trivial device placements that outperform hand-crafted heuristics and traditional algorithmic methods.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Virtual Network Function Placement Optimization Algorithm Based on Improve Deep Reinforcement Learning
    Tang Lun
    He Lanqin
    Lian Qinyi
    Tan Qi
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (06) : 1724 - 1732
  • [22] Online VNF Placement using Deep Reinforcement Learning and Reward Constrained Policy Optimization
    Mohamed, Ramy
    Avgeris, Marios
    Leivadeas, Aris
    Lambadaris, Ioannis
    2024 IEEE INTERNATIONAL MEDITERRANEAN CONFERENCE ON COMMUNICATIONS AND NETWORKING, MEDITCOM 2024, 2024, : 269 - 274
  • [23] Deep reinforcement learning mechanism for deadline-aware cache placement in device-to-device mobile edge networks
    Manoj Kumar Somesula
    Sai Krishna Mothku
    Anusha Kotte
    Wireless Networks, 2023, 29 : 569 - 588
  • [24] Synergistic Fibroblast Optimization Based Improved Reinforcement Learning For Intelligent Assistive Device
    Subashini, P.
    Dhivyaprabha, T. T.
    Krishnaveni, M.
    Viyas, G. Vedha
    2017 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2017,
  • [25] Deadline-Aware Cache Placement Scheme Using Fuzzy Reinforcement Learning in Device-to-Device Mobile Edge Networks
    Manoj Kumar Somesula
    Anusha Kotte
    Sudarshan Chakravarthy Annadanam
    Sai Krishna Mothku
    Mobile Networks and Applications, 2022, 27 : 2100 - 2117
  • [26] Deadline-Aware Cache Placement Scheme Using Fuzzy Reinforcement Learning in Device-to-Device Mobile Edge Networks
    Somesula, Manoj Kumar
    Kotte, Anusha
    Annadanam, Sudarshan Chakravarthy
    Mothku, Sai Krishna
    MOBILE NETWORKS & APPLICATIONS, 2022, 27 (05): : 2100 - 2117
  • [27] Multi-objective Optimization Service Function Chain Placement Algorithm Based on Reinforcement Learning
    Liu, Hongtai
    Ding, Shengduo
    Wang, Shunyi
    Zhao, Gang
    Wang, Chao
    JOURNAL OF NETWORK AND SYSTEMS MANAGEMENT, 2022, 30 (04)
  • [28] Deep Reinforcement Learning-Based Ground-Via Placement Optimization for EMI Mitigation
    Gu, Zheming
    Zhang, Ling
    Jin, Hang
    Tao, Tuomin
    Li, Da
    Li, Er-Ping
    IEEE TRANSACTIONS ON ELECTROMAGNETIC COMPATIBILITY, 2023, 65 (02) : 564 - 573
  • [29] Multi-objective Optimization Service Function Chain Placement Algorithm Based on Reinforcement Learning
    Hongtai Liu
    Shengduo Ding
    Shunyi Wang
    Gang Zhao
    Chao Wang
    Journal of Network and Systems Management, 2022, 30
  • [30] Device Codesign using Reinforcement Learning
    Cardwell, Suma G.
    Patel, Karan
    Schuman, Catherine D.
    Smith, J. Darby
    Kwon, Jaesuk
    Maicke, Andrew
    Arzate, Jared
    Incorvia, Jean Anne C.
    2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,