Device Placement Optimization with Reinforcement Learning

被引：0

作者：

Mirhoseini, Azalia ^{[1
]}

Pham, Hieu ^{[1
]}

Le, Quoc, V ^{[1
]}

Steiner, Benoit ^{[1
]}

Larsen, Rasmus ^{[1
]}

Zhou, Yuefeng ^{[1
]}

Kumar, Naveen ^{[2
]}

Norouzi, Mohammad ^{[1
]}

Bengio, Samy ^{[1
]}

Dean, Jeff ^{[1
]}

机构：

[1] Google Brain, Mountain View, CA 94043 USA

[2] Google, Mountain View, CA USA

来源：

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70 | 2017年 / 70卷

关键词：

ALGORITHMS;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The past few years have witnessed a growth in size and computational requirements for training and inference with neural networks. Currently, a common approach to address these requirements is to use a heterogeneous distributed environment with a mixture of hardware devices such as CPUs and GPUs. Importantly, the decision of placing parts of the neural models on devices is often made by human experts based on simple heuristics and intuitions. In this paper, we propose a method which learns to optimize device placement for TensorFlow computational graphs. Key to our method is the use of a sequence-tosequence model to predict which subsets of operations in a TensorFlow graph should run on which of the available devices. The execution time of the predicted placements is then used as the reward signal to optimize the parameters of the sequence-to-sequence model. Our main result is that on Inception-V3 for ImageNet classification, and on RNN LSTM, for language modeling and neural machine translation, our model finds non-trivial device placements that outperform hand-crafted heuristics and traditional algorithmic methods.

引用

页数：10

共 50 条

[21] Virtual Network Function Placement Optimization Algorithm Based on Improve Deep Reinforcement Learning
Tang Lun
He Lanqin
Lian Qinyi
Tan Qi
JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (06) : 1724 - 1732
[22] Online VNF Placement using Deep Reinforcement Learning and Reward Constrained Policy Optimization
Mohamed, Ramy
Avgeris, Marios
Leivadeas, Aris
Lambadaris, Ioannis
2024 IEEE INTERNATIONAL MEDITERRANEAN CONFERENCE ON COMMUNICATIONS AND NETWORKING, MEDITCOM 2024, 2024, : 269 - 274
[23] Deep reinforcement learning mechanism for deadline-aware cache placement in device-to-device mobile edge networks
Manoj Kumar Somesula
Sai Krishna Mothku
Anusha Kotte
Wireless Networks, 2023, 29 : 569 - 588
[24] Synergistic Fibroblast Optimization Based Improved Reinforcement Learning For Intelligent Assistive Device
Subashini, P.
Dhivyaprabha, T. T.
Krishnaveni, M.
Viyas, G. Vedha
2017 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2017,
[25] Deadline-Aware Cache Placement Scheme Using Fuzzy Reinforcement Learning in Device-to-Device Mobile Edge Networks
Manoj Kumar Somesula
Anusha Kotte
Sudarshan Chakravarthy Annadanam
Sai Krishna Mothku
Mobile Networks and Applications, 2022, 27 : 2100 - 2117
[26] Deadline-Aware Cache Placement Scheme Using Fuzzy Reinforcement Learning in Device-to-Device Mobile Edge Networks
Somesula, Manoj Kumar
Kotte, Anusha
Annadanam, Sudarshan Chakravarthy
Mothku, Sai Krishna
MOBILE NETWORKS & APPLICATIONS, 2022, 27 (05): : 2100 - 2117
[27] Multi-objective Optimization Service Function Chain Placement Algorithm Based on Reinforcement Learning
Liu, Hongtai
Ding, Shengduo
Wang, Shunyi
Zhao, Gang
Wang, Chao
JOURNAL OF NETWORK AND SYSTEMS MANAGEMENT, 2022, 30 (04)
[28] Deep Reinforcement Learning-Based Ground-Via Placement Optimization for EMI Mitigation
Gu, Zheming
Zhang, Ling
Jin, Hang
Tao, Tuomin
Li, Da
Li, Er-Ping
IEEE TRANSACTIONS ON ELECTROMAGNETIC COMPATIBILITY, 2023, 65 (02) : 564 - 573
[29] Multi-objective Optimization Service Function Chain Placement Algorithm Based on Reinforcement Learning
Hongtai Liu
Shengduo Ding
Shunyi Wang
Gang Zhao
Chao Wang
Journal of Network and Systems Management, 2022, 30
[30] Device Codesign using Reinforcement Learning
Cardwell, Suma G.
Patel, Karan
Schuman, Catherine D.
Smith, J. Darby
Kwon, Jaesuk
Maicke, Andrew
Arzate, Jared
Incorvia, Jean Anne C.
2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,

← 1 2 3 4 5 →