Multi-Objective Resource Scheduling for IoT Systems Using Reinforcement Learning

被引:4
|
作者
Shresthamali, Shaswot [1 ]
Kondo, Masaaki [1 ]
Nakamura, Hiroshi [2 ]
机构
[1] Keio Univ, Fac Sci & Technol, Dept Informat & Comp Sci, Tokyo, Kanagawa 2238522, Japan
[2] Univ Tokyo, Grad Sch Informat Sci & Technol, Dept Informat Phys & Comp, Tokyo 1138656, Japan
基金
日本学术振兴会;
关键词
multi objective optimization; multi-objective reinforcement learning; wireless sensors; energy harvesting; POWER MANAGEMENT; SENSOR NETWORKS; ENERGY; TRANSMISSION; OPTIMIZATION; ALLOCATION;
D O I
10.3390/jlpea12040053
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
IoT embedded systems have multiple objectives that need to be maximized simultaneously. These objectives conflict with each other due to limited resources and tradeoffs that need to be made. This requires multi-objective optimization (MOO) and multiple Pareto-optimal solutions are possible. In such a case, tradeoffs are made w.r.t. a user-defined preference. This work presents a general Multi-objective Reinforcement Learning (MORL) framework for MOO of IoT embedded systems. This framework comprises a general Multi-objective Markov Decision Process (MOMDP) formulation and two novel low-compute MORL algorithms. The algorithms learn policies to tradeoff between multiple objectives using a single preference parameter. We take the energy scheduling problem in general Energy Harvesting Wireless Sensor Nodes (EHWSNs) as a case example in which a sensor node is required to maximize its sensing rate, and transmission performance as well as ensure long-term uninterrupted operation within a very tight energy budget. We simulate single-task and dual-task EHWSN systems to evaluate our framework. The results demonstrate that our MORL algorithms can learn better policies at lower learning costs and successfully tradeoff between multiple objectives at runtime.
引用
收藏
页数:33
相关论文
共 50 条
  • [31] Reinforcement learning-based multi-objective energy-efficient task scheduling in fog-cloud industrial IoT-based systems
    V. Vijayalakshmi
    M. Saravanan
    Soft Computing, 2023, 27 : 17473 - 17491
  • [32] Reinforcement learning-based multi-objective energy-efficient task scheduling in fog-cloud industrial IoT-based systems
    Vijayalakshmi, V.
    Saravanan, M.
    SOFT COMPUTING, 2023, 27 (23) : 17473 - 17491
  • [33] An adaptive multi-objective multi-task scheduling method by hierarchical deep reinforcement learning
    Zhang, Jianxiong
    Guo, Bing
    Ding, Xuefeng
    Hu, Dasha
    Tang, Jun
    Du, Ke
    Tang, Chao
    Jiang, Yuming
    APPLIED SOFT COMPUTING, 2024, 154
  • [34] Investigating the performance of multi-objective reinforcement learning techniques in the context of IoT with harvesting energy
    Haouari, Bakhta
    Mzid, Rania
    Mosbahi, Olfa
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (04):
  • [35] A multi-objective multi-agent deep reinforcement learning approach to residential appliance scheduling
    Lu, Junlin
    Mannion, Patrick
    Mason, Karl
    IET SMART GRID, 2022, 5 (04) : 260 - 280
  • [36] Deep Reinforcement Learning based Multi-Objective Systems for Financial Trading
    Bisht, Kiran
    Kumar, Arun
    2020 5TH IEEE INTERNATIONAL CONFERENCE ON RECENT ADVANCES AND INNOVATIONS IN ENGINEERING (IEEE - ICRAIE-2020), 2020,
  • [37] Meta-Learning for Multi-objective Reinforcement Learning
    Chen, Xi
    Ghadirzadeh, Ali
    Bjorkman, Marten
    Jensfelt, Patric
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 977 - 983
  • [38] Mutated Deep Reinforcement Learning Scheduling in Cloud for Resource-Intensive IoT Systems
    Shingne, Harshala
    Shriram, R.
    WIRELESS PERSONAL COMMUNICATIONS, 2023, 132 (03) : 2143 - 2155
  • [39] Mutated Deep Reinforcement Learning Scheduling in Cloud for Resource-Intensive IoT Systems
    Harshala Shingne
    R. Shriram
    Wireless Personal Communications, 2023, 132 : 2143 - 2155
  • [40] Multi-objective Reinforcement Learning for Responsive Grids
    Perez, Julien
    Germain-Renaud, Cecile
    Kegl, Balazs
    Loomis, Charles
    JOURNAL OF GRID COMPUTING, 2010, 8 (03) : 473 - 492