RLProph: a dynamic programming based reinforcement learning approach for optimal routing in opportunistic IoT networks

被引:27
|
作者
Sharma, Deepak Kumar [1 ]
Rodrigues, Joel J. P. C. [2 ,3 ]
Vashishth, Vidushi [1 ]
Khanna, Anirudh [4 ]
Chhabra, Anshuman [4 ,5 ]
机构
[1] Netaji Subhas Univ Technol, Dept Informat Technol, New Delhi, India
[2] Fed Univ Piaui UFPI, Campus Petronio Portela, Teresina, PI, Brazil
[3] Inst Telecomunicacoes, Covilha, Portugal
[4] Netaji Subhas Univ Technol, Div Elect & Commun Engn, New Delhi, India
[5] Univ Calif Davis, Dept Comp Sci, Davis, CA 95616 USA
关键词
Opportunistic networks; Internet of Things; Reinforcement learning; Markov decision process; Dynamic programming; ONE simulator; Machine learning; Policy iteration; ALGORITHM; DESIGN;
D O I
10.1007/s11276-020-02331-1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Routing in Opportunistic Internet of Things networks (OppIoTs) is a challenging task because of intermittent connectivity between devices and the lack of a fixed path between the source and destination of messages. Recently, machine learning (ML) and reinforcement learning (RL) have been used with great success to automate processes in a number of different problem domains. In this paper, we seek to fully automate the OppIoT routing process by using the Policy Iteration algorithm to maximize the possibility of message delivery. Moreover, we model the OppIoT environment as a Markov decision process (MDP) replete with states, actions, rewards, and transition probabilities. The proposed routing protocol, RLProph, is able to optimize the routing process via the optimal policy obtained by solving the MDP using Policy Iteration. Through extensive simulations, we show that RLProph outperforms a number of ML-based and context-aware routing protocols on a multitude of performance criteria.
引用
收藏
页码:4319 / 4338
页数:20
相关论文
共 50 条
  • [31] Reinforcement learning based dynamic distributed routing scheme for mega LEO satellite networks
    Yixin HUANG
    Shufan WU
    Zeyu KANG
    Zhongcheng MU
    Hai HUANG
    Xiaofeng WU
    Andrew Jack TANG
    Xuebin CHENG
    Chinese Journal of Aeronautics , 2023, (02) : 284 - 291
  • [32] AN OPTIMAL-CONTROL APPROACH TO DYNAMIC ROUTING IN NETWORKS
    MOSS, FH
    SEGALL, A
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1982, 27 (02) : 329 - 339
  • [33] Location based Routing in Opportunistic Networks using Cascade Learning
    Singh, Jagdeep
    Obaidat, Mohammad S.
    Dhurandher, Sanjay K.
    PROCEEDINGS OF THE 2021 IEEE INTERNATIONAL CONFERENCE ON COMPUTER, INFORMATION, AND TELECOMMUNICATION SYSTEMS (IEEE CITS 2021), 2021, : 6 - 10
  • [34] Reinforcement learning based dynamic distributed routing scheme for mega LEO satellite networks
    Huang, Yixin
    Wu, Shufan
    Kang, Zeyu
    Mu, Zhongcheng
    Huang, Hai
    Wu, Xiaofeng
    Tang, Andrew Jack
    Cheng, Xuebin
    CHINESE JOURNAL OF AERONAUTICS, 2023, 36 (02) : 284 - 291
  • [35] A Deep Reinforcement Learning-based Routing Scheme with Two Modes for Dynamic Networks
    Cong, Peizhuang
    Zhang, Yuchao
    Wang, Wendong
    Xu, Ke
    Li, Ruidong
    Li, Fuliang
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
  • [36] RL-IoT: Reinforcement Learning-Based Routing Approach for Cognitive Radio-Enabled IoT Communications
    Malik, Tauqeer Safdar
    Malik, Kaleem Razzaq
    Afzal, Ayesha
    Ibrar, Muhammad
    Wang, Lei
    Song, Houbing
    Shah, Nadir
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (02) : 1836 - 1847
  • [37] Dynamic Multihop Routing in Terahertz Flow-Guided Nanosensor Networks: A Reinforcement Learning Approach
    Garcia-Sanchez, Antonio-Javier
    Asorey-Cacheda, Rafael
    Garcia-Haro, Joan
    Gomez-Tornero, Jose-Luis
    IEEE SENSORS JOURNAL, 2023, 23 (04) : 3408 - 3422
  • [38] Opportunistic Fair Scheduling in Wireless Networks: An Approximate Dynamic Programming Approach
    Zhi Zhang
    Sudhir Moola
    Edwin K. P. Chong
    Mobile Networks and Applications, 2010, 15 : 710 - 728
  • [39] Opportunistic Fair Scheduling in Wireless Networks: An Approximate Dynamic Programming Approach
    Zhang, Zhi
    Moola, Sudhir
    Chong, Edwin K. P.
    MOBILE NETWORKS & APPLICATIONS, 2010, 15 (05): : 710 - 728
  • [40] Neural Networks and Differential Dynamic Programming for Reinforcement Learning Problems
    Yamaguchi, Akihiko
    Atkeson, Christopher G.
    2016 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2016, : 5434 - 5441