RLProph: a dynamic programming based reinforcement learning approach for optimal routing in opportunistic IoT networks

Cited by: 27
Authors
Sharma, Deepak Kumar [1]
Rodrigues, Joel J. P. C. [2, 3]
Vashishth, Vidushi [1]
Khanna, Anirudh [4]
Chhabra, Anshuman [4, 5]
Affiliations
[1] Netaji Subhas Univ Technol, Dept Informat Technol, New Delhi, India
[2] Fed Univ Piaui UFPI, Campus Petronio Portela, Teresina, PI, Brazil
[3] Inst Telecomunicacoes, Covilha, Portugal
[4] Netaji Subhas Univ Technol, Div Elect & Commun Engn, New Delhi, India
[5] Univ Calif Davis, Dept Comp Sci, Davis, CA 95616 USA
Keywords
Opportunistic networks; Internet of Things; Reinforcement learning; Markov decision process; Dynamic programming; ONE simulator; Machine learning; Policy iteration; ALGORITHM; DESIGN;
DOI
10.1007/s11276-020-02331-1
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Routing in Opportunistic Internet of Things networks (OppIoTs) is a challenging task because of intermittent connectivity between devices and the lack of a fixed path between the source and destination of messages. Recently, machine learning (ML) and reinforcement learning (RL) have been used with great success to automate processes in a number of different problem domains. In this paper, we seek to fully automate the OppIoT routing process by using the Policy Iteration algorithm to maximize the possibility of message delivery. Moreover, we model the OppIoT environment as a Markov decision process (MDP) replete with states, actions, rewards, and transition probabilities. The proposed routing protocol, RLProph, is able to optimize the routing process via the optimal policy obtained by solving the MDP using Policy Iteration. Through extensive simulations, we show that RLProph outperforms a number of ML-based and context-aware routing protocols on a multitude of performance criteria.
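As a rough illustration of the Policy Iteration procedure the abstract refers to, the sketch below solves a toy three-state MDP. Everything in it is an assumption made for illustration and is not taken from the paper: the states (message at a poorly connected node, at a well connected node, or delivered), the two actions (hold, forward), the transition probabilities, the rewards, and the discount factor `gamma`. It shows only the generic alternation of policy evaluation and policy improvement, not the RLProph state or reward design.

```python
# Toy MDP (all numbers hypothetical, not from the paper).
# States: 0 = message at a poorly connected node,
#         1 = message at a well connected node,
#         2 = message delivered (absorbing).
# Actions: 0 = hold the message, 1 = forward it to an encountered node.
# P[state][action] is a list of (probability, next_state, reward) tuples.
P = [
    [[(0.1, 2, 1.0), (0.9, 0, 0.0)],   # state 0, hold
     [(0.8, 1, 0.0), (0.2, 0, 0.0)]],  # state 0, forward
    [[(0.7, 2, 1.0), (0.3, 1, 0.0)],   # state 1, hold
     [(0.5, 0, 0.0), (0.5, 1, 0.0)]],  # state 1, forward
    [[(1.0, 2, 0.0)],                  # state 2, hold (absorbing)
     [(1.0, 2, 0.0)]],                 # state 2, forward (absorbing)
]


def policy_iteration(P, gamma=0.9, tol=1e-10):
    """Return (policy, V) for a finite MDP given as nested transition lists."""
    n_states = len(P)
    policy = [0] * n_states   # start from "always hold"
    V = [0.0] * n_states

    def q(s, a):
        # Expected return of taking action a in state s, then following V.
        return sum(p * (r + gamma * V[s2]) for p, s2, r in P[s][a])

    while True:
        # Policy evaluation: iterate the Bellman expectation update
        # in place until the value estimates stop changing.
        while True:
            delta = 0.0
            for s in range(n_states):
                v_new = q(s, policy[s])
                delta = max(delta, abs(v_new - V[s]))
                V[s] = v_new
            if delta < tol:
                break

        # Policy improvement: switch to a strictly better action if one exists.
        stable = True
        for s in range(n_states):
            best = max(range(len(P[s])), key=lambda a: q(s, a))
            if q(s, best) > q(s, policy[s]) + 1e-12:
                policy[s] = best
                stable = False
        if stable:
            return policy, V


policy, V = policy_iteration(P)
print("policy:", policy)
print("values:", [round(v, 4) for v in V])
```

In this toy setting the resulting policy forwards the message from the poorly connected node and holds it at the well connected one, which is the kind of forwarding decision a routing policy would encode.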
Pages: 4319 - 4338
Page count: 20
Related papers
50 in total
  • [21] Combining reinforcement learning with mathematical programming: An approach for optimal design of heat exchanger networks
    Tan, Hui
    Hong, Xiaodong
    Liao, Zuwei
    Sun, Jingyuan
    Yang, Yao
    Wang, Jingdai
    Yang, Yongrong
    CHINESE JOURNAL OF CHEMICAL ENGINEERING, 2024, 69 : 63 - 71
  • [22] Combining reinforcement learning with mathematical programming: An approach for optimal design of heat exchanger networks
    Hui Tan
    Xiaodong Hong
    Zuwei Liao
    Jingyuan Sun
    Yao Yang
    Jingdai Wang
    Yongrong Yang
    Chinese Journal of Chemical Engineering, 2024, 69 (05) : 63 - 71
  • [23] A Dynamic Programming Approach for Routing in Wireless Mesh Networks
    Crichigno, J.
    Khoury, J.
    Wu, M. Y.
    Shu, W.
    GLOBECOM 2008 - 2008 IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE, 2008,
  • [24] Probability and Priority Based Routing Approach for Opportunistic Networks
    Avhad, Kiran
    Limkar, Suresh
    Kulkarni, Anagha
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON FRONTIERS OF INTELLIGENT COMPUTING: THEORY AND APPLICATIONS (FICTA) 2013, 2014, 247 : 285 - 292
  • [25] Routing Recovery for UAV Networks with Deliberate Attacks: A Reinforcement Learning based Approach
    He, Sijie
    Jia, Ziye
    Dong, Chao
    Wang, Wei
    Cao, Yilu
    Yang, Yang
    Wu, Qihui
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 952 - 957
  • [26] Ants and reinforcement learning: A case study in routing in dynamic networks
    Subramanian, D
    Druschel, P
    Chen, J
    IJCAI-97 - PROCEEDINGS OF THE FIFTEENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 AND 2, 1997, : 832 - 838
  • [27] Dynamic Adjustment Strategy of n-Epidemic Routing Protocol for Opportunistic Networks: A Learning Automata Approach
    Zhang, Feng
    Wang, Xiaoming
    Zhang, Lichen
    Li, Peng
    Wang, Liang
    Yu, Wangyang
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2017, 11 (04) : 2020 - 2037
  • [28] A location Prediction-based routing scheme for opportunistic networks in an IoT scenario
    Dhurandher, Sanjay K.
    Borah, Satya J.
    Woungang, I.
    Bansal, Aman
    Gupta, Apoory
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2018, 118 : 369 - 378
  • [29] Cracking the Anonymous IoT Routing Networks: A Deep Learning Approach
    Bansal G.
    Chamola V.
    Hussain A.
    Khan M.K.
    IEEE Internet of Things Magazine, 2023, 6 (01) : 120 - 126
  • [30] Reinforcement learning based dynamic distributed routing scheme for mega LEO satellite networks
    Yixin HUANG
    Shufan WU
    Zeyu KANG
    Zhongcheng MU
    Hai HUANG
    Xiaofeng WU
    Andrew Jack TANG
    Xuebin CHENG
    Chinese Journal of Aeronautics, 2023, 36 (02) : 284 - 291