RLProph: a dynamic programming based reinforcement learning approach for optimal routing in opportunistic IoT networks

被引：27

作者：

Sharma, Deepak Kumar ^{[1
]}

Rodrigues, Joel J. P. C. ^{[2
,3
]}

Vashishth, Vidushi ^{[1
]}

Khanna, Anirudh ^{[4
]}

Chhabra, Anshuman ^{[4
,5
]}

机构：

[1] Netaji Subhas Univ Technol, Dept Informat Technol, New Delhi, India

[2] Fed Univ Piaui UFPI, Campus Petronio Portela, Teresina, PI, Brazil

[3] Inst Telecomunicacoes, Covilha, Portugal

[4] Netaji Subhas Univ Technol, Div Elect & Commun Engn, New Delhi, India

[5] Univ Calif Davis, Dept Comp Sci, Davis, CA 95616 USA

来源：

WIRELESS NETWORKS | 2020年 / 26卷 / 06期

关键词：

Opportunistic networks; Internet of Things; Reinforcement learning; Markov decision process; Dynamic programming; ONE simulator; Machine learning; Policy iteration; ALGORITHM; DESIGN;

D O I：

10.1007/s11276-020-02331-1

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Routing in Opportunistic Internet of Things networks (OppIoTs) is a challenging task because of intermittent connectivity between devices and the lack of a fixed path between the source and destination of messages. Recently, machine learning (ML) and reinforcement learning (RL) have been used with great success to automate processes in a number of different problem domains. In this paper, we seek to fully automate the OppIoT routing process by using the Policy Iteration algorithm to maximize the possibility of message delivery. Moreover, we model the OppIoT environment as a Markov decision process (MDP) replete with states, actions, rewards, and transition probabilities. The proposed routing protocol, RLProph, is able to optimize the routing process via the optimal policy obtained by solving the MDP using Policy Iteration. Through extensive simulations, we show that RLProph outperforms a number of ML-based and context-aware routing protocols on a multitude of performance criteria.

引用

页码：4319 / 4338

页数：20

共 50 条

[1] RLProph: a dynamic programming based reinforcement learning approach for optimal routing in opportunistic IoT networks
Deepak Kumar Sharma
Joel J. P. C. Rodrigues
Vidushi Vashishth
Anirudh Khanna
Anshuman Chhabra
Wireless Networks, 2020, 26 : 4319 - 4338
[2] Congestion-Aware Routing in Dynamic IoT Networks: A Reinforcement Learning Approach
Farag, Hossam
Stefanovic, Cedomir
2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,
[3] GMMR: A Gaussian mixture model based unsupervised machine learning approach for optimal routing in opportunistic IoT networks
Vashishth, Vidushi
Chhabra, Anshuman
Sharma, Deepak Kumar
COMPUTER COMMUNICATIONS, 2019, 134 : 138 - 148
[4] Bionic Conventional Deep Learning Model-Based Optimal Routing in Opportunistic IOT Networks
Gopinathan, S.
Babu, S.
JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2025,
[5] A novel federated learning approach for routing optimisation in opportunistic IoT networks
Bhardwaj, Moulik
Singh, Jagdeep
Gupta, Nitin
Jadon, Kuldeep Singh
Dhurandher, Sanjay Kumar
INTERNATIONAL JOURNAL OF SENSOR NETWORKS, 2024, 46 (01) : 24 - 38
[6] Reinforcement Learning-Based Routing Protocol for Opportunistic Networks
Dhurandher, Sanjay Kumar
Singh, Jagdeep
Obaidat, Mohammad S.
Woungang, Isaac
Srivastava, Samariddhi
Rodrigues, Joel J. P. C.
ICC 2020 - 2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2020,
[7] Reinforcement learning based reliability-aware routing in IoT networks
Ergun, Kazim
Ayoub, Raid
Mercati, Pietro
Rosing, Tajana
AD HOC NETWORKS, 2022, 132
[8] Reinforcement learning-based fuzzy geocast routing protocol for opportunistic networks
Khalid, Khuram
Woungang, Isaac
Dhurandher, Sanjay K.
Singh, Jagdeep
INTERNET OF THINGS, 2021, 14
[9] A deep reinforcement learning-based multi-optimality routing scheme for dynamic IoT networks
Cong, Peizhuang
Zhang, Yuchao
Liu, Zheli
Baker, Thar
Tawfik, Hissam
Wang, Wendong
Xu, Ke
Li, Ruidong
Li, Fuliang
COMPUTER NETWORKS, 2021, 192
[10] An AI-based approach for dynamic routing in IoT networks
Gountia, Debasis
Mishra, Pranati
Dash, Ranjan Kumar
Pradhan, Nihar Ranjan
Mohanty, Sachi Nandan
PEER-TO-PEER NETWORKING AND APPLICATIONS, 2025, 18 (03)

← 1 2 3 4 5 →