TimeGAN as a Simulator for Reinforcement Learning Training in Programmable Data Planes

Cited by: 0

Authors:

Tavares, Thiago Caproni [1]

de Almeida, Leandro C. [2]

Silva, Washington R. D. [3]

Chiesa, Marco [4]

Verdi, Fabio L. [3]

Affiliations:

[1] IFSULDEMINAS, Pocos De Caldas, Brazil

[2] IFPB, Joao Pessoa, Paraiba, Brazil

[3] Univ Fed Sao Carlos, Sorocaba, Brazil

[4] KTH Royal Inst Technol, Stockholm, Sweden

Source:

PROCEEDINGS OF 2024 IEEE/IFIP NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM, NOMS 2024 | 2024

Funding:

Swedish Research Council;

Keywords:

Machine Learning; Generative Adversarial Networks; Autonomous Management;

DOI:

10.1109/NOMS59830.2024.10575112

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812;

Abstract:

This study explores the application of the Time-series Generative Adversarial Network (TimeGAN) in a Programmable Data Plane (PDP) to enhance Reinforcement Learning (RL) in computer networks, particularly for video applications. We address several challenges, including dataset augmentation, dataset balancing, and long RL training times in real setups. By leveraging synthetic data generated by TimeGAN, we accelerate experimentation, enhance dataset diversity, and simplify RL model training. We then evaluate TimeGAN against real setups in resource optimization for PDPs using an RL agent. The contributions are a direct comparison between GAN usage and real setups, which bridges a gap in the computer network literature, and a 99% similarity in Quality of Service achieved by an RL model trained with synthetic data, which affirms TimeGAN's potential as a valuable simulator that does not compromise RL training efficacy.


Pages: 9
