TimeGAN as a Simulator for Reinforcement Learning Training in Programmable Data Planes

被引:0
|
作者
Tavares, Thiago Caproni [1 ]
de Almeida, Leandro C. [2 ]
Silva, Washington R. D. [3 ]
Chiesa, Marco [4 ]
Verdi, Fabio L. [3 ]
机构
[1] IFSULDEMINAS, Pocos De Caldas, Brazil
[2] IFPB, Joao Pessoa, Paraiba, Brazil
[3] Univ Fed Sao Carlos, Sorocaba, Brazil
[4] KTH Royal Inst Technol, Stockholm, Sweden
基金
瑞典研究理事会;
关键词
Machine Learning; Generative Adversarial Networks; Autonomous Management;
D O I
10.1109/NOMS59830.2024.10575112
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This study explores the application of Time Series GAN in a Programmable Data Plane (PDP) for enhancing Reinforcement Learning within the context of computer networks, particularly in video applications. We address various challenges, including dataset augmentation, balancing, and extended RL training times in real setups. By leveraging synthetic data generated by TimeGAN, we accelerate experimentation, enhance dataset diversity, and simplify RL model training, ultimately evaluating TimeGAN's performance against real setups in resource optimization for PDPs using an RL agent. This research contributes by directly comparing GAN usage and real setups, bridging a gap in computer network literature, and highlighting a 99% similarity in Quality of Service achieved by an RL model trained with synthetic data, affirming TimeGAN's potential as a valuable simulator without compromising RL training efficacy.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] A simulator for reinforcement learning training in the recommendation field
    Pang, Guangyao
    Zhu, Xiaoying
    Lu, Keda
    Peng, Zizhen
    Deng, Weitao
    2020 IEEE INTL SYMP ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, INTL CONF ON BIG DATA & CLOUD COMPUTING, INTL SYMP SOCIAL COMPUTING & NETWORKING, INTL CONF ON SUSTAINABLE COMPUTING & COMMUNICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2020), 2020, : 1037 - 1042
  • [2] Offloading Machine Learning to Programmable Data Planes: A Systematic Survey
    Parizotto, Ricardo
    Coelho, Bruno Loureiro
    Nunes, Diego Cardoso
    Haque, Israat
    Schaeffer-Filho, Alberto
    ACM COMPUTING SURVEYS, 2024, 56 (01)
  • [3] A Data-Driven Pandemic Simulator with Reinforcement Learning
    Zhang, Yuting
    Ma, Biyang
    Cao, Langcai
    Liu, Yanyu
    ELECTRONICS, 2024, 13 (13)
  • [4] PROGRAMMABLE SIMULATOR SPEEDS OPERATOR TRAINING
    BARRETT, GF
    BENKO, TW
    RIDDELL, G
    BELL LABORATORIES RECORD, 1981, 59 (07): : 213 - 216
  • [5] A Programmable Framework for Validating Data Planes
    Bressana, Pietro
    Zilberman, Noa
    Soule, Robert
    SIGCOMM'18: PROCEEDINGS OF THE ACM SIGCOMM 2018 CONFERENCE: POSTERS AND DEMOS, 2018, : 1 - 3
  • [6] Programmable reinforcement learning agents
    Andre, D
    Russell, SJ
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 13, 2001, 13 : 1019 - 1025
  • [7] On the use of conditional TimeGAN to enhance the robustness of a reinforcement learning agent in the building domain
    Fochesato, Marta
    Khayatian, Fazel
    Lima, Doris Fonseca
    Nagy, Zoltan
    PROCEEDINGS OF THE 2022 THE 9TH ACM INTERNATIONAL CONFERENCE ON SYSTEMS FOR ENERGY-EFFICIENT BUILDINGS, CITIES, AND TRANSPORTATION, BUILDSYS 2022, 2022, : 208 - 217
  • [8] Dynamic Property Enforcement in Programmable Data Planes
    Neves, Miguel
    Huffaker, Bradley
    Levchenko, Kiri
    Barcellos, Marinho
    2019 IFIP NETWORKING CONFERENCE (IFIP NETWORKING), 2019,
  • [9] Dynamic Property Enforcement in Programmable Data Planes
    Neves, Miguel
    Huffaker, Bradley
    Levchenko, Kirill
    Barcellos, Marinho
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2021, 29 (04) : 1540 - 1552
  • [10] Dynamic Property Enforcement in Programmable Data Planes
    Neves, Miguel
    Huffakert, Bradley
    Levchenko, Kiri
    Barcellos, Marinho
    2019 IFIP NETWORKING CONFERENCE (IFIP NETWORKING), 2019,