Using a half cheetah habitat for random augmentation computing

被引:0
|
作者
Kishor K. [1 ]
机构
[1] ABES Institute of Technology Ghaziabad, Uttar Pradesh
关键词
Augmented Random Search; Dynamical System; Hyper parameter; Linear Policies; Model-free Vs model-based learning; RL (Reinforcement Learning);
D O I
10.1007/s11042-024-19084-0
中图分类号
学科分类号
摘要
Reinforcement learning algorithms that depend on physical models are thought to provide more substantial outcomes in comparison to model-free approaches when implemented on dynamic systems. Previous research has mostly focused on addressing self-management difficulties via the use of sophisticated neural network models. In order to get state-of-the-art outcomes, these models need a significant amount of training data. When using model-free policy search techniques, it is often considered that model-free reinforcement learning processes with policies that provide worse outcomes may be avoided by utilizing augmented random search (ARS), model-free. This technique improves the effectiveness and speeds up the training of linear methods for performing control tasks on the virtual physics engine Half-Cheetah Environment. To reach this objective, we use the computational efficiency of Augmented Random Search (ARS) to assess the agent's performance over several random episodes and hyper parameters. The study also showcases the effectiveness of the search approach used for this project via the examination of simulation data, including many events and agents' behaviors. Our simulation reveals that the commonly used metric for assessing the efficiency of RL learning is inadequate in accurately determining the efficacy of specific circumstances. In these instances, Augmented Random Search (ARS) surpasses other algorithms by achieving more rewards. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.
引用
收藏
页码:5927 / 5946
页数:19
相关论文
共 50 条
  • [31] Assessing wildfire impact on Trigonella elliptica habitat using random forest modeling
    Moradi, Ehsan
    Tavili, Ali
    Darabi, Hamid
    Muchova, Zlatica
    JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2024, 353
  • [32] RANDOM FOREST CLASSIFICATION SCENARIOS FOR BENTHIC HABITAT MAPPING USING PLANETSCOPE IMAGE
    Wicaksono, Pramaditya
    Lazuardi, Wahyu
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 8245 - 8248
  • [33] Spatial modeling of habitat preferences of biological species using Markov random fields
    Avalos, Carlos Diaz
    JOURNAL OF APPLIED STATISTICS, 2007, 34 (07) : 807 - 821
  • [34] Computing Invariant Sets of Random Differential Equations Using Polynomial Chaos
    Breden, Maxime
    Kuehn, Christian
    SIAM JOURNAL ON APPLIED DYNAMICAL SYSTEMS, 2020, 19 (01): : 577 - 618
  • [35] A high performance random number generator using heterogeneous computing platform
    Li, F. (lifan2013666@163.com), 1600, Binary Information Press, P.O. Box 162, Bethel, CT 06801-0162, United States (09):
  • [36] Mean Circuit Design Using Correlated Random Bitstreams in Stochastic Computing
    Li, Feiyu
    Xie, Guangjun
    Han, Jie
    Zhang, Yongqiang
    2022 IEEE 22ND INTERNATIONAL CONFERENCE ON NANOTECHNOLOGY (NANO), 2022, : 4 - 7
  • [37] A True Random Number Generator for Probabilistic Computing using Stochastic Magnetic Actuated Random Transducer Devices
    Shukla, Ankit
    Heller, Laura
    Morshed, Md Golam
    Rehm, Laura
    Ghosh, Avik W.
    Kent, Andrew D.
    Rakheja, Shaloo
    2023 24TH INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN, ISQED, 2023, : 314 - 323
  • [38] Uncovering Bivariate Interactions in High Dimensional Data Using Random Forests with Data Augmentation
    Arevalillo, Jorge M.
    Navarro, Hilario
    FUNDAMENTA INFORMATICAE, 2011, 113 (02) : 97 - 115
  • [39] Hyperspectral image classification using a deep relation network with random replacement data augmentation
    Lu, Xinhua
    Hao, Jiaxuan
    Wang, Hua
    Qiao, Jianliang
    Huang, Junbo
    REMOTE SENSING LETTERS, 2024, 15 (08) : 805 - 815
  • [40] COMPUTING RANDOM-FIELDS
    WISEMAN, NE
    NEDUNURI, S
    COMPUTER JOURNAL, 1986, 29 (04): : 373 - 377