Reliability evaluation of reinforcement learning methods for mechanical systems with increasing complexity

Cited by: 4
Authors
Manzl, Peter [1 ]
Rogov, Oleg [2 ]
Gerstmayr, Johannes [1 ]
Mikkola, Aki [2 ]
Orzechowski, Grzegorz [2 ]
Affiliations
[1] Univ Innsbruck, Dept Mechatron, Tech Str 13, A-6020 Innsbruck, Tyrol, Austria
[2] LUT Univ, Dept Mech Engn, Yliopistonkatu 34, Lappeenranta 53850, South Karelia, Finland
Keywords
Reinforcement learning; Reliability analysis; Inverse pendulum; Machine learning; Dynamical systems; DYNAMICS;
DOI
10.1007/s11044-023-09960-2
Chinese Library Classification (CLC) number
O3 [Mechanics];
Discipline classification code
08 ; 0801 ;
Abstract
Reinforcement learning (RL) is one of the emerging fields of artificial intelligence (AI) intended for designing agents that take actions in the physical environment. RL has many vital applications, including robotics and autonomous vehicles. The key characteristic of RL is its ability to learn from experience without requiring direct programming or supervision. To learn, an agent interacts with an environment by acting and observing the resulting states and rewards. In most practical applications, an environment is implemented as a virtual system due to cost, time, and safety concerns. Simultaneously, multibody system dynamics (MSD) is a framework for efficiently and systematically developing virtual systems of arbitrary complexity. MSD is commonly used to create virtual models of robots, vehicles, machinery, and humans. The features of RL and MSD make them perfect companions in building sophisticated, automated, and autonomous mechatronic systems. The research demonstrates the use of RL in controlling multibody systems. While AI methods are used to solve some of the most challenging tasks in engineering, their proper understanding and implementation are demanding. Therefore, we introduce and detail three commonly used RL algorithms to control the inverted N-pendulum on the cart. Single-, double-, and triple-pendulum configurations are investigated, showing the capability of RL methods to handle increasingly complex dynamical systems. We show 2D state space zones where the agent succeeds or fails the stabilization. Despite passing randomized tests during training, blind spots may occur where the agent's policy fails. Results confirm that RL is a versatile, although complex, control engineering approach.
Pages: 25
Related papers
50 results in total
  • [1] EVALUATION OF RELIABILITY OF MECHANICAL SYSTEMS.
    Chegodaev, D.E.
    Samsonov, V.N.
    Problemy Prochnosti, 1987, (12): : 100 - 102
  • [2] Reinforcement learning for thermal and reliability management in manycore systems
    Weber, Iacana Ianiski
    Zanini, Vitor Balbinot
    Moraes, Fernando Gehm
    DESIGN AUTOMATION FOR EMBEDDED SYSTEMS, 2025, 29 (01)
  • [3] Adaptation to creation: Progress of organizational learning and increasing complexity of learning systems
    Nair, KU
    SYSTEMS RESEARCH AND BEHAVIORAL SCIENCE, 2001, 18 (06) : 505 - 521
  • [4] Reduction of mechanical Complexity while Increasing the Functionality of the Headlight Systems
    Boehm, Gerald
    Bemmer, Christian
    Moser, Andreas
    OPTISCHE TECHNOLOGIEN IN DER FAHRZEUGTECHNIK, 2012, 2154 : 59 - 71
  • [5] Multigrid methods for policy evaluation and reinforcement learning
    Ziv, O
    Shimkin, N
    2005 IEEE INTERNATIONAL SYMPOSIUM ON INTELLIGENT CONTROL & 13TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION, VOLS 1 AND 2, 2005, : 1391 - 1396
  • [6] Evaluation of Reinforcement Learning Methods for a Self-learning System
    Bechtold, David
    Wendt, Alexander
    Jantsch, Axel
    ICAART: PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 2, 2020, : 36 - 47
  • [7] Tracking and stabilization of Mechanical Systems using Reinforcement Learning
    Bhuvaneswari, S.
    Pasumarthy, Ramkrishna
    Ravindran, Balaraman
    Mahindrakar, Arun D.
    2018 INDIAN CONTROL CONFERENCE (ICC), 2018, : 206 - 211
  • [8] Empirical evaluation methods for multiobjective reinforcement learning algorithms
    Vamplew, Peter
    Dazeley, Richard
    Berry, Adam
    Issabekov, Rustam
    Dekker, Evan
    MACHINE LEARNING, 2011, 84 (1-2) : 51 - 80
  • [10] Methods for reliability evaluation of trust and reputation systems
    Janiszewski, Marek B.
    PHOTONICS APPLICATIONS IN ASTRONOMY, COMMUNICATIONS, INDUSTRY, AND HIGH-ENERGY PHYSICS EXPERIMENTS 2016, 2016, 10031