Learning Exploration Strategies to Solve Real-World Marble Runs

被引:0
|
作者
Allaire, Alisa [1 ]
Atkeson, Christopher G. [1 ]
机构
[1] Carnegie Mellon Univ, Robot Inst, Pittsburgh, PA 15213 USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/ICRA48891.2023.10160759
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Tasks involving locally unstable or discontinuous dynamics (such as bifurcations and collisions) remain challenging in robotics, because small variations in the environment can have a significant impact on task outcomes. For such tasks, learning a robust deterministic policy is difficult. We focus on structuring exploration with multiple stochastic policies based on a mixture of experts (MoE) policy representation that can be efficiently adapted. The MoE policy is composed of stochastic sub-policies that allow exploration of multiple distinct regions of the action space (or strategies) and a high-level selection policy to guide exploration towards the most promising regions. We develop a robot system to evaluate our approach in a real-world physical problem solving domain. After training the MoE policy in simulation, online learning in the real world demonstrates efficient adaptation within just a few dozen attempts, with a minimal sim2real gap. Our results confirm that representing multiple strategies promotes efficient adaptation in new environments and strategies learned under different dynamics can still provide useful information about where to look for good strategies.
引用
收藏
页码:7243 / 7249
页数:7
相关论文
共 50 条
  • [1] Learning by Exploration: New Challenges in Real-World Environments
    Wu, Qingyun
    Wang, Huazheng
    Wang, Hongning
    KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 3575 - 3576
  • [2] Strategies to Turn Real-world Data Into Real-world Knowledge
    Hong, Julian C.
    JAMA NETWORK OPEN, 2021, 4 (10)
  • [3] A Review on Machine Learning Strategies for Real-World Engineering Applications
    Jhaveri, Rutvij H.
    Revathi, A.
    Ramana, Kadiyala
    Raut, Roshani
    Dhanaraj, Rajesh Kumar
    MOBILE INFORMATION SYSTEMS, 2022, 2022
  • [4] Active Learning Strategies on a Real-World Thyroid Ultrasound Dataset
    Sreedhar, Hari
    Lajoinie, Guillaume P. R.
    Raffaelli, Charles
    Delingette, Herve
    DATA AUGMENTATION, LABELLING, AND IMPERFECTIONS, DALI 2023, 2024, 14379 : 127 - 136
  • [5] Real-World Learning Control for Autonomous Exploration of a Biomimetic Robotic Shark
    Yan, Shuaizheng
    Wu, Zhengxing
    Wang, Jian
    Huang, Yupei
    Tan, Min
    Yu, Junzhi
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2023, 70 (04) : 3966 - 3974
  • [6] Directed Real-World Learned Exploration
    Hutsebaut-Buysse, Matthias
    Guinjoan, Ferran Gebelli
    Rademakers, Erwin
    Latre, Steven
    Temsamani, Abdellatif Bey
    Mets, Kevin
    Mannens, Erik
    De Schepper, Tom
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 5227 - 5234
  • [7] Deep learning and alternative learning strategies for retrospective real-world clinical data
    David Chen
    Sijia Liu
    Paul Kingsbury
    Sunghwan Sohn
    Curtis B. Storlie
    Elizabeth B. Habermann
    James M. Naessens
    David W. Larson
    Hongfang Liu
    npj Digital Medicine, 2
  • [8] Deep learning and alternative learning strategies for retrospective real-world clinical data
    Chen, David
    Liu, Sijia
    Kingsbury, Paul
    Sohn, Sunghwan
    Storlie, Curtis B.
    Habermann, Elizabeth B.
    Naessens, James M.
    Larson, David W.
    Liu, Hongfang
    NPJ DIGITAL MEDICINE, 2019, 2 (1)
  • [9] Practical Applications of NMR to Solve Real-World Problems
    Brinson, Robert G.
    MOLECULES, 2021, 26 (23):
  • [10] Learning With Real-World Data
    不详
    IEEE CONTROL SYSTEMS MAGAZINE, 2023, 43 (05): : 158 - 159