Co-imitation: Learning Design and Behaviour by Imitation

被引:0
|
作者
Rajani, Chang [1 ,2 ]
Arndt, Karol [2 ]
Blanco-Mulero, David [2 ]
Luck, Kevin Sebastian [2 ,3 ]
Kyrki, Ville [2 ]
机构
[1] Univ Helsinki, Dept Comp Sci, Helsinki, Finland
[2] Aalto Univ, Dept Elect Engn & Automat EEA, Espoo, Finland
[3] Finnish Ctr Artificial Intelligence, Espoo, Finland
基金
芬兰科学院;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The co-adaptation of robots has been a long-standing research endeavour with the goal of adapting both body and behaviour of a system for a given task, inspired by the natural evolution of animals. Co-adaptation has the potential to eliminate costly manual hardware engineering as well as improve the performance of systems. The standard approach to co-adaptation is to use a reward function for optimizing behaviour and morphology. However, defining and constructing such reward functions is notoriously difficult and often a significant engineering effort. This paper introduces a new viewpoint on the co-adaptation problem, which we call co-imitation: finding a morphology and a policy that allow an imitator to closely match the behaviour of a demonstrator. To this end we propose a co-imitation methodology for adapting behaviour and morphology by matching state distributions of the demonstrator. Specifically, we focus on the challenging scenario with mismatched state- and action-spaces between both agents. We find that co-imitation increases behaviour similarity across a variety of tasks and settings, and demonstrate co-imitation by transferring human walking, jogging and kicking skills onto a simulated humanoid.
引用
收藏
页码:6200 / 6208
页数:9
相关论文
共 50 条
  • [31] Imitation Game: A Model-based and Imitation Learning Deep Reinforcement Learning Hybrid
    Veith, Eric Msp
    Logemann, Torben
    Berezin, Aleksandr
    Wellssow, Arlena
    Balduin, Stephan
    2024 12TH WORKSHOP ON MODELING AND SIMULATION OF CYBER-PHYSICAL ENERGY SYSTEMS, MSCPES, 2024,
  • [32] Humanoid Behaviour Learning through Visuomotor Association by Self-Imitation
    Dawood, Farhan
    Loo, Chu Kiong
    2014 JOINT 7TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS (SCIS) AND 15TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS (ISIS), 2014, : 922 - 929
  • [33] Imitation Is Not Enough: Robustifying Imitation with Reinforcement Learning for Challenging Driving Scenarios
    Lu, Yiren
    Fu, Justin
    Tucker, George
    Pan, Xinlei
    Bronstein, Eli
    Roelofs, Rebecca
    Sapp, Benjamin
    White, Brandyn
    Faust, Aleksandra
    Whiteson, Shimon
    Anguelov, Dragomir
    Levine, Sergey
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 7553 - 7560
  • [34] Teacher-learner interaction quantifies scaffolding behaviour in imitation learning
    Okazaki, Shuntaro
    Muraoka, Yoshihiro
    Osu, Rieko
    SCIENTIFIC REPORTS, 2019, 9 (1)
  • [35] Teacher-learner interaction quantifies scaffolding behaviour in imitation learning
    Shuntaro Okazaki
    Yoshihiro Muraoka
    Rieko Osu
    Scientific Reports, 9
  • [36] Vaccinating behaviour guided by imitation and aspiration
    Arefin, Md. Rajib
    Masaki, Tanaka
    Tanimoto, Jun
    PROCEEDINGS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2020, 476 (2239):
  • [37] Imitation of successful behaviour in Cournot markets
    Bosch-Domènech, A
    Vriend, NJ
    ECONOMIC JOURNAL, 2003, 113 (487): : 495 - 524
  • [38] Imitation dynamics predict vaccinating behaviour
    Bauch, CT
    PROCEEDINGS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2005, 272 (1573) : 1669 - 1675
  • [39] Imitation in human and animal behaviour.
    Leyhausen, P
    ETHOLOGY, 1998, 104 (05) : 453 - 454
  • [40] Imitation Learning: A Survey of Learning Methods
    Hussein, Ahmed
    Gaber, Mohamed Medhat
    Elyan, Eyad
    Jayne, Chrisina
    ACM COMPUTING SURVEYS, 2017, 50 (02)