Co-imitation: Learning Design and Behaviour by Imitation

Authors
Rajani, Chang [1, 2]
Arndt, Karol [2]
Blanco-Mulero, David [2]
Luck, Kevin Sebastian [2, 3]
Kyrki, Ville [2]
Affiliations
[1] Univ Helsinki, Dept Comp Sci, Helsinki, Finland
[2] Aalto Univ, Dept Elect Engn & Automat EEA, Espoo, Finland
[3] Finnish Ctr Artificial Intelligence, Espoo, Finland
Funding
Academy of Finland
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The co-adaptation of robots has been a long-standing research endeavour with the goal of adapting both body and behaviour of a system for a given task, inspired by the natural evolution of animals. Co-adaptation has the potential to eliminate costly manual hardware engineering as well as improve the performance of systems. The standard approach to co-adaptation is to use a reward function for optimizing behaviour and morphology. However, defining and constructing such reward functions is notoriously difficult and often a significant engineering effort. This paper introduces a new viewpoint on the co-adaptation problem, which we call co-imitation: finding a morphology and a policy that allow an imitator to closely match the behaviour of a demonstrator. To this end we propose a co-imitation methodology for adapting behaviour and morphology by matching state distributions of the demonstrator. Specifically, we focus on the challenging scenario with mismatched state- and action-spaces between both agents. We find that co-imitation increases behaviour similarity across a variety of tasks and settings, and demonstrate co-imitation by transferring human walking, jogging and kicking skills onto a simulated humanoid.
Pages: 6200-6208
Number of pages: 9