Teachable robots: Understanding human teaching behavior to build more effective robot learners

被引:213
|
作者
Thomaz, Andrea L. [1 ]
Breazeal, Cynthia [1 ]
机构
[1] Georgia Inst Technol, Atlanta, GA 30332 USA
关键词
human-robot interaction; reinforcement learning; user studies;
D O I
10.1016/j.artint.2007.09.009
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While Reinforcement Learning (RL) is not traditionally designed for interactive supervisory input from a human teacher, several works in both robot and software agents have adapted it for human input by letting a human trainer control the reward signal. In this work, we experimentally examine the assumption underlying these works, namely that the human-given reward is compatible with the traditional RL reward signal. We describe an experimental platform with a simulated RL robot and present an analysis of real-time human teaching behavior found in a study in which untrained subjects taught the robot to perform a new task. We report three main observations on how people administer feedback when teaching a Reinforcement Learning agent: (a) they use the reward channel not only for feedback, but also for future-directed guidance; (b) they have a positive bias to their feedback, possibly using the signal as a motivational channel; and (c) they change their behavior as they develop a mental model of the robotic learner. Given this, we made specific modifications to the simulated RL robot, and analyzed and evaluated its learning behavior in four follow-up experiments with human trainers. We report significant improvements on several learning measures. This work demonstrates the importance of understanding the human-teacher/robot-learner partnership in order to design algorithms that support how people want to teach and simultaneously improve the robot's learning behavior. (c) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:716 / 737
页数:22
相关论文
共 50 条
  • [41] UNDERSTANDING HUMAN-BEHAVIOR FOR EFFECTIVE POLICE WORK - RUSSELL,HE AND BEIGEL,A
    不详
    POLICE CHIEF, 1976, 43 (07): : 78 - 78
  • [42] UNDERSTANDING HUMAN-BEHAVIOR FOR EFFECTIVE POLICE WORK - RUSSELL,HE AND BEIGEL,A
    MODLIN, HC
    HOSPITAL AND COMMUNITY PSYCHIATRY, 1977, 28 (02): : 142 - 142
  • [43] Prediction of human behavior in human-robot interaction using psychological scales for anxiety and negative attitudes toward robots
    Nomura, Tatsuya
    Kanda, Takayuki
    Suzuki, Tomohiro
    Kato, Kensuke
    IEEE TRANSACTIONS ON ROBOTICS, 2008, 24 (02) : 442 - 451
  • [44] Communicating robots' intent through visual cues enhances human anticipatory behavior in human-dual robot collaboration
    Psarakis, Loizos
    Nathanael, Dimitris
    Marmaras, Nicolas
    ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2025, 92
  • [45] A Path Planning Behavior Learning of Six Legged Robot Based on Human Teaching Model
    Sulistijono, Indra Adji
    Kurniawan, Marianus Bayu Aji
    Setiawardhana
    Prasetiono, Eka
    PROCEEDINGS OF THE 2013 IEEE WORKSHOP ON ROBOTIC INTELLIGENCE IN INFORMATIONALLY STRUCTURED SPACE (RIISS), 2013, : 108 - 113
  • [46] A Collaborative Homeostatic-Based Behavior Controller for Social Robots in Human-Robot Interaction Experiments
    Cao, Hoang-Long
    Esteban, Pablo Gomez
    Albert, De Beir
    Simut, Ramona
    Van de Perre, Greet
    Lefeber, Dirk
    Vanderborght, Bram
    INTERNATIONAL JOURNAL OF SOCIAL ROBOTICS, 2017, 9 (05) : 675 - 690
  • [47] ROBEE: A homeostatic-based social behavior controller for robots in Human-Robot Interaction experiments
    Cao, Hoang-Long
    Esteban, Pablo Gomez
    De Beir, Albert
    Simut, Ramona
    Van de Perre, Greet
    Lefeber, Dirk
    Vanderborght, Bram
    2014 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS IEEE-ROBIO 2014, 2014, : 516 - 521
  • [48] Does the Correspondence Bias Apply to Social Robots?: Dispositional and Situational Attributions of Human Versus Robot Behavior
    Edwards, Autumn
    Edwards, Chad
    FRONTIERS IN ROBOTICS AND AI, 2022, 8
  • [49] Self-Explaining Social Robots: An Explainable Behavior Generation Architecture for Human-Robot Interaction
    Stange, Sonja
    Hassan, Teena
    Schroeder, Florian
    Konkol, Jacqueline
    Kopp, Stefan
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2022, 5
  • [50] Mental Schemas of Robots as More Human-Like Are Associated with Higher Blood Pressure and Negative Emotions in a Human-Robot Interaction
    Broadbent, Elizabeth
    Lee, Yong In
    Stafford, Rebecca Q.
    Kuo, I. Han
    MacDonald, Bruce A.
    INTERNATIONAL JOURNAL OF SOCIAL ROBOTICS, 2011, 3 (03) : 291 - 297