The Altruistic Robot: Do What I Want, Not Just What I Say

被引:0
|
作者
Billingsley, Richard [1 ]
Billingsley, John [1 ]
Gardenfors, Peter [1 ]
Peppas, Pavlos [1 ]
Prade, Henri [1 ]
Skillicorn, David [1 ]
Williams, Mary-Anne [1 ]
机构
[1] Univ Technol Sydney, Ultimo, Australia
来源
关键词
DECISION;
D O I
10.1007/978-3-319-67582-4_11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As autonomous robots expand their application beyond research labs and production lines, they must work in more flexible and less well defined environments. To escape the requirement for exhaustive instruction and stipulated preference ordering, a robot's operation must involve choices between alternative actions, guided by goals. We describe a robot that learns these goals from humans by considering the timeliness and context of instructions and rewards as evidence of the contours and gradients of an unknown human utility function. In turn, this underlies a choice-theory based rational preference relationship. We examine how the timing of requests, and contexts in which they arise, can lead to actions that pre-empt requests using methods we term contemporaneous entropy learning and context sensitive learning. We provide experiments on these two methods to demonstrate their usefulness in guiding a robot's actions.
引用
收藏
页码:149 / 162
页数:14
相关论文
共 50 条