Better vision through manipulation

被引:83
|
作者
Metta, G
Fitzpatrick, P
机构
[1] Univ Genoa, DIST, LIRA Lab, I-16145 Genoa, Italy
[2] MIT, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA
关键词
humanoid robotics; active segmentation; epigenesis;
D O I
10.1177/10597123030112004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Vision and manipulation are inextricably intertwined in the primate brain. Tantalizing results from neuroscience are shedding light on the mixed motor and sensory representations used by the brain during reaching, grasping, and object recognition. We now know a great deal about what happens in the brain during these activities, but not necessarily why. Is the integration we see functionally important, or just a reflection of evolution's lack of enthusiasm for sharp modularity? We wish to instantiate these results in robotic form to probe the technical advantages and to find any lacunae in existing models. We believe it would be missing the point to investigate this on a platform where dextrous manipulation and sophisticated machine vision are already implemented in their mature form, and instead follow a developmental approach from simpler primitives. We begin with a precursor to manipulation, simple poking and prodding, and show how it facilitates object segmentation, a long-standing problem in machine vision. The robot can familiarize itself with the objects in its environment by acting upon them. It can then recognize other actors (such as humans) in the environment through their effect on the objects it has learned about. We argue that following causal chains of events out from the robot's body into the environment allows for a very natural developmental progression of visual competence, and we relate this idea to results in neuroscience.
引用
收藏
页码:109 / 128
页数:20
相关论文
共 50 条
  • [1] Better vision through better borescopes
    Quality, 1998, 37 (01):
  • [2] Grounding vision through experimental manipulation
    Fitzpatrick, P
    Metta, G
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2003, 361 (1811): : 2165 - 2185
  • [3] Object Learning through Interactive Manipulation and Foveated Vision
    Bevec, Robert
    Ude, Ales
    2013 13TH IEEE-RAS INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS (HUMANOIDS), 2013, : 234 - 239
  • [4] Accurate Vision-based Manipulation through Contact Reasoning
    Kloss, Alina
    Bauza, Maria
    Wu, Jiajun
    Tenenbaum, Joshua B.
    Rodriguez, Alberto
    Bohg, Jeannette
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 6738 - 6744
  • [5] BETTER VISION
    ROSE, DJ
    AMERICAN BAR ASSOCIATION JOURNAL, 1983, 69 (OCT): : 1348 - 1348
  • [6] 'BETTER VISION'
    OLES, C
    POETRY, 1981, 138 (04) : 226 - 226
  • [7] Manipulation with no or partial vision
    Purdy, KA
    Lederman, SJ
    Klatzky, RL
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 1999, 25 (03) : 755 - 774
  • [8] Healthier Lives Through Better Sleep: A New Vision for the AASM Foundation
    Martin, Jennifer L.
    JOURNAL OF CLINICAL SLEEP MEDICINE, 2019, 15 (05): : 679 - 681
  • [9] In-Hand Manipulation of Unseen Objects Through 3D Vision
    Pereira, Martim
    Dimou, Dimitrios
    Moreno, Plinio
    ROBOT2022: FIFTH IBERIAN ROBOTICS CONFERENCE: ADVANCES IN ROBOTICS, VOL 1, 2023, 589 : 163 - 174
  • [10] Better technology for neuronal manipulation
    不详
    NATURE BIOMEDICAL ENGINEERING, 2018, 2 (07): : 465 - 466