Semantic maps from multiple visual cues

被引:28
|
作者
Kostavelis, Ioannis [1 ]
Gasteratos, Antonios [1 ]
机构
[1] Democritus Univ Thrace, Prod & Management Engn Dept, Lab Robot & Automat, 12 Vas Sophias Str, GR-67100 Xanthi, Greece
关键词
Semantic map; 3D object map; 3D topological map; Place classification; Object recognition; SLAM; ROBUST PLACE RECOGNITION; ROBOT; CLASSIFICATION;
D O I
10.1016/j.eswa.2016.10.014
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Future service robots targeted to operate in domestic or industrial environment and in close collaboration with humans should possess the ability to produce meaningful internal perceptual representations of their own surroundings, enabling them to fulfill a variety of real-world tasks. For this purpose, we present here a semantic mapping framework featuring geometrical and semasiological attributes that reveal the relationships between objects and places in a real-life environment. The geometrical component consists of a 3D metric map, onto which a topological map is deployed. The semasiological part is realized by putting together a place recognition algorithm and an object recognition one. The categorization of the different places relies on the resolution of appearance-based consistency histograms, while for the recognition of objects in the scene, a hierarchical temporal memory (HTM) network boosted by a saliency attentional model, is utilized. These semantic attributes are then deposited on the topological map to augment it with the belief distributions regarding the visited places, enabling thus the agent to act in an intelligent manner in human populated environments. Thus, the proposed framework outlines a proficient system in the construction of human conceivable environment representations, which has been successfully assessed on real-world scenarios, proving its ability to provide a consistent solution to the emerging problem of the human-robot cohabitation. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:45 / 57
页数:13
相关论文
共 50 条
  • [31] Multiple visual cues enhance quantitative perception in infancy
    Baker, Joseph M.
    Feigleson, Jessica M.
    Jordan, Kerry E.
    COGNITION IN FLUX, 2010, : 2799 - 2803
  • [32] A Probabilistic Approach to Integrating Multiple Cues in Visual Tracking
    Du, Wei
    Piater, Justus
    COMPUTER VISION - ECCV 2008, PT II, PROCEEDINGS, 2008, 5303 : 225 - 238
  • [33] Negotiating the semantic gap: from feature maps to semantic landscapes
    Zhao, R
    Grosky, WI
    PATTERN RECOGNITION, 2002, 35 (03) : 593 - 600
  • [34] Multiple Visual-Semantic Embedding for Video Retrieval from Query Sentence
    Nguyen, Huy Manh
    Miyazaki, Tomo
    Sugaya, Yoshihiro
    Omachi, Shinichiro
    APPLIED SCIENCES-BASEL, 2021, 11 (07):
  • [35] Adolescents' Developing Sensitivity to Orthographic and Semantic Cues During Visual Search for Words
    Vibert, Nicolas
    Braasch, Jason L. G.
    Darles, Daniel
    Potocki, Anna
    Ros, Christine
    Jaafari, Nematollah
    Rouet, Jean-Francois
    FRONTIERS IN PSYCHOLOGY, 2019, 10
  • [36] Movienet: a movie multilayer network model using visual and textual semantic cues
    Mourchid, Youssef
    Renoust, Benjamin
    Roupin, Olivier
    Le Van
    Cherifi, Hocine
    El Hassouni, Mohammed
    APPLIED NETWORK SCIENCE, 2019, 4 (01)
  • [37] Movienet: a movie multilayer network model using visual and textual semantic cues
    Youssef Mourchid
    Benjamin Renoust
    Olivier Roupin
    Lê Văn
    Hocine Cherifi
    Mohammed El Hassouni
    Applied Network Science, 4
  • [38] Linking visual cues and semantic terms under specific digital video domains
    Sánchez, JM
    Binefa, X
    Vitrià, J
    Radeva, P
    JOURNAL OF VISUAL LANGUAGES AND COMPUTING, 2000, 11 (03): : 253 - 271
  • [39] Semantic Linking Maps for Active Visual Object Search (Extended Abstract)
    Zeng, Zhen
    Roefer, Adrian
    Jenkins, Odest Chadwicke
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 4864 - 4868
  • [40] Towards a Model of Information Seeking by Integrating Visual, Semantic and Memory Maps
    Chanceaux, Myriam
    Guerin-Dugue, Anne
    Lemaire, Benoit
    Baccino, Thierry
    COGNITIVE VISION, 2008, 5329 : 65 - +