Scene understanding using natural language description based on 3D semantic graph map

Authors
Jiyoun Moon
Beomhee Lee
Affiliations
[1] Seoul National University, Automation and Systems Research Institute, Department of Electrical Engineering
Source
Intelligent Service Robotics | 2018 / Vol. 11
Keywords
Scene understanding; Natural language description; 3D semantic graph map
DOI
Not available
Abstract
A natural language description of the working environment is an important component of human–robot communication. Although 3D semantic graph mapping is widely studied for the perceptual aspects of the environment, these approaches are rarely applied to communication problems such as generating natural language descriptions of a semantic graph map. Much research in computer vision addresses workspace understanding from images and automatically generates sentences, but it usually does not use multiple scenes or 3D information. In this paper, we introduce a novel natural language description method using a 3D semantic graph map. An object-oriented semantic graph map is first constructed using 3D information. A graph convolutional neural network and a recurrent neural network are then used to generate a description of the map. The method can thus generate a natural language sentence that focuses on the objects in the 3D semantic graph map, covering either a single scene or multiple scenes. We validate the proposed method on a publicly available dataset and compare it with conventional methods.
Pages: 347 - 354
Page count: 7
Related papers
50 in total
  • [1] Scene understanding using natural language description based on 3D semantic graph map
    Moon, Jiyoun
    Lee, Beomhee
    INTELLIGENT SERVICE ROBOTICS, 2018, 11 (04) : 347 - 354
  • [2] 3D Visualization of Simple Natural Language Statement Using Semantic Description
    Kadir, Rabiah Abdul
    Hashim, Abdul Rahman Mad
    Wirza, Rahmita
    Mustapha, Aida
    VISUAL INFORMATICS: SUSTAINING RESEARCH AND INNOVATIONS, PT I, 2011, 7066 : 36 - 44
  • [3] Research and realization of the layout of objects in 3D scene based on natural language understanding
    Li, Han-Jing
    Li, Sheng
    Zhao, Tie-Jun
    Han, Yan-Hai
    Ye, Li-Jun
    Li, Li
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2007, 29 (08) : 1845 - 1849
  • [4] Language-Assisted 3D Feature Learning for Semantic Scene Understanding
    Zhang, Junbo
    Fan, Guofan
    Wang, Guanghan
    Su, Zhengyuan
    Ma, Kaisheng
    Yi, Li
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023 : 3445 - 3453
  • [5] An XML-based scene description language for 3D virtual museum
    Lu, Wei
    Zeng, Dinghao
    Pan, Jingui
    PROCEEDINGS OF THE ITI 2008 30TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY INTERFACES, 2008 : 445 - 450
  • [6] SGFormer: Semantic Graph Transformer for Point Cloud-Based 3D Scene Graph Generation
    Lv, Changsheng
    Qi, Mengshi
    Li, Xia
    Yang, Zhengyuan
    Ma, Huadong
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 5, 2024 : 4035 - 4043
  • [7] Using Ignorance in 3D Scene Understanding
    Harasymowicz-Boggio, Bogdan
    Siemiatkowska, Barbara
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2014, 2014
  • [8] 3D Scene Understanding at Urban Intersection using Stereo Vision and Digital Map
    Bhattacharyya, Prarthana
    Gu, Yanlei
    Bao, Jiali
    Liu, Xu
    Kamijo, Shunsuke
    2017 IEEE 85TH VEHICULAR TECHNOLOGY CONFERENCE (VTC SPRING), 2017
  • [9] Semantic Scene Manipulation Based on 3D Spatial Object Relations and Language Instructions
    Kartmann, Rainer
    Liu, Danqing
    Asfour, Tamim
    PROCEEDINGS OF THE 2020 IEEE-RAS 20TH INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS (HUMANOIDS 2020), 2021 : 306 - 313
  • [10] Unbiased 3D Semantic Scene Graph Prediction in Point Cloud Using Deep Learning
    Han, Chaolin
    Li, Hongwei
    Xu, Jian
    Dong, Bing
    Wang, Yalin
    Zhou, Xiaowen
    Zhao, Shan
    APPLIED SCIENCES-BASEL, 2023, 13 (09)