Scene understanding using natural language description based on 3D semantic graph map

被引：0

作者：

Jiyoun Moon

Beomhee Lee

机构：

[1] Seoul National University,Automation and Systems Research Institute, Department of Electrical Engineering

来源：

Intelligent Service Robotics | 2018年 / 11卷

关键词：

Scene understanding; Natural language description; 3D semantic graph map;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

A natural language description for working environment understanding is an important component in human–robot communication. Although 3D semantic graph mappings are widely studied for perceptual aspects of the environment, these approaches hardly apply to the communication issues such as natural language descriptions for a semantic graph map. There are many researches on workspace understanding over images in the field of computer vision, which automatically generate sentences while they usually never utilize multiple scenes and 3D information. In this paper, we introduce a novel natural language description method using 3D semantic graph map. An object-oriented semantic graph map is first constructed using 3D information. A graph convolutional neural network and a recurrent neural network are then used to generate a description of the map. A natural language sentence focusing on objects over 3D semantic graph map can be eventually generated consisting of a single scene or multiple scenes. We validate the proposed method using publicly available dataset and compare it with conventional methods.

引用

页码：347 / 354

页数：7

共 50 条

[1] Scene understanding using natural language description based on 3D semantic graph map
Moon, Jiyoun
Lee, Beomhee
INTELLIGENT SERVICE ROBOTICS, 2018, 11 (04) : 347 - 354
[2] 3D Visualization of Simple Natural Language Statement Using Semantic Description
Kadir, Rabiah Abdul
Hashim, Abdul Rahman Mad
Wirza, Rahmita
Mustapha, Aida
VISUAL INFORMATICS: SUSTAINING RESEARCH AND INNOVATIONS, PT I, 2011, 7066 : 36 - 44
[3] Research and realization of the layout of objects in 3D scene based on natural language understanding
Li, Han-Jing
Li, Sheng
Zhao, Tie-Jun
Han, Yan-Hai
Ye, Li-Jun
Li, Li
Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2007, 29 (08): : 1845 - 1849
[4] Language-Assisted 3D Feature Learning for Semantic Scene Understanding
Zhang, Junbo
Fan, Guofan
Wang, Guanghan
Su, Zhengyuan
Ma, Kaisheng
Yi, Li
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023, : 3445 - 3453
[5] An XML-based scene description language for 3D virtual museum
Lu, Wei
Zeng, Dinghao
Pan, Jingui
PROCEEDINGS OF THE ITI 2008 30TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY INTERFACES, 2008, : 445 - 450
[6] SGFormer: Semantic Graph Transformer for Point Cloud-Based 3D Scene Graph Generation
Lv, Changsheng
Qi, Mengshi
Li, Xia
Yang, Zhengyuan
Ma, Huadong
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 5, 2024, : 4035 - 4043
[7] Using Ignorance in 3D Scene Understanding
Harasymowicz-Boggio, Bogdan
Siemiatkowska, Barbara
MATHEMATICAL PROBLEMS IN ENGINEERING, 2014, 2014
[8] 3D Scene Understanding at Urban Intersection using Stereo Vision and Digital Map
Bhattacharyya, Prarthana
Gu, Yanlei
Bao, Jiali
Liu, Xu
Kamijo, Shunsuke
2017 IEEE 85TH VEHICULAR TECHNOLOGY CONFERENCE (VTC SPRING), 2017,
[9] Semantic Scene Manipulation Based on 3D Spatial Object Relations and Language Instructions
Kartmann, Rainer
Liu, Danqing
Asfour, Tamim
PROCEEDINGS OF THE 2020 IEEE-RAS 20TH INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS (HUMANOIDS 2020), 2021, : 306 - 313
[10] Unbiased 3D Semantic Scene Graph Prediction in Point Cloud Using Deep Learning
Han, Chaolin
Li, Hongwei
Xu, Jian
Dong, Bing
Wang, Yalin
Zhou, Xiaowen
Zhao, Shan
APPLIED SCIENCES-BASEL, 2023, 13 (09):

← 1 2 3 4 5 →