Structured Neural Motifs: Scene Graph Parsing via Enhanced Context

被引:3
|
作者
Li, Yiming [1 ,4 ]
Yang, Xiaoshan [2 ,3 ,4 ]
Xu, Changsheng [1 ,2 ,3 ,4 ]
机构
[1] HeFei Univ Technol, Hefei, Peoples R China
[2] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing, Peoples R China
[3] Univ Chinese Acad Sci, Beijing, Peoples R China
[4] Peng Cheng Lab, Shenzhen, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
Scene graph; Deep learning; LSTMs;
D O I
10.1007/978-3-030-37734-2_15
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene graph is one kind of structured representation of the visual content in an image. It is helpful for complex visual understanding tasks such as image captioning, visual question answering and semantic image retrieval. Since the real-world images always have multiple object instances and complex relationships, the context information is extremely important for scene graph generation. It has been noted that the context dependencies among different nodes in the scene graph are asymmetric, which meas it is highly possible to directly predict relationship labels based on object labels but not vice-versa. Based on this finding, the existing motifs network has successfully exploited the context patterns among object nodes and the dependencies between the object nodes and the relation nodes. However, the spatial information and the context dependencies among relation nodes are neglected. In this work, we propose Structured Motif Network (StrcMN) which predicts object labels and pairwise relationships by mining more complete global context features. The experiments show that our model significantly outperforms previous methods on the VRD and Visual Genome datasets.
引用
收藏
页码:175 / 188
页数:14
相关论文
共 50 条
  • [41] Preserving details in semantics-aware context for scene parsing
    Ma, Shuai
    Pang, Yanwei
    Pan, Jing
    Shao, Ling
    SCIENCE CHINA-INFORMATION SCIENCES, 2020, 63 (02)
  • [42] Preserving details in semantics-aware context for scene parsing
    Shuai MA
    Yanwei PANG
    Jing PAN
    Ling SHAO
    ScienceChina(InformationSciences), 2020, 63 (02) : 79 - 92
  • [43] Parsing Strategies for Context-Sensitive Graph Grammars
    Zou, Yang
    Zeng, Xiaoqin
    Liu, Yufeng
    PROCEEDINGS OF THE 12TH INTERNATIONAL SYMPOSIUM ON VISUAL INFORMATION COMMUNICATION AND INTERACTION, VINCI 2019, 2019,
  • [44] Preserving details in semantics-aware context for scene parsing
    Shuai Ma
    Yanwei Pang
    Jing Pan
    Ling Shao
    Science China Information Sciences, 2020, 63
  • [45] Revisiting Structured Sentiment Analysis as Latent Dependency Graph Parsing
    Zhou, Chengjie
    Li, Bobo
    Fei, Hao
    Li, Fei
    Ten, Chong
    Ji, Donghong
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 10178 - 10191
  • [46] Scene parsing using graph matching on street-view data
    Yu, Tianshu
    Wang, Ruisheng
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2016, 145 : 70 - 80
  • [47] Improving Weakly Supervised Scene Graph Parsing through Object Grounding
    Zhang, Yizhou
    Zheng, Zhaoheng
    Nevatia, Ram
    Liu, Yan
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 4058 - 4064
  • [48] Enhanced Context Learning with Transformer for Human Parsing
    Song, Jingya
    Shi, Qingxuan
    Li, Yihang
    Yang, Fang
    APPLIED SCIENCES-BASEL, 2022, 12 (15):
  • [49] Motif-Backdoor: Rethinking the Backdoor Attack on Graph Neural Networks via Motifs
    Zheng, Haibin
    Xiong, Haiyang
    Chen, Jinyin
    Ma, Haonan
    Huang, Guohan
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (02): : 2479 - 2493
  • [50] Semantic-enhanced graph neural networks with global context representation
    Qian, Youcheng
    Yin, Xueyan
    MACHINE LEARNING, 2024, 113 (10) : 7761 - 7781