Scene Graph Generation With Hierarchical Context

被引:22
|
作者
Ren, Guanghui [1 ,2 ]
Ren, Lejian [1 ]
Liao, Yue [3 ]
Liu, Si [3 ]
Li, Bo [3 ]
Han, Jizhong [1 ]
Yan, Shuicheng [4 ]
机构
[1] Chinese Acad Sci, Inst Informat Engn, Beijing 100093, Peoples R China
[2] Chinese Acad Sci, Shenzhen Inst Adv Technol SIAT, Guangdong Prov Key Lab Comp Vis & Virtual Real Te, Shenzhen 518055, Peoples R China
[3] Beihang Univ, Sch Comp Sci & Engn, Beijing 100191, Peoples R China
[4] YITU Technol, Beijing 100086, Peoples R China
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
Correlation; Feature extraction; Depression; Visualization; Learning systems; Silicon; Generative adversarial networks; Attention mechanism; context aggregation; scene graph generation;
D O I
10.1109/TNNLS.2020.2979270
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene graph generation has received increasing attention in recent years. Enhancing the predicate representations is an important entry point to this task. There are various methods to fully investigate the context of representation enhancement. In this brief, we analyze the decisive factors that can significantly affect the relation detection results. Our analysis shows that spatial correlations between objects, focused regions of objects, and global hints related to the relations have strong influences in relation prediction and contradiction elimination. Based on our analysis, we propose a hierarchical context network (HCNet) to generate a scene graph. HCNet consists of three contexts, including interaction context, depression context, and global context, which integrates information from pair, object, and graph levels. The experiments show that our method outperforms the state-of-the-art methods on the Visual Genome (VG) data set.
引用
收藏
页码:909 / 915
页数:7
相关论文
共 50 条
  • [1] Exploring and Exploiting the Hierarchical Structure of a Scene for Scene Graph Generation
    Kurosawa, Ikuto
    Kobayashi, Tetsunori
    Hayashi, Yoshihiko
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 1422 - 1429
  • [2] Multimodal Context Embedding for Scene Graph Generation
    Jung, Gayoung
    Kim, Incheol
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2020, 16 (06): : 1250 - 1260
  • [3] HIG: Hierarchical Interlacement Graph Approach to Scene Graph Generation in Video Understanding
    Trong-Thuan Nguyen
    Pha Nguyen
    Luu, Khoa
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 18384 - 18394
  • [4] Scene Adaptive Context Modeling and Balanced Relation Prediction for Scene Graph Generation
    Xu, Kai
    Wang, Lichun
    Li, Shuang
    Gao, Tong
    Yin, Baocai
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2025, 21 (03)
  • [5] Target Adaptive Context Aggregation for Video Scene Graph Generation
    Teng, Yao
    Wang, Limin
    Li, Zhifeng
    Wu, Gangshan
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 13668 - 13677
  • [6] Hypercomplex context guided interaction modeling for scene graph generation
    Wang, Zheng
    Xu, Xing
    Luo, Yadan
    Wang, Guoqing
    Yang, Yang
    PATTERN RECOGNITION, 2023, 141
  • [7] Exploring Context and Visual Pattern of Relationship for Scene Graph Generation
    Wang, Wenbin
    Wang, Ruiping
    Shan, Shiguang
    Chen, Xilin
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 8180 - 8189
  • [8] Augmented Spatial Context Fusion Network for Scene Graph Generation
    Xu, Hongbo
    Wang, LiChun
    Xu, Kai
    Fu, Fangyu
    Yin, Baocai
    Huang, Qingming
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [9] Fast Contextual Scene Graph Generation with Unbiased Context Augmentation
    Jin, Tianlei
    Guo, Fangtai
    Meng, Qiwei
    Zhu, Shiqiang
    Xi, Xiangming
    Wang, Wen
    Mu, Zonghao
    Song, Wei
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 6302 - 6311
  • [10] Scene Graph Generation Based on Shuffle Residual Context Information
    Lin X.
    Tian X.
    Ji Y.
    Xu Y.
    Liu C.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2019, 56 (08): : 1721 - 1730