Graph R-CNN for Scene Graph Generation

被引:528
|
作者
Yang, Jianwei [1 ]
Lu, Jiasen [1 ]
Lee, Stefan [1 ]
Batra, Dhruv [1 ,2 ]
Parikh, Devi [1 ,2 ]
机构
[1] Georgia Inst Technol, Atlanta, GA 30332 USA
[2] Facebook AI Res, Menlo Pk, CA USA
来源
关键词
Graph R-CNN; Scene graph generation; Relation proposal network; Attentional graph convolutional network;
D O I
10.1007/978-3-030-01246-5_41
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel scene graph generation model called Graph R-CNN, that is both effective and efficient at detecting objects and their relations in images. Our model contains a Relation Proposal Network (RePN) that efficiently deals with the quadratic number of potential relations between objects in an image. We also propose an attentional Graph Convolutional Network (aGCN) that effectively captures contextual information between objects and relations. Finally, we introduce a new evaluation metric that is more holistic and realistic than existing metrics. We report state-of-the-art performance on scene graph generation as evaluated using both existing and our proposed metrics.
引用
收藏
页码:690 / 706
页数:17
相关论文
共 50 条
  • [1] Structured Sparse R-CNN for Direct Scene Graph Generation
    Teng, Yao
    Wang, Limin
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 19415 - 19424
  • [2] Multimodal graph inference network for scene graph generation
    Jingwen Duan
    Weidong Min
    Deyu Lin
    Jianfeng Xu
    Xin Xiong
    Applied Intelligence, 2021, 51 : 8768 - 8783
  • [3] Multimodal graph inference network for scene graph generation
    Duan, Jingwen
    Min, Weidong
    Lin, Deyu
    Xu, Jianfeng
    Xiong, Xin
    APPLIED INTELLIGENCE, 2021, 51 (12) : 8768 - 8783
  • [4] Unconditional Scene Graph Generation
    Garg, Sarthak
    Dhamo, Helisa
    Farshad, Azade
    Musatian, Sabrina
    Navab, Nassir
    Tombari, Federico
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 16342 - 16351
  • [5] Iterative Scene Graph Generation
    Khandelwal, Siddhesh
    Sigal, Leonid
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [6] Panoptic Scene Graph Generation
    Yang, Jingkang
    Ang, Yi Zhe
    Guo, Zujin
    Zhou, Kaiyang
    Zhang, Wayne
    Liu, Ziwei
    COMPUTER VISION - ECCV 2022, PT XXVII, 2022, 13687 : 178 - 196
  • [7] Graph R-CNN: Towards Accurate 3D Object Detection with Semantic-Decorated Local Graph
    Yang, Honghui
    Liu, Zili
    Wu, Xiaopei
    Wang, Wenxiao
    Qian, Wei
    He, Xiaofei
    Cai, Deng
    COMPUTER VISION, ECCV 2022, PT VIII, 2022, 13668 : 662 - 679
  • [8] Beware of Overcorrection: Scene-induced Commonsense Graph for Scene Graph Generation
    Chen, Lianggangxu
    Lu, Jiale
    Song, Youqi
    Wang, Changbo
    He, Gaoqi
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 2888 - 2897
  • [9] Relation R-CNN: A Graph Based Relation-Aware Network for Object Detection
    Chen, Shengjia
    Li, Zhixin
    Tang, Zhenjun
    IEEE SIGNAL PROCESSING LETTERS, 2020, 27 : 1680 - 1684
  • [10] Improved Faster R-CNN for the Detection Method of Industrial Control Logic Graph Recognition
    Wu, Shilin
    Wang, Yan
    Yang, Huayu
    Wang, Pingfeng
    FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2022, 10