IS-GGT: Iterative Scene Graph Generation with Generative Transformers

被引:6
|
作者
Kundu, Sanjoy [1 ]
Aakur, Sathyanarayanan N. [1 ]
机构
[1] Oklahoma State Univ, Dept Comp Sci, Stillwater, OK 74078 USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/CVPR52729.2023.00609
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene graphs provide a rich, structured representation of a scene by encoding the entities (objects) and their spatial relationships in a graphical format. This representation has proven useful in several tasks, such as question answering, captioning, and even object detection, to name a few. Current approaches take a generation-by-classification approach where the scene graph is generated through labeling of all possible edges between objects in a scene, which adds computational overhead to the approach. This work introduces a generative transformer-based approach to generating scene graphs beyond link prediction. Using two transformer-based components, we first sample a possible scene graph structure from detected objects and their visual features. We then perform predicate classification on the sampled edges to generate the final scene graph. This approach allows us to efficiently generate scene graphs from images with minimal inference overhead. Extensive experiments on the Visual Genome dataset demonstrate the efficiency of the proposed approach. Without bells and whistles, we obtain, on average, 20.7% mean recall (mR@100) across different settings for scene graph generation (SGG), outperforming state-of-the-art SGG approaches while offering competitive performance to unbiased SGG approaches.
引用
收藏
页码:6292 / 6301
页数:10
相关论文
共 50 条
  • [21] Unbiased Scene Graph Generation in Videos
    Nag, Sayak
    Min, Kyle
    Tripathi, Subama
    Roy-Chowdhury, Amit K.
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 22803 - 22813
  • [22] Fully Convolutional Scene Graph Generation
    Liu, Hengyue
    Yan, Ning
    Mortazavi, Masood
    Bhanu, Bir
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 11541 - 11551
  • [23] Review on scene graph generation methods
    Monesh, S.
    Senthilkumar, N. C.
    MULTIAGENT AND GRID SYSTEMS, 2024, 20 (02) : 129 - 160
  • [24] Adversarial Attacks on Scene Graph Generation
    Zhao, Mengnan
    Zhang, Lihe
    Wang, Wei
    Kong, Yuqiu
    Yin, Baocai
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 3210 - 3225
  • [25] Panoptic Video Scene Graph Generation
    Yang, Jingkang
    Peng, Wenxuan
    Li, Xiangtai
    Guo, Zujin
    Chen, Liangyu
    Li, Bo
    Ma, Zheng
    Zhou, Kaiyang
    Zhang, Wayne
    Loy, Chen Change
    Liu, Ziwei
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 18675 - 18685
  • [26] Scene Graph Generation With Hierarchical Context
    Ren, Guanghui
    Ren, Lejian
    Liao, Yue
    Liu, Si
    Li, Bo
    Han, Jizhong
    Yan, Shuicheng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (02) : 909 - 915
  • [27] Heterogeneous Learning for Scene Graph Generation
    He, Yunqing
    Ren, Tongwei
    Tang, Jinhui
    Wu, Gangshan
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 4704 - 4713
  • [28] Exploring and Exploiting the Hierarchical Structure of a Scene for Scene Graph Generation
    Kurosawa, Ikuto
    Kobayashi, Tetsunori
    Hayashi, Yoshihiko
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 1422 - 1429
  • [29] VARSCENE: A Deep Generative Model for Realistic Scene Graph Synthesis
    Verma, Tathagat
    De, Abir
    Agrawal, Yateesh
    Vinay, Vishwa
    Chakrabarti, Soumen
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [30] Graph-LSTM with Global Attribute for Scene Graph Generation
    Shao, Tong
    Wu, Dapeng Oliver
    Journal of Physics: Conference Series, 2021, 2003 (01)