IS-GGT: Iterative Scene Graph Generation with Generative Transformers

被引:6
|
作者
Kundu, Sanjoy [1 ]
Aakur, Sathyanarayanan N. [1 ]
机构
[1] Oklahoma State Univ, Dept Comp Sci, Stillwater, OK 74078 USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/CVPR52729.2023.00609
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene graphs provide a rich, structured representation of a scene by encoding the entities (objects) and their spatial relationships in a graphical format. This representation has proven useful in several tasks, such as question answering, captioning, and even object detection, to name a few. Current approaches take a generation-by-classification approach where the scene graph is generated through labeling of all possible edges between objects in a scene, which adds computational overhead to the approach. This work introduces a generative transformer-based approach to generating scene graphs beyond link prediction. Using two transformer-based components, we first sample a possible scene graph structure from detected objects and their visual features. We then perform predicate classification on the sampled edges to generate the final scene graph. This approach allows us to efficiently generate scene graphs from images with minimal inference overhead. Extensive experiments on the Visual Genome dataset demonstrate the efficiency of the proposed approach. Without bells and whistles, we obtain, on average, 20.7% mean recall (mR@100) across different settings for scene graph generation (SGG), outperforming state-of-the-art SGG approaches while offering competitive performance to unbiased SGG approaches.
引用
收藏
页码:6292 / 6301
页数:10
相关论文
共 50 条
  • [1] Iterative Scene Graph Generation
    Khandelwal, Siddhesh
    Sigal, Leonid
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [2] Scene Graph Generation by Iterative Message Passing
    Xu, Danfei
    Zhu, Yuke
    Choy, Christopher B.
    Li Fei-Fei
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3097 - 3106
  • [3] Relation Detection with Transformers for Panoptic Scene Graph Generation
    Liu, Chang
    Yan, Wenchao
    Chen, Shilin
    Huang, Liqun
    Huang, Xiaotao
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT IV, 2025, 15034 : 223 - 238
  • [4] Composite Relationship Fields with Transformers for Scene Graph Generation
    Adaimi, George
    Mizrahi, David
    Alahi, Alexandre
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 52 - 64
  • [5] Deep Generative Probabilistic Graph Neural Networks for Scene Graph Generation
    Khademi, Mahmoud
    Schulte, Oliver
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11237 - 11245
  • [6] Compositional Transformers for Scene Generation
    Hudson, Drew A.
    Zitnick, C. Lawrence
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [7] Context-aware Scene Graph Generation with Seq2Seq Transformers
    Lu, Yichao
    Rai, Himanshu
    Chang, Jason
    Knyazev, Boris
    Yu, Guangwei
    Shekhar, Shashank
    Taylor, Graham W.
    Volkovs, Maksims
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 15911 - 15921
  • [8] Generative Transformers for Design Concept Generation
    Zhu, Qihao
    Luo, Jianxi
    JOURNAL OF COMPUTING AND INFORMATION SCIENCE IN ENGINEERING, 2023, 23 (04)
  • [9] SceneFormer: Indoor Scene Generation with Transformers
    Wang, Xinpeng
    Yeshwanth, Chandan
    Niesner, Matthias
    2021 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2021), 2021, : 106 - 115
  • [10] Generative Compositional Augmentations for Scene Graph Prediction
    Knyazev, Boris
    de Vries, Harm
    Cangea, Catalina
    Taylor, Graham W.
    Courville, Aaron
    Belilovsky, Eugene
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 15807 - 15817