Stacked Hybrid-Attention and Group Collaborative Learning for Unbiased Scene Graph Generation

Cited by: 58
Authors
Dong, Xingning [1]
Gan, Tian [1]
Song, Xuemeng [1]
Wu, Jianlong [1]
Cheng, Yuan [2]
Nie, Liqiang [1]
Affiliations
[1] Shandong Univ, Jinan, Peoples R China
[2] Ant Grp, Hangzhou, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
COMPRESSION;
DOI
10.1109/CVPR52688.2022.01882
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Scene Graph Generation (SGG), which generally follows a regular encoder-decoder pipeline, aims to first encode the visual contents within the given image and then parse them into a compact summary graph. Existing SGG approaches generally not only neglect the insufficient modality fusion between vision and language, but also fail to provide informative predicates due to biased relationship predictions, leaving SGG far from practical. To this end, we first present a novel Stacked Hybrid-Attention network, which facilitates intra-modal refinement as well as inter-modal interaction, to serve as the encoder. We then devise an innovative Group Collaborative Learning strategy to optimize the decoder. In particular, based on the observation that the recognition capability of a single classifier is limited on an extremely unbalanced dataset, we first deploy a group of classifiers that are expert in distinguishing different subsets of classes, and then cooperatively optimize them from two aspects to promote unbiased SGG. Experiments conducted on the VG and GQA datasets demonstrate that we not only establish a new state of the art on the unbiased metric, but also nearly double the performance compared with two baselines. Our code is available at https://github.com/dongxingning/SHA-GCL-for-SGG.
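The abstract does not spell out how the "different subsets of classes" are formed, so the following is only an illustrative sketch and not the authors' implementation. It assumes one plausible scheme: predicate classes are sorted by training frequency and cut into groups whenever the most frequent class in a group would exceed a chosen ratio over the next class, and each classifier in the group then covers a progressively larger label space. The function names, the `max_ratio` parameter, and the sample counts are all hypothetical.

```python
def group_classes_by_frequency(counts, max_ratio=4.0):
    """Split classes into frequency-ordered groups (hypothetical scheme).

    Within each group, the most frequent class has at most `max_ratio`
    times as many samples as the least frequent one.
    """
    # Sort by descending count; break ties by name for determinism.
    ordered = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    groups = []
    current = []
    head_count = None  # count of the first (most frequent) class in `current`
    for name, n in ordered:
        # Start a new group once the imbalance inside the group gets too large.
        if current and head_count > max_ratio * n:
            groups.append(current)
            current = []
        if not current:
            head_count = n
        current.append(name)
    if current:
        groups.append(current)
    return groups


def cumulative_label_spaces(groups):
    """Label space for classifier k = union of groups 0..k (an assumption)."""
    spaces = []
    seen = []
    for group in groups:
        seen = seen + group
        spaces.append(list(seen))
    return spaces


# Made-up predicate counts, for illustration only.
counts = {
    "on": 1000, "has": 600, "wearing": 300,
    "riding": 120, "parked on": 40, "painted on": 10,
}
groups = group_classes_by_frequency(counts, max_ratio=4.0)
label_spaces = cumulative_label_spaces(groups)

for k, space in enumerate(label_spaces):
    print(f"classifier {k}: {space}")
```

With these toy counts the head classes ("on", "has", "wearing") land in the first group and the rarest class ends up alone in the last one, so later classifiers see a wider and more tail-heavy label space. How the classifiers are then "cooperatively optimized from two aspects" is not described in the abstract and is left out of the sketch.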
Pages: 19405-19414
Page count: 10