Fine-Grained Scene Graph Generation with Data Transfer

被引:37
|
作者
Zhang, Ao [1 ,2 ]
Yao, Yuan [3 ,4 ]
Chen, Qianyu [3 ,4 ]
Ji, Wei [1 ,2 ]
Liu, Zhiyuan [3 ,4 ]
Sun, Maosong [3 ,4 ]
Chua, Tat-Seng [1 ,2 ]
机构
[1] Sea NExT Joint Lab, Singapore, Singapore
[2] Natl Univ Singapore, Sch Comp, Singapore, Singapore
[3] Tsinghua Univ, Inst Artificial Intelligence, Dept Comp Sci & Technol, Beijing, Peoples R China
[4] Beijing Natl Res Ctr Informat Sci & Technol, Beijing, Peoples R China
来源
关键词
Scene graph generation; Plug-and-play; Large-scale;
D O I
10.1007/978-3-031-19812-0_24
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene graph generation (SGG) is designed to extract (subject, predicate, object) triplets in images. Recent works have made a steady progress on SGG, and provide useful tools for high-level vision and language understanding. However, due to the data distribution problems including long-tail distribution and semantic ambiguity, the predictions of current SGG models tend to collapse to several frequent but uninformative predicates (e.g., on, at), which limits practical application of these models in downstream tasks. To deal with the problems above, we propose a novel Internal and External Data Transfer (IETrans) method, which can be applied in a plug-and-play fashion and expanded to large SGG with 1,807 predicate classes. Our IETrans tries to relieve the data distribution problem by automatically creating an enhanced dataset that provides more sufficient and coherent annotations for all predicates. By applying our proposed method, a Neural Motif model doubles the macro performance for informative SGG. The code and data are publicly available at https://github.com/waxnkw/IETrans-SGG.pytorch.
引用
收藏
页码:409 / 424
页数:16
相关论文
共 50 条
  • [21] Fine-grained Pseudo Labels for Scene Text Recognition
    Li, Xiaoyu
    Chen, Xiaoxue
    Huang, Zuming
    Xie, Lele
    Chen, Jingdong
    Yang, Ming
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5786 - 5795
  • [22] Fine-Grained Language Identification in Scene Text Images
    Li, Yongrui
    Wu, Shilian
    Yu, Jun
    Wang, Zengfu
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 4573 - 4581
  • [23] Knowledge Mining with Scene Text for Fine-Grained Recognition
    Wang, Hao
    Liao, Junchao
    Cheng, Tianheng
    Gao, Zewen
    Liu, Hao
    Ren, Bo
    Bai, Xiang
    Liu, Wenyu
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4614 - 4623
  • [24] Semantic Clustering for Robust Fine-Grained Scene Recognition
    George, Marian
    Dixit, Mandar
    Zogg, Gabor
    Vasconcelos, Nuno
    COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 : 783 - 798
  • [25] A fine-grained approach to scene text script identification
    Gomez, Lluis
    Karatzas, Dimosthenis
    PROCEEDINGS OF 12TH IAPR WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, (DAS 2016), 2016, : 192 - 197
  • [26] Fine-Grained Crowdsourcing for Fine-Grained Recognition
    Jia Deng
    Krause, Jonathan
    Li Fei-Fei
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 580 - 587
  • [27] Fine-Grained Object Detection Using Transfer Learning and Data Augmentation
    Dalal, Rahul
    Moh, Teng-Sheng
    2018 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM), 2018, : 893 - 896
  • [28] DATA ON CONSOLIDATION OF FINE-GRAINED SEDIMENTS
    CHILINGA.GV
    RIEKE, HH
    JOURNAL OF SEDIMENTARY PETROLOGY, 1968, 38 (03): : 811 - &
  • [29] Discovering Fine-Grained Semantics in Knowledge Graph Relations
    Jain, Nitisha
    Krestel, Ralf
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 822 - 831
  • [30] A Fine-Grained Structural Partitioning Approach to Graph Compression
    Pitois, Francois
    Seba, Hamida
    Haddad, Mohammed
    BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY, DAWAK 2023, 2023, 14148 : 392 - 397