Context Diffusion: In-Context Aware Image Generation

Citations: 0
Authors
Najdenkoska, Ivona [1, 2]
Sinha, Animesh [1]
Dubey, Abhimanyu [1]
Mahajan, Dhruv [1]
Ramanathan, Vignesh [1]
Radenovic, Filip [1]
Affiliations
[1] Meta GenAI, Menlo Park, CA 94025, USA
[2] University of Amsterdam, Amsterdam, Netherlands
Source
Keywords
Image generation; Diffusion models; In-context learning
DOI
10.1007/978-3-031-72980-5_22
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We propose Context Diffusion, a diffusion-based framework that enables image generation models to learn from visual examples presented in context. Recent work tackles such in-context learning for image generation, where a query image is provided alongside context examples and text prompts. However, the quality and context fidelity of the generated images deteriorate when the prompt is not present, demonstrating that these models cannot truly learn from the visual context. To address this, we propose a novel framework that separates the encoding of the visual context and the preservation of the desired image layout. This results in the ability to learn from the visual context and prompts, but also from either of them. Furthermore, we enable our model to handle few-shot settings, to effectively address diverse in-context learning scenarios. Our experiments and human evaluation demonstrate that Context Diffusion excels in both in-domain and out-of-domain tasks, resulting in an overall enhancement in image quality and context fidelity compared to counterpart models.
Pages: 375-391
Page count: 17