Context Diffusion: In-Context Aware Image Generation

被引:0
|
作者
Najdenkoska, Ivona [1 ,2 ]
Sinha, Animesh [1 ]
Dubey, Abhimanyu [1 ]
Mahajan, Dhruv [1 ]
Ramanathan, Vignesh [1 ]
Radenovic, Filip [1 ]
机构
[1] Meta GenAI, Menlo Pk, CA 94025 USA
[2] Univ Amsterdam, Amsterdam, Netherlands
来源
关键词
Image generation; Diffusion models; In-context learning;
D O I
10.1007/978-3-031-72980-5_22
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose Context Diffusion, a diffusion-based framework that enables image generation models to learn from visual examples presented in context. Recent work tackles such in-context learning for image generation, where a query image is provided alongside context examples and text prompts. However, the quality and context fidelity of the generated images deteriorate when the prompt is not present, demonstrating that these models cannot truly learn from the visual context. To address this, we propose a novel framework that separates the encoding of the visual context and the preservation of the desired image layout. This results in the ability to learn from the visual context and prompts, but also from either of them. Furthermore, we enable our model to handle few-shot settings, to effectively address diverse in-context learning scenarios. Our experiments and human evaluation demonstrate that Context Diffusion excels in both in-domain and out-of-domain tasks, resulting in an overall enhancement in image quality and context fidelity compared to counterpart models.
引用
收藏
页码:375 / 391
页数:17
相关论文
共 50 条
  • [1] In-Context Learning Unlocked for Diffusion Models
    Wang, Zhendong
    Jiang, Yifan
    Lu, Yadong
    Shen, Yelong
    He, Pengcheng
    Chen, Weizhu
    Wang, Zhangyang
    Zhou, Mingyuan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [2] In-Context In-Context Learning with Transformer Neural Processes
    Ashman, Matthew
    Diaconu, Cristiana
    Weller, Adrian
    Turner, Richard E.
    SYMPOSIUM ON ADVANCES IN APPROXIMATE BAYESIAN INFERENCE, 2024, 253 : 1 - 29
  • [3] Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model
    Gu, Zheng
    Yang, Shiyuan
    Liao, Jing
    Huo, Jing
    Gao, Yang
    ACM TRANSACTIONS ON GRAPHICS, 2024, 43 (04):
  • [4] Exploring Diverse In-Context Configurations for Image Captioning
    Yang, Xu
    Wu, Yongliang
    Yang, Mingzhuo
    Chen, Haokun
    Geng, Xin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [5] Transformers as Statisticians: Provable In-Context Learning with In-Context Algorithm Selection
    Bai, Yu
    Chen, Fan
    Wang, Huan
    Xiong, Caiming
    Mei, Song
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [6] Exploring In-Context Learning for Knowledge Grounded Dialog Generation
    Chen, Qinyu
    Wu, Wenhao
    Li, Sujian
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 10071 - 10081
  • [7] Tyche: Stochastic In-Context Learning for Medical Image Segmentation
    Rakic, Marianne
    Wong, Hallee E.
    Ortiz, Jose Javier Gonzalez
    Cimini, Beth A.
    Guttag, John, V
    Dalca, Adrian, V
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 11159 - 11173
  • [8] Boosting Your Context by Dual Similarity Checkup for In-Context Learning Medical Image Segmentation
    Gao, Jun
    Lao, Qicheng
    Kang, Qingbo
    Liu, Paul
    Du, Chenlin
    Li, Kang
    Zhang, Le
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2025, 44 (01) : 310 - 319
  • [9] The Learnability of In-Context Learning
    Wies, Noam
    Levine, Yoav
    Shashua, Amnon
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [10] A glance at in-context learning
    Wu, Yongliang
    Yang, Xu
    FRONTIERS OF COMPUTER SCIENCE, 2024, 18 (05)