Context Diffusion: In-Context Aware Image Generation

被引:0
|
作者
Najdenkoska, Ivona [1 ,2 ]
Sinha, Animesh [1 ]
Dubey, Abhimanyu [1 ]
Mahajan, Dhruv [1 ]
Ramanathan, Vignesh [1 ]
Radenovic, Filip [1 ]
机构
[1] Meta GenAI, Menlo Pk, CA 94025 USA
[2] Univ Amsterdam, Amsterdam, Netherlands
来源
关键词
Image generation; Diffusion models; In-context learning;
D O I
10.1007/978-3-031-72980-5_22
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose Context Diffusion, a diffusion-based framework that enables image generation models to learn from visual examples presented in context. Recent work tackles such in-context learning for image generation, where a query image is provided alongside context examples and text prompts. However, the quality and context fidelity of the generated images deteriorate when the prompt is not present, demonstrating that these models cannot truly learn from the visual context. To address this, we propose a novel framework that separates the encoding of the visual context and the preservation of the desired image layout. This results in the ability to learn from the visual context and prompts, but also from either of them. Furthermore, we enable our model to handle few-shot settings, to effectively address diverse in-context learning scenarios. Our experiments and human evaluation demonstrate that Context Diffusion excels in both in-domain and out-of-domain tasks, resulting in an overall enhancement in image quality and context fidelity compared to counterpart models.
引用
收藏
页码:375 / 391
页数:17
相关论文
共 50 条
  • [31] Is Mamba Capable of In-Context Learning?
    Grazzi, Riccardo
    Siems, Julien
    Schrodi, Simon
    Brox, Thomas
    Hutter, Frank
    INTERNATIONAL CONFERENCE ON AUTOMATED MACHINE LEARNING, 2024, 256
  • [32] Concept-aware Data Construction Improves In-context Learning of Language Models
    Stefanik, Michal
    Kadlcik, Marek
    Sojka, Petr
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 12335 - 12352
  • [33] Self-Adaptive In-Context Learning: An Information Compression Perspective for In-Context Example Selection and Ordering
    Wu, Zhiyong
    Wang, Yaoxiang
    Ye, Jiacheng
    Kong, Lingpeng
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 1423 - 1436
  • [34] Towards Zero-Shot Persona Dialogue Generation with In-Context Learning
    Xu, Xinchao
    Lei, Zeyang
    Wu, Wenquan
    Niu, Zheng-Yu
    Wu, Hua
    Wang, Haifeng
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, : 1387 - 1398
  • [35] CONTEXT-AWARE CANDIDATES FOR IMAGE CROPPING
    Lian, Tianpei
    Cao, Zhiguo
    Xian, Ke
    Pan, Zhiyu
    Zhong, Weicai
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 1479 - 1483
  • [36] Context-aware transformer for image captioning
    Yang, Xin
    Wang, Ying
    Chen, Haishun
    Li, Jie
    Huang, Tingting
    NEUROCOMPUTING, 2023, 549
  • [37] Regional context-aware image annotation
    Qiu, Z.-Y. (zyqiucsu@gmail.com), 1600, Science Press (37):
  • [38] Geostatistics for Context-Aware Image Classification
    Codevilla, Felipe
    Botelho, Silvia S. C.
    Duarte, Nelson
    Purkis, Samuel
    Shihavuddin, A. S. M.
    Garcia, Rafael
    Gracias, Nuno
    COMPUTER VISION SYSTEMS (ICVS 2015), 2015, 9163 : 228 - 239
  • [39] Towards Context-Aware Behaviour Generation
    de Sousa Duarte, Paulo Artur
    Barreto, Felipe Mota
    de Almada Gomes, Francisco Anderson
    de Carvalho, Windson Viana
    Mota Trinta, Fernando Antonio
    30TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, VOLS I AND II, 2015, : 596 - 598
  • [40] Automated generation of context-aware tests
    Wang, Zhimin
    Elbaum, Sebastian
    Rosenblum, David S.
    ICSE 2007: 29TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, PROCEEDINGS, 2007, : 406 - +