Context Diffusion: In-Context Aware Image Generation

被引:0
|
作者
Najdenkoska, Ivona [1 ,2 ]
Sinha, Animesh [1 ]
Dubey, Abhimanyu [1 ]
Mahajan, Dhruv [1 ]
Ramanathan, Vignesh [1 ]
Radenovic, Filip [1 ]
机构
[1] Meta GenAI, Menlo Pk, CA 94025 USA
[2] Univ Amsterdam, Amsterdam, Netherlands
来源
关键词
Image generation; Diffusion models; In-context learning;
D O I
10.1007/978-3-031-72980-5_22
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose Context Diffusion, a diffusion-based framework that enables image generation models to learn from visual examples presented in context. Recent work tackles such in-context learning for image generation, where a query image is provided alongside context examples and text prompts. However, the quality and context fidelity of the generated images deteriorate when the prompt is not present, demonstrating that these models cannot truly learn from the visual context. To address this, we propose a novel framework that separates the encoding of the visual context and the preservation of the desired image layout. This results in the ability to learn from the visual context and prompts, but also from either of them. Furthermore, we enable our model to handle few-shot settings, to effectively address diverse in-context learning scenarios. Our experiments and human evaluation demonstrate that Context Diffusion excels in both in-domain and out-of-domain tasks, resulting in an overall enhancement in image quality and context fidelity compared to counterpart models.
引用
收藏
页码:375 / 391
页数:17
相关论文
共 50 条
  • [41] LaiDA: Linguistics-Aware In-Context Learning with Data Augmentation for Metaphor Components Identification
    Liu, Hongde
    He, Chenyuan
    Meng, Feiyang
    Niu, Changyong
    Jia, Yuxiang
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, PT V, NLPCC 2024, 2025, 15363 : 287 - 299
  • [42] Binary Context Tree Based Middleware for Next Generation Context Aware Networks
    Bilen, Tugce
    Canberk, Berk
    2015 3RD INTERNATIONAL CONFERENCE ON FUTURE INTERNET OF THINGS AND CLOUD (FICLOUD) AND INTERNATIONAL CONFERENCE ON OPEN AND BIG (OBD), 2015, : 93 - 99
  • [43] In-context Examples Selection for Machine Translation
    Agrawal, Sweta
    Zhou, Chunting
    Lewis, Mike
    Zettlemoyer, Luke
    Ghazvininejad, Marjan
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 8857 - 8873
  • [44] IN-CONTEXT OBSERVATION OF ACTION LEVELS IN HANDBALL
    Lasierra, G.
    Carreras, D.
    Montoya, M.
    Planas, A.
    REVISTA INTERNACIONAL DE MEDICINA Y CIENCIAS DE LA ACTIVIDAD FISICA Y DEL DEPORTE, 2020, 20 (79): : 435 - 451
  • [45] Complementary Explanations for Effective In-Context Learning
    Ye, Xi
    Iyer, Srinivasan
    Celikyilmaz, Asli
    Stoyanov, Ves
    Durrett, Greg
    Pasunuru, Ramakanth
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, : 4469 - 4484
  • [46] Dissecting In-Context Learning of Translations in GPTs
    Raunak, Vikas
    Awadalla, Hany Hassan
    Menezes, Arul
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 866 - 872
  • [47] On the adaptation of in-context learners for system identification
    Piga, Dario
    Pura, Filippo
    Forgione, Marco
    IFAC PAPERSONLINE, 2024, 58 (15): : 277 - 282
  • [48] Unified Demonstration Retriever for In-Context Learning
    Li, Xiaonan
    Lv, Kai
    Yan, Hang
    Lin, Tianyang
    Wei, Zhu
    Ni, Yuan
    Xie, Guotong
    Wang, Xiaoling
    Qiu, Xipeng
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 4644 - 4668
  • [49] Skeleton-in-Context: Unified Skeleton Sequence Modeling with In-Context Learning
    Wang, Xinshun
    Fang, Zhongbin
    Li, Xia
    Li, Xiangtai
    Chen, Chen
    Liu, Mengyuan
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 2436 - 2446
  • [50] Learning To Retrieve Prompts for In-Context Learning
    Rubin, Ohad
    Herzig, Jonathan
    Berant, Jonathan
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 2655 - 2671