Context Diffusion: In-Context Aware Image Generation

Cited by: 0

Authors:

Najdenkoska, Ivona [1,2]

Sinha, Animesh [1]

Dubey, Abhimanyu [1]

Mahajan, Dhruv [1]

Ramanathan, Vignesh [1]

Radenovic, Filip [1]

Affiliations:

[1] Meta GenAI, Menlo Pk, CA 94025 USA

[2] Univ Amsterdam, Amsterdam, Netherlands

Source:

COMPUTER VISION - ECCV 2024, PT LXXVII | 2024 / Vol. 15135

Keywords:

Image generation; Diffusion models; In-context learning

DOI:

10.1007/978-3-031-72980-5_22

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

We propose Context Diffusion, a diffusion-based framework that enables image generation models to learn from visual examples presented in context. Recent work tackles such in-context learning for image generation, where a query image is provided alongside context examples and text prompts. However, the quality and context fidelity of the generated images deteriorate when the prompt is not present, demonstrating that these models cannot truly learn from the visual context. To address this, we propose a novel framework that separates the encoding of the visual context and the preservation of the desired image layout. This results in the ability to learn from the visual context and prompts, but also from either of them. Furthermore, we enable our model to handle few-shot settings, to effectively address diverse in-context learning scenarios. Our experiments and human evaluation demonstrate that Context Diffusion excels in both in-domain and out-of-domain tasks, resulting in an overall enhancement in image quality and context fidelity compared to counterpart models.


Pages: 375-391

Page count: 17

