Context Diffusion: In-Context Aware Image Generation

被引：0

作者：

Najdenkoska, Ivona ^{[1
,2
]}

Sinha, Animesh ^{[1
]}

Dubey, Abhimanyu ^{[1
]}

Mahajan, Dhruv ^{[1
]}

Ramanathan, Vignesh ^{[1
]}

Radenovic, Filip ^{[1
]}

机构：

[1] Meta GenAI, Menlo Pk, CA 94025 USA

[2] Univ Amsterdam, Amsterdam, Netherlands

来源：

COMPUTER VISION - ECCV 2024, PT LXXVII | 2024年 / 15135卷

关键词：

Image generation; Diffusion models; In-context learning;

D O I：

10.1007/978-3-031-72980-5_22

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose Context Diffusion, a diffusion-based framework that enables image generation models to learn from visual examples presented in context. Recent work tackles such in-context learning for image generation, where a query image is provided alongside context examples and text prompts. However, the quality and context fidelity of the generated images deteriorate when the prompt is not present, demonstrating that these models cannot truly learn from the visual context. To address this, we propose a novel framework that separates the encoding of the visual context and the preservation of the desired image layout. This results in the ability to learn from the visual context and prompts, but also from either of them. Furthermore, we enable our model to handle few-shot settings, to effectively address diverse in-context learning scenarios. Our experiments and human evaluation demonstrate that Context Diffusion excels in both in-domain and out-of-domain tasks, resulting in an overall enhancement in image quality and context fidelity compared to counterpart models.

引用

页码：375 / 391

页数：17

共 50 条

[31] Is Mamba Capable of In-Context Learning?
Grazzi, Riccardo
Siems, Julien
Schrodi, Simon
Brox, Thomas
Hutter, Frank
INTERNATIONAL CONFERENCE ON AUTOMATED MACHINE LEARNING, 2024, 256
[32] Concept-aware Data Construction Improves In-context Learning of Language Models
Stefanik, Michal
Kadlcik, Marek
Sojka, Petr
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 12335 - 12352
[33] Self-Adaptive In-Context Learning: An Information Compression Perspective for In-Context Example Selection and Ordering
Wu, Zhiyong
Wang, Yaoxiang
Ye, Jiacheng
Kong, Lingpeng
PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 1423 - 1436
[34] Towards Zero-Shot Persona Dialogue Generation with In-Context Learning
Xu, Xinchao
Lei, Zeyang
Wu, Wenquan
Niu, Zheng-Yu
Wu, Hua
Wang, Haifeng
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, : 1387 - 1398
[35] CONTEXT-AWARE CANDIDATES FOR IMAGE CROPPING
Lian, Tianpei
Cao, Zhiguo
Xian, Ke
Pan, Zhiyu
Zhong, Weicai
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 1479 - 1483
[36] Context-aware transformer for image captioning
Yang, Xin
Wang, Ying
Chen, Haishun
Li, Jie
Huang, Tingting
NEUROCOMPUTING, 2023, 549
[37] Regional context-aware image annotation
Qiu, Z.-Y. (zyqiucsu@gmail.com), 1600, Science Press (37):
[38] Geostatistics for Context-Aware Image Classification
Codevilla, Felipe
Botelho, Silvia S. C.
Duarte, Nelson
Purkis, Samuel
Shihavuddin, A. S. M.
Garcia, Rafael
Gracias, Nuno
COMPUTER VISION SYSTEMS (ICVS 2015), 2015, 9163 : 228 - 239
[39] Towards Context-Aware Behaviour Generation
de Sousa Duarte, Paulo Artur
Barreto, Felipe Mota
de Almada Gomes, Francisco Anderson
de Carvalho, Windson Viana
Mota Trinta, Fernando Antonio
30TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, VOLS I AND II, 2015, : 596 - 598
[40] Automated generation of context-aware tests
Wang, Zhimin
Elbaum, Sebastian
Rosenblum, David S.
ICSE 2007: 29TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, PROCEEDINGS, 2007, : 406 - +

← 1 2 3 4 5 →