Context Diffusion: In-Context Aware Image Generation

被引：0

作者：

Najdenkoska, Ivona ^{[1
,2
]}

Sinha, Animesh ^{[1
]}

Dubey, Abhimanyu ^{[1
]}

Mahajan, Dhruv ^{[1
]}

Ramanathan, Vignesh ^{[1
]}

Radenovic, Filip ^{[1
]}

机构：

[1] Meta GenAI, Menlo Pk, CA 94025 USA

[2] Univ Amsterdam, Amsterdam, Netherlands

来源：

COMPUTER VISION - ECCV 2024, PT LXXVII | 2024年 / 15135卷

关键词：

Image generation; Diffusion models; In-context learning;

D O I：

10.1007/978-3-031-72980-5_22

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose Context Diffusion, a diffusion-based framework that enables image generation models to learn from visual examples presented in context. Recent work tackles such in-context learning for image generation, where a query image is provided alongside context examples and text prompts. However, the quality and context fidelity of the generated images deteriorate when the prompt is not present, demonstrating that these models cannot truly learn from the visual context. To address this, we propose a novel framework that separates the encoding of the visual context and the preservation of the desired image layout. This results in the ability to learn from the visual context and prompts, but also from either of them. Furthermore, we enable our model to handle few-shot settings, to effectively address diverse in-context learning scenarios. Our experiments and human evaluation demonstrate that Context Diffusion excels in both in-domain and out-of-domain tasks, resulting in an overall enhancement in image quality and context fidelity compared to counterpart models.

引用

页码：375 / 391

页数：17

共 50 条

[1] In-Context Learning Unlocked for Diffusion Models
Wang, Zhendong
Jiang, Yifan
Lu, Yadong
Shen, Yelong
He, Pengcheng
Chen, Weizhu
Wang, Zhangyang
Zhou, Mingyuan
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[2] In-Context In-Context Learning with Transformer Neural Processes
Ashman, Matthew
Diaconu, Cristiana
Weller, Adrian
Turner, Richard E.
SYMPOSIUM ON ADVANCES IN APPROXIMATE BAYESIAN INFERENCE, 2024, 253 : 1 - 29
[3] Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model
Gu, Zheng
Yang, Shiyuan
Liao, Jing
Huo, Jing
Gao, Yang
ACM TRANSACTIONS ON GRAPHICS, 2024, 43 (04):
[4] Exploring Diverse In-Context Configurations for Image Captioning
Yang, Xu
Wu, Yongliang
Yang, Mingzhuo
Chen, Haokun
Geng, Xin
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[5] Transformers as Statisticians: Provable In-Context Learning with In-Context Algorithm Selection
Bai, Yu
Chen, Fan
Wang, Huan
Xiong, Caiming
Mei, Song
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[6] Exploring In-Context Learning for Knowledge Grounded Dialog Generation
Chen, Qinyu
Wu, Wenhao
Li, Sujian
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 10071 - 10081
[7] Tyche: Stochastic In-Context Learning for Medical Image Segmentation
Rakic, Marianne
Wong, Hallee E.
Ortiz, Jose Javier Gonzalez
Cimini, Beth A.
Guttag, John, V
Dalca, Adrian, V
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 11159 - 11173
[8] Boosting Your Context by Dual Similarity Checkup for In-Context Learning Medical Image Segmentation
Gao, Jun
Lao, Qicheng
Kang, Qingbo
Liu, Paul
Du, Chenlin
Li, Kang
Zhang, Le
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2025, 44 (01) : 310 - 319
[9] The Learnability of In-Context Learning
Wies, Noam
Levine, Yoav
Shashua, Amnon
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[10] A glance at in-context learning
Wu, Yongliang
Yang, Xu
FRONTIERS OF COMPUTER SCIENCE, 2024, 18 (05)

← 1 2 3 4 5 →