Context Diffusion: In-Context Aware Image Generation

被引：0

作者：

Najdenkoska, Ivona ^{[1
,2
]}

Sinha, Animesh ^{[1
]}

Dubey, Abhimanyu ^{[1
]}

Mahajan, Dhruv ^{[1
]}

Ramanathan, Vignesh ^{[1
]}

Radenovic, Filip ^{[1
]}

机构：

[1] Meta GenAI, Menlo Pk, CA 94025 USA

[2] Univ Amsterdam, Amsterdam, Netherlands

来源：

COMPUTER VISION - ECCV 2024, PT LXXVII | 2024年 / 15135卷

关键词：

Image generation; Diffusion models; In-context learning;

D O I：

10.1007/978-3-031-72980-5_22

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose Context Diffusion, a diffusion-based framework that enables image generation models to learn from visual examples presented in context. Recent work tackles such in-context learning for image generation, where a query image is provided alongside context examples and text prompts. However, the quality and context fidelity of the generated images deteriorate when the prompt is not present, demonstrating that these models cannot truly learn from the visual context. To address this, we propose a novel framework that separates the encoding of the visual context and the preservation of the desired image layout. This results in the ability to learn from the visual context and prompts, but also from either of them. Furthermore, we enable our model to handle few-shot settings, to effectively address diverse in-context learning scenarios. Our experiments and human evaluation demonstrate that Context Diffusion excels in both in-domain and out-of-domain tasks, resulting in an overall enhancement in image quality and context fidelity compared to counterpart models.

引用

页码：375 / 391

页数：17

共 50 条

[21] Generative Calibration for In-context Learning
Jiang, Zhongtao
Zhang, Yuanzhe
Liu, Cao
Zhao, Jun
Liu, Kang
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 2312 - 2333
[22] Distinguishability Calibration to In-Context Learning
Li, Hongjing
Yan, Hanqi
Li, Yanran
Qian, Li
He, Yulan
Gui, Lin
17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 1385 - 1397
[23] In-Context Annotations for Refinding and Sharing
Kawase, Ricardo
Herder, Eelco
Papadakis, George
Nejdl, Wolfgang
WEB INFORMATION SYSTEMS AND TECHNOLOGIES, 2011, 75 : 85 - 100
[24] NICE: To Optimize In-Context Examples or Not?
Srivastava, Pragya
Golechha, Satvik
Deshpande, Amit
Sharma, Amit
PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 5494 - 5510
[25] Global Context with Discrete Diffusion in Vector Quantised Modelling for Image Generation
Hu, Minghui
Wang, Yujie
Cham, Tat-Jen
Yang, Jianfei
Suganthan, P. N.
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11492 - 11501
[26] Stacking VAE and GAN for Context-aware Text-to-Image Generation
Zhang, Chenrui
Peng, Yuxin
2018 IEEE FOURTH INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM), 2018,
[27] CorGAN: Context aware Recurrent Generative Adversarial Network for Medical Image Generation
Qiao, Zhi
Qian, Zhen
Tang, Hui
Gong, Guanzhong
Yin, Yong
Huang, Chao
Fan, Wei
2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 1100 - 1103
[28] Enhancing image caption generation through context-aware attention mechanism
Bhuiyan, Ahatesham
Hossain, Eftekhar
Hoque, Mohammed Moshiul
Dewan, M. Ali Akber
HELIYON, 2024, 10 (17)
[29] Towards In-context Scene Understanding
Balazevic, Ivana
Steiner, David
Parthasarathy, Nikhil
Arandjelovic, Relja
Henaff, Olivier J.
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[30] Requirements Satisfiability with In-Context Learning
Santos, Sarah
Breaux, Travis
Norton, Thomas
Haghighi, Sara
Ghanavati, Sepideh
32ND IEEE INTERNATIONAL REQUIREMENTS ENGINEERING CONFERENCE, RE 2024, 2024, : 168 - 179

← 1 2 3 4 5 →