SADM: Sequence-Aware Diffusion Model for Longitudinal Medical Image Generation

被引:12
|
作者
Yoon, Jee Seok [1 ,3 ]
Zhang, Chenghao [2 ]
Suk, Heung-Il [1 ]
Guo, Jia [2 ]
Li, Xiaoxiao [3 ]
机构
[1] Korea Univ, Seoul 02841, South Korea
[2] Columbia Univ, New York, NY 10027 USA
[3] Univ British Columbia, Vancouver, BC V6T 1Z4, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Diffusion model; Sequential image generation; Autoregressive conditioning;
D O I
10.1007/978-3-031-34048-2_30
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human organs constantly undergo anatomical changes due to a complex mix of short-term (e.g., heartbeat) and long-term (e.g., aging) factors. Evidently, prior knowledge of these factors will be beneficial when modeling their future state, i.e., via image generation. However, most of the medical image generation tasks only rely on the input from a single image, thus ignoring the sequential dependency even when longitudinal data is available. Sequence-aware deep generative models, where model input is a sequence of ordered and timestamped images, are still underexplored in the medical imaging domain that is featured by several unique challenges: 1) Sequences with various lengths; 2) Missing data or frame, and 3) High dimensionality. To this end, we propose a sequence-aware diffusion model (SADM) for the generation of longitudinal medical images. Recently, diffusion models have shown promising results in high-fidelity image generation. Our method extends this new technique by introducing a sequence-aware transformer as the conditional module in a diffusion model. The novel design enables learning longitudinal dependency even with missing data during training and allows autoregressive generation of a sequence of images during inference. Our extensive experiments on 3D longitudinal medical images demonstrate the effectiveness of SADM compared with baselines and alternative methods. The code is available at https://github.com/ubc-tea/SADM-Longitudinal-Medical-Image-Generation.
引用
收藏
页码:388 / 400
页数:13
相关论文
共 50 条
  • [1] Sequence-Aware Factored Mixed Similarity Model for Next-Item Recommendation
    Zhong, Liulan
    Lin, Jing
    Pan, Weike
    Ming, Zhong
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP 2020), 2020, : 181 - 188
  • [2] Aggregate-aware model with bidirectional edge generation for medical image segmentation
    Ma, Shiqiang
    Li, Xuejian
    Tang, Jijun
    Guo, Fei
    APPLIED SOFT COMPUTING, 2024, 163
  • [3] Context Diffusion: In-Context Aware Image Generation
    Najdenkoska, Ivona
    Sinha, Animesh
    Dubey, Abhimanyu
    Mahajan, Dhruv
    Ramanathan, Vignesh
    Radenovic, Filip
    COMPUTER VISION - ECCV 2024, PT LXXVII, 2024, 15135 : 375 - 391
  • [4] Diffusion Deformable Model for 4D Temporal Medical Image Generation
    Kim, Boah
    Ye, Jong Chul
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT I, 2022, 13431 : 539 - 548
  • [5] GH-DDM: the generalized hybrid denoising diffusion model for medical image generation
    Sicheng Zhang
    Jin Liu
    Bo Hu
    Zhendong Mao
    Multimedia Systems, 2023, 29 : 1335 - 1345
  • [6] GH-DDM: the generalized hybrid denoising diffusion model for medical image generation
    Zhang, Sicheng
    Liu, Jin
    Hu, Bo
    Mao, Zhendong
    MULTIMEDIA SYSTEMS, 2023, 29 (03) : 1335 - 1345
  • [7] Medical Image Generation based on Latent Diffusion Models
    Song, Wenbo
    Jiang, Yan
    Fang, Yin
    Cao, Xinyu
    Wu, Peiyan
    Xing, Hanshuo
    Wu, Xinglong
    2023 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE INNOVATION, ICAII 2023, 2023, : 89 - 93
  • [8] Structure-Aware Procedural Text Generation From an Image Sequence
    Nishimura, Taichi
    Hashimoto, Atsushi
    Ushiku, Yoshitaka
    Kameko, Hirotaka
    Yamakata, Yoko
    Mori, Shinsuke
    IEEE ACCESS, 2021, 9 : 2125 - 2141
  • [9] Density-Aware Diffusion Model for Efficient Image Dehazing
    Zhang, Ling
    Bai, Wenxu
    Xiao, Chunxia
    COMPUTER GRAPHICS FORUM, 2024, 43 (07)
  • [10] MoVideo: Motion-Aware Video Generation with Diffusion Model
    Liang, Jingyun
    Fang, Yuchen
    Zhang, Kai
    Timofte, Radu
    Van Gool, Luc
    Ranjan, Rakesh
    COMPUTER VISION-ECCV 2024, PT XLIV, 2025, 15102 : 56 - 74