Solving 3D Inverse Problems using Pre-trained 2D Diffusion Models

被引:42
|
作者
Chung, Hyungjin [1 ,2 ]
Ryu, Dohoon [1 ]
Mccann, Michael T. [2 ]
Klasky, Marc L. [2 ]
Ye, Jong Chul [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Daejeon, South Korea
[2] Los Alamos Natl Lab, Los Alamos, NM 87545 USA
基金
新加坡国家研究基金会;
关键词
CONVOLUTIONAL NEURAL-NETWORK; COMPUTED-TOMOGRAPHY; RECONSTRUCTION; ALGORITHM;
D O I
10.1109/CVPR52729.2023.02159
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Diffusion models have emerged as the new state-of-the-art generative model with high quality samples, with intriguing properties such as mode coverage and high flexibility. They have also been shown to be effective inverse problem solvers, acting as the prior of the distribution, while the information of the forward model can be granted at the sampling stage. Nonetheless, as the generative process remains in the same high dimensional (i.e. identical to data dimension) space, the models have not been extended to 3D inverse problems due to the extremely high memory and computational cost. In this paper, we combine the ideas from the conventional model-based iterative reconstruction with the modern diffusion models, which leads to a highly effective method for solving 3D medical image reconstruction tasks such as sparse-view tomography, limited angle tomography, compressed sensing MRI from pre-trained 2D diffusion models. In essence, we propose to augment the 2D diffusion prior with a model-based prior in the remaining direction at test time, such that one can achieve coherent reconstructions across all dimensions. Our method can be run in a single commodity GPU, and establishes the new state-of-the-art, showing that the proposed method can perform reconstructions of high fidelity and accuracy even in the most extreme cases (e.g. 2-view 3D tomography). We further reveal that the generalization capacity of the proposed method is surprisingly high, and can be used to reconstruct volumes that are entirely different from the training dataset. Code available: https://github.com/HJ-harry/DiffusionMBIR
引用
收藏
页码:22542 / 22551
页数:10
相关论文
共 50 条
  • [21] Generating Images of Rare Concepts Using Pre-trained Diffusion Models
    Samuel, Dvir
    Ben-Ari, Rami
    Raviv, Simon
    Darshan, Nir
    Chechik, Gal
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 5, 2024, : 4695 - 4703
  • [22] Fine-tuning the hyperparameters of pre-trained models for solving multiclass classification problems
    Kaibassova, D.
    Nurtay, M.
    Tau, A.
    Kissina, M.
    COMPUTER OPTICS, 2022, 46 (06) : 971 - 979
  • [23] A spatial local method for solving 2D and 3D advection-diffusion equations
    Tunc, Huseyin
    Sari, Murat
    ENGINEERING COMPUTATIONS, 2023, 40 (9/10) : 2068 - 2089
  • [24] Algorithms for Numerical Solving of 2D Anomalous Diffusion Problems
    Abrashina-Zhadaeva, Natalia
    Romanova, Natalie
    MATHEMATICAL MODELLING AND ANALYSIS, 2012, 17 (03) : 447 - 455
  • [25] Surface-aware Mesh Texture Synthesis with Pre-trained 2D CNNs
    Kovacs, Aron Samuel
    Hermosilla, Pedro
    Raidou, Renata G.
    COMPUTER GRAPHICS FORUM, 2024, 43 (02)
  • [26] Point-PEFT: Parameter-Efficient Fine-Tuning for 3D Pre-trained Models
    Tang, Yiwen
    Zhang, Ray
    Guo, Zoey
    Ma, Xianzheng
    Zhao, Bin
    Wang, Zhigang
    Wang, Dong
    Li, Xuelong
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 6, 2024, : 5171 - 5179
  • [27] 3D Human Pose Machine with a ToF Sensor using Pre-trained Convolutional Neural Networks
    Kim, Jong-Sung
    Kwon, Seung-Joon
    2019 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC): ICT CONVERGENCE LEADING THE AUTONOMOUS FUTURE, 2019, : 1018 - 1020
  • [28] Hybrid Video Diffusion Models with 2D Triplane and 3D Wavelet Representation
    Kim, Kihong
    Lee, Haneol
    Park, Jihye
    Kim, Seyeon
    Lee, Kwanghee
    Kim, Seungryong
    Yoo, Jaejun
    COMPUTER VISION - ECCV 2024, PT LII, 2025, 15110 : 148 - 165
  • [29] GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models
    Yi, Taoran
    Fang, Jiemin
    Wang, Junjie
    Wu, Guanjun
    Xie, Lingxi
    Zhang, Xiaopeng
    Liu, Wenyu
    Tian, Qi
    Wang, Xinggang
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 6796 - 6807
  • [30] Solving the 2D and 3D nonlinear inverse source problems of elliptic type partial differential equations by a homogenization function method
    Liu, Chein-Shan
    Qiu, Lin
    NUMERICAL METHODS FOR PARTIAL DIFFERENTIAL EQUATIONS, 2023, 39 (02) : 1287 - 1298