Solving 3D Inverse Problems using Pre-trained 2D Diffusion Models

Cited by: 42

Authors:

Chung, Hyungjin [1,2]

Ryu, Dohoon [1]

McCann, Michael T. [2]

Klasky, Marc L. [2]

Ye, Jong Chul [1]

Affiliations:

[1] Korea Advanced Institute of Science and Technology (KAIST), Daejeon, South Korea

[2] Los Alamos National Laboratory, Los Alamos, NM 87545 USA

Source:

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2023

Funding:

National Research Foundation of Singapore;

Keywords:

CONVOLUTIONAL NEURAL-NETWORK; COMPUTED-TOMOGRAPHY; RECONSTRUCTION; ALGORITHM;

DOI:

10.1109/CVPR52729.2023.02159

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Diffusion models have emerged as the new state-of-the-art generative model with high quality samples, with intriguing properties such as mode coverage and high flexibility. They have also been shown to be effective inverse problem solvers, acting as the prior of the distribution, while the information of the forward model can be granted at the sampling stage. Nonetheless, as the generative process remains in the same high dimensional (i.e. identical to data dimension) space, the models have not been extended to 3D inverse problems due to the extremely high memory and computational cost. In this paper, we combine the ideas from the conventional model-based iterative reconstruction with the modern diffusion models, which leads to a highly effective method for solving 3D medical image reconstruction tasks such as sparse-view tomography, limited angle tomography, compressed sensing MRI from pre-trained 2D diffusion models. In essence, we propose to augment the 2D diffusion prior with a model-based prior in the remaining direction at test time, such that one can achieve coherent reconstructions across all dimensions. Our method can be run in a single commodity GPU, and establishes the new state-of-the-art, showing that the proposed method can perform reconstructions of high fidelity and accuracy even in the most extreme cases (e.g. 2-view 3D tomography). We further reveal that the generalization capacity of the proposed method is surprisingly high, and can be used to reconstruct volumes that are entirely different from the training dataset. Code available: https://github.com/HJ-harry/DiffusionMBIR

Pages: 22542-22551

Number of pages: 10
