Adaptive Surface Normal Constraint for Geometric Estimation From Monocular Images

Citations: 3

Authors:

Long, Xiaoxiao [1]

Zheng, Yuhang [2]

Zheng, Yupeng [2]

Tian, Beiwen [2]

Lin, Cheng [3]

Liu, Lingjie [4]

Zhao, Hao [2]

Zhou, Guyue [2]

Wang, Wenping [5]

Affiliations:

[1] Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China

[2] Tsinghua Univ, Inst AI Ind Res, Beijing 100190, Peoples R China

[3] Tencent Games, Shenzhen 518054, Peoples R China

[4] Univ Penn, Dept Comp & Informat Sci, Philadelphia, PA 19104 USA

[5] Texas A&M Univ, Dept Comp Sci & Engn, College Stn, TX 77843 USA

Source:

IEEE Transactions on Pattern Analysis and Machine Intelligence | 2024, Vol. 46, No. 9

Keywords:

Estimation; Three-dimensional displays; Geometry; Task analysis; Image edge detection; Shape; Neural networks; Monocular depth and normal estimation; 3D from single images; geometric context; adaptive surface normal

DOI:

10.1109/TPAMI.2024.3381710

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

We introduce a novel approach to learn geometries such as depth and surface normal from images while incorporating geometric context. The difficulty of reliably capturing geometric context in existing methods impedes their ability to accurately enforce the consistency between the different geometric properties, thereby leading to a bottleneck of geometric estimation quality. We therefore propose the Adaptive Surface Normal (ASN) constraint, a simple yet efficient method. Our approach extracts geometric context that encodes the geometric variations present in the input image and correlates depth estimation with geometric constraints. By dynamically determining reliable local geometry from randomly sampled candidates, we establish a surface normal constraint, where the validity of these candidates is evaluated using the geometric context. Furthermore, our normal estimation leverages the geometric context to prioritize regions that exhibit significant geometric variations, which makes the predicted normals accurately capture intricate and detailed geometric information. Through the integration of geometric context, our method unifies depth and surface normal estimations within a cohesive framework, which enables the generation of high-quality 3D geometry from images. We validate the superiority of our approach over state-of-the-art methods through extensive evaluations and comparisons on diverse indoor and outdoor datasets, showcasing its efficiency and robustness.
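The idea of building a surface normal from randomly sampled candidates, with each candidate judged by geometric context, can be illustrated with a small sketch. This is not the authors' implementation: the abstract gives no formulas, so everything below is an assumption made for illustration. In particular, `adaptive_normal` is a hypothetical helper, `context` is a plain per-pixel scalar standing in for the learned geometric context, the exponential weighting is an arbitrary choice, and a pinhole camera with intrinsics `fx`, `fy`, `cx`, `cy` is assumed.

```python
import math
import random


def backproject(u, v, z, fx, fy, cx, cy):
    """Lift pixel (u, v) with depth z to a 3D camera-space point (pinhole model)."""
    return ((u - cx) * z / fx, (v - cy) * z / fy, z)


def sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])


def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def normalize(v, eps=1e-12):
    """Return the unit vector, or None if v is numerically zero."""
    norm = math.sqrt(v[0] ** 2 + v[1] ** 2 + v[2] ** 2)
    return None if norm < eps else (v[0] / norm, v[1] / norm, v[2] / norm)


def adaptive_normal(depth, context, u, v, fx, fy, cx, cy,
                    radius=2, n_triplets=16, seed=0):
    """Hypothetical sketch: context-weighted normal at pixel (u, v).

    depth and context are row-major 2D lists indexed as [v][u].
    """
    rng = random.Random(seed)
    h, w = len(depth), len(depth[0])
    p0 = backproject(u, v, depth[v][u], fx, fy, cx, cy)
    c0 = context[v][u]

    def sample():
        # Random neighbour inside the window, clamped to the image bounds.
        su = min(max(u + rng.randint(-radius, radius), 0), w - 1)
        sv = min(max(v + rng.randint(-radius, radius), 0), h - 1)
        return su, sv

    acc = [0.0, 0.0, 0.0]
    for _ in range(n_triplets):
        (ua, va), (ub, vb) = sample(), sample()
        pa = backproject(ua, va, depth[va][ua], fx, fy, cx, cy)
        pb = backproject(ub, vb, depth[vb][ub], fx, fy, cx, cy)
        # Candidate normal of the plane through the target and two samples.
        n = normalize(cross(sub(pa, p0), sub(pb, p0)))
        if n is None:
            continue  # degenerate triplet (coincident or collinear points)
        if n[2] > 0:
            n = (-n[0], -n[1], -n[2])  # orient toward the camera (z forward)
        # Assumed weighting: candidates whose context differs from the
        # target's (e.g. samples across an edge) contribute less.
        diff = abs(context[va][ua] - c0) + abs(context[vb][ub] - c0)
        weight = math.exp(-diff)
        for i in range(3):
            acc[i] += weight * n[i]
    return normalize(acc)
```

In a training setup, a normal recovered this way from the predicted depth would be compared against a ground-truth normal to form a loss term. The sketch covers only the recovery step, under the assumptions listed above.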


Pages: 6263-6279

Page count: 17
