Adaptive Surface Normal Constraint for Geometric Estimation From Monocular Images

Cited by: 3
Authors
Long, Xiaoxiao [1]
Zheng, Yuhang [2]
Zheng, Yupeng [2]
Tian, Beiwen [2]
Lin, Cheng [3]
Liu, Lingjie [4]
Zhao, Hao [2]
Zhou, Guyue [2]
Wang, Wenping [5]
Affiliations
[1] Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[2] Tsinghua Univ, Inst AI Ind Res, Beijing 100190, Peoples R China
[3] Tencent Games, Shenzhen 518054, Peoples R China
[4] Univ Penn, Dept Comp & Informat Sci, Philadelphia, PA 19104 USA
[5] Texas A&M Univ, Dept Comp Sci & Engn, College Stn, TX 77843 USA
Keywords
Estimation; Three-dimensional displays; Geometry; Task analysis; Image edge detection; Shape; Neural networks; Monocular depth and normal estimation; 3D from single images; geometric context; adaptive surface normal;
DOI
10.1109/TPAMI.2024.3381710
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We introduce a novel approach to learn geometries such as depth and surface normal from images while incorporating geometric context. The difficulty of reliably capturing geometric context in existing methods impedes their ability to accurately enforce the consistency between the different geometric properties, thereby leading to a bottleneck of geometric estimation quality. We therefore propose the Adaptive Surface Normal (ASN) constraint, a simple yet efficient method. Our approach extracts geometric context that encodes the geometric variations present in the input image and correlates depth estimation with geometric constraints. By dynamically determining reliable local geometry from randomly sampled candidates, we establish a surface normal constraint, where the validity of these candidates is evaluated using the geometric context. Furthermore, our normal estimation leverages the geometric context to prioritize regions that exhibit significant geometric variations, which makes the predicted normals accurately capture intricate and detailed geometric information. Through the integration of geometric context, our method unifies depth and surface normal estimations within a cohesive framework, which enables the generation of high-quality 3D geometry from images. We validate the superiority of our approach over state-of-the-art methods through extensive evaluations and comparisons on diverse indoor and outdoor datasets, showcasing its efficiency and robustness.
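The abstract describes building a surface normal constraint from randomly sampled local candidates whose validity is judged by geometric context. The sketch below illustrates that general idea only and is not the authors' implementation. It assumes a pinhole camera, treats the geometric context as a per-pixel feature map, and weights each candidate normal by cosine similarity between features. The function names, the weighting rule, and the camera-facing orientation convention are all hypothetical choices made for illustration.

```python
import numpy as np


def backproject(depth, fx, fy, cx, cy):
    """Lift a depth map (H, W) to camera-space points (H, W, 3), pinhole model."""
    h, w = depth.shape
    u, v = np.meshgrid(np.arange(w), np.arange(h))
    x = (u - cx) * depth / fx
    y = (v - cy) * depth / fy
    return np.stack([x, y, depth], axis=-1)


def cosine_weight(f0, f1, eps=1e-8):
    """Cosine similarity of two feature vectors, clipped to be non-negative."""
    sim = float(np.dot(f0, f1)) / (np.linalg.norm(f0) * np.linalg.norm(f1) + eps)
    return max(sim, 0.0)


def context_weighted_normal(points, context, row, col,
                            radius=2, num_pairs=16, seed=0):
    """Estimate the normal at (row, col) from randomly sampled neighbour pairs.

    Each pair of neighbours, together with the target point, spans a candidate
    plane. Its normal is weighted by how similar the neighbours' context
    features are to the target's (an assumed stand-in for candidate validity).
    """
    rng = np.random.default_rng(seed)
    h, w, _ = points.shape
    p0, f0 = points[row, col], context[row, col]
    normal_sum = np.zeros(3)
    for _ in range(num_pairs):
        # Two random pixel offsets inside the local window, clamped to the image.
        offsets = rng.integers(-radius, radius + 1, size=(2, 2))
        rows = np.clip(row + offsets[:, 0], 0, h - 1)
        cols = np.clip(col + offsets[:, 1], 0, w - 1)
        a = points[rows[0], cols[0]] - p0
        b = points[rows[1], cols[1]] - p0
        n = np.cross(a, b)
        length = np.linalg.norm(n)
        if length < 1e-8:
            continue  # degenerate sample (coincident or collinear points)
        n = n / length
        if n[2] > 0:
            n = -n  # assumed convention: normals face the camera (negative z)
        weight = (cosine_weight(f0, context[rows[0], cols[0]])
                  * cosine_weight(f0, context[rows[1], cols[1]]))
        normal_sum += weight * n
    total = np.linalg.norm(normal_sum)
    if total < 1e-8:
        return np.array([0.0, 0.0, -1.0])  # fallback when no candidate is usable
    return normal_sum / total


# Usage on a synthetic fronto-parallel plane with a uniform context map.
depth = np.full((9, 9), 2.0)
points = backproject(depth, fx=50.0, fy=50.0, cx=4.0, cy=4.0)
context = np.ones((9, 9, 4))
normal = context_weighted_normal(points, context, row=4, col=4)
```

In a learning setting, a normal recovered this way from predicted depth could be compared against a reference normal to form a loss. How the paper actually samples, scores, and aggregates candidates is not stated in this record.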
Pages: 6263-6279
Page count: 17
Related papers
50 items in total
  • [21] Articulated joint estimation from motion using two monocular images
    Zhang, XY
    Liu, YC
    Huang, TS
    VISION, MODELING, AND VISUALIZATION 2003, 2003, : 63 - 70
  • [22] Improvised Filter Design for Depth Estimation from Single Monocular Images
    Das, Aniruddha
    Ramnani, Vikas
    Bhavsar, Jignesh
    Mitra, Suman K.
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PROCEEDINGS, 2009, 5909 : 333 - 338
  • [23] DEPTH ESTIMATION FROM A SEQUENCE OF MONOCULAR IMAGES WITH KNOWN CAMERA MOTION
    ZHANG, HQ
    SUDHAKAR, R
    SHIEH, JY
    ROBOTICS AND AUTONOMOUS SYSTEMS, 1994, 13 (02) : 87 - 95
  • [24] Depth Estimation from Monocular Infrared Images for Autonomous Flight of Drones
    Shimada, Tomoyasu
    Nishikawa, Hiroki
    Kong, Xiangbo
    Tomiyama, Hiroyuki
    2022 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2022,
  • [25] Articulated joint estimation from motion using two monocular images
    Zhang, XY
    Liu, YC
    Huang, TS
    PATTERN RECOGNITION LETTERS, 2004, 25 (10) : 1097 - 1106
  • [26] Acquisition of 3-D Surface Shape of Human Body from Monocular Images without Pose Estimation
    Peng, En
    Li, Ling
    2012 12TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS & VISION (ICARCV), 2012, : 1136 - 1141
  • [27] Geometric-photometric approach to monocular shape estimation
    Torreao, JRA
    IMAGE AND VISION COMPUTING, 2003, 21 (12) : 1045 - 1061
  • [28] Adaptive Neighborhood Selection for Real-Time Surface Normal Estimation from Organized Point Cloud Data Using Integral Images
    Holzer, S.
    Rusu, R. B.
    Dixon, M.
    Gedikli, S.
    Navab, N.
    2012 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2012, : 2684 - 2689
  • [29] Monocular contextual constraint for stereo matching with adaptive weights assignment
    Zhang, Chenghao
    Meng, Gaofeng
    Su, Bing
    Xiang, Shiming
    Pan, Chunhong
    IMAGE AND VISION COMPUTING, 2022, 121
  • [30] GeoNet: Geometric Neural Network for Joint Depth and Surface Normal Estimation
    Qi, Xiaojuan
    Liao, Renjie
    Liu, Zhengzhe
    Urtasun, Raquel
    Jia, Jiaya
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 283 - 291