Integrating Geometrical Context for Semantic Labeling of Indoor Scenes using RGBD Images
被引:16
|
作者:
Khan, Salman H.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Western Australia, Sch CSSE, 35 Stirling Highway, Crawley, WA 6009, AustraliaUniv Western Australia, Sch CSSE, 35 Stirling Highway, Crawley, WA 6009, Australia
Khan, Salman H.
[1
]
Bennamoun, Mohammed
论文数: 0引用数: 0
h-index: 0
机构:
Univ Western Australia, Sch CSSE, 35 Stirling Highway, Crawley, WA 6009, AustraliaUniv Western Australia, Sch CSSE, 35 Stirling Highway, Crawley, WA 6009, Australia
Bennamoun, Mohammed
[1
]
Sohel, Ferdous
论文数: 0引用数: 0
h-index: 0
机构:
Univ Western Australia, Sch CSSE, 35 Stirling Highway, Crawley, WA 6009, AustraliaUniv Western Australia, Sch CSSE, 35 Stirling Highway, Crawley, WA 6009, Australia
Sohel, Ferdous
[1
]
Togneri, Roberto
论文数: 0引用数: 0
h-index: 0
机构:
Univ Western Australia, Sch EECE, 35 Stirling Highway, Crawley, WA 6009, AustraliaUniv Western Australia, Sch CSSE, 35 Stirling Highway, Crawley, WA 6009, Australia
Togneri, Roberto
[2
]
Naseem, Imran
论文数: 0引用数: 0
h-index: 0
机构:
Karachi Inst Econ & Technol, Dept Engn, Karachi 75190, PakistanUniv Western Australia, Sch CSSE, 35 Stirling Highway, Crawley, WA 6009, Australia
Naseem, Imran
[3
]
机构:
[1] Univ Western Australia, Sch CSSE, 35 Stirling Highway, Crawley, WA 6009, Australia
[2] Univ Western Australia, Sch EECE, 35 Stirling Highway, Crawley, WA 6009, Australia
Inexpensive structured light sensors can capture rich information from indoor scenes, and scene labeling problems provide a compelling opportunity to make use of this information. In this paper we present a novel conditional random field (CRF) model to effectively utilize depth information for semantic labeling of indoor scenes. At the core of the model, we propose a novel and efficient plane detection algorithm which is robust to erroneous depthmaps. Our CRF formulation defines local, pairwise and higher order interactions between image pixels. At the local level, we propose a novel scheme to combine energies derived from appearance, depth and geometry-based cues. The proposed local energy also encodes the location of each object class by considering the approximate geometry of a scene. For the pairwise interactions, we learn a boundary measure which defines the spatial discontinuity of object classes across an image. To model higher-order interactions, the proposed energy treats smooth surfaces as cliques and encourages all the pixels on a surface to take the same label. We show that the proposed higher-order energies can be decomposed into pairwise submodular energies and efficient inference can be made using the graph-cuts algorithm. We follow a systematic approach which uses structured learning to fine-tune the model parameters. We rigorously test our approach on SUN3D and both versions of the NYU-Depth database. Experimental results show that our work achieves superior performance to state-of-the-art scene labeling techniques.
机构:
Peking Univ, Shenzhen Grad Sch, Natl Key Lab Gen Artificial Intelligence, Beijing, Peoples R ChinaPeking Univ, Shenzhen Grad Sch, Natl Key Lab Gen Artificial Intelligence, Beijing, Peoples R China
Zhang, Wanruo
Liu, Hong
论文数: 0引用数: 0
h-index: 0
机构:
Peking Univ, Shenzhen Grad Sch, Natl Key Lab Gen Artificial Intelligence, Beijing, Peoples R ChinaPeking Univ, Shenzhen Grad Sch, Natl Key Lab Gen Artificial Intelligence, Beijing, Peoples R China
Liu, Hong
Wu, Jianbing
论文数: 0引用数: 0
h-index: 0
机构:
Peking Univ, Shenzhen Grad Sch, Natl Key Lab Gen Artificial Intelligence, Beijing, Peoples R ChinaPeking Univ, Shenzhen Grad Sch, Natl Key Lab Gen Artificial Intelligence, Beijing, Peoples R China
Wu, Jianbing
Li, Yidi
论文数: 0引用数: 0
h-index: 0
机构:
Taiyuan Univ Technol, Coll Comp Sci & Technol, Taiyuan, Peoples R ChinaPeking Univ, Shenzhen Grad Sch, Natl Key Lab Gen Artificial Intelligence, Beijing, Peoples R China