ClearPose: Large-scale Transparent Object Dataset and Benchmark

被引:15
|
作者
Chen, Xiaotong [1 ]
Zhang, Huijie [1 ]
Yu, Zeren [1 ]
Opipari, Anthony [1 ]
Jenkins, Odest Chadwicke [1 ]
机构
[1] Univ Michigan, Ann Arbor, MI 48109 USA
来源
关键词
Transparent objects; Depth completion; Pose estimation; Dataset and benchmark;
D O I
10.1007/978-3-031-20074-8_22
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Transparent objects are ubiquitous in household settings and pose distinct challenges for visual sensing and perception systems. The optical properties of transparent objects leave conventional 3D sensors alone unreliable for object depth and pose estimation. These challenges are highlighted by the shortage of large-scale RGB-Depth datasets focusing on transparent objects in real-world settings. In this work, we contribute a large-scale real-world RGB-Depth transparent object dataset named ClearPose to serve as a benchmark dataset for segmentation, scene-level depth completion and object-centric pose estimation tasks. The ClearPose dataset contains over 350K labeled real-world RGB-Depth frames and 5M instance annotations covering 63 household objects. The dataset includes object categories commonly used in daily life under various lighting and occluding conditions as well as challenging test scenarios such as cases of occlusion by opaque or translucent objects, non-planar orientations, presence of liquids, etc. We benchmark several state-of-the-art depth completion and object pose estimation deep neural networks on ClearPose. The dataset and benchmarking source code is available at https://githuh.com/opipari/ClearPose.
引用
收藏
页码:381 / 396
页数:16
相关论文
共 50 条
  • [41] A Large-Scale Benchmark Dataset for Anomaly Detection and Rare Event Classification for Audio Forensics
    Abbasi, Ahmed
    Javed, Abdul Rehman Rehman
    Yasin, Amanullah
    Jalil, Zunera
    Kryvinska, Natalia
    Tariq, Usman
    IEEE ACCESS, 2022, 10 : 38885 - 38894
  • [42] SKVOS: Sketch-Based Video Object Segmentation with a Large-Scale Benchmark
    Yang, Ruolin
    Li, Da
    Hu, Conghui
    Zhang, Honggang
    APPLIED SCIENCES-BASEL, 2025, 15 (04):
  • [43] BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training
    Cai, Likun
    Zhang, Zhi
    Zhu, Yi
    Zhang, Li
    Li, Mu
    Xue, Xiangyang
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 4776 - 4786
  • [44] A benchmark approach and dataset for large-scale lane mapping from MLS point clouds
    Mi, Xiaoxin
    Dong, Zhen
    Cao, Zhipeng
    Yang, Bisheng
    Cao, Zhen
    Zheng, Chao
    Stoter, Jantien
    Nan, Liangliang
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2024, 133
  • [45] MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition
    Guo, Yandong
    Zhang, Lei
    Hu, Yuxiao
    He, Xiaodong
    Gao, Jianfeng
    COMPUTER VISION - ECCV 2016, PT III, 2016, 9907 : 87 - 102
  • [46] LaSOT: A High-quality Benchmark for Large-scale Single Object Tracking
    Fan, Heng
    Lin, Liting
    Yang, Fan
    Chu, Peng
    Deng, Ge
    Yu, Sijia
    Bai, Hexin
    Xu, Yong
    Liao, Chunyuan
    Ling, Haibin
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5369 - 5378
  • [47] LaSOT: A High-quality Large-scale Single Object Tracking Benchmark
    Heng Fan
    Hexin Bai
    Liting Lin
    Fan Yang
    Peng Chu
    Ge Deng
    Sijia Yu
    Mingzhen Harshit
    Juehuan Huang
    Yong Liu
    Chunyuan Xu
    Lin Liao
    Haibin Yuan
    International Journal of Computer Vision, 2021, 129 : 439 - 461
  • [48] A Platform for Electrical Capacitance Tomography Large-scale Benchmark Dataset Generating and Image Reconstruction
    Zheng, Jin
    Peng, Lihui
    2017 IEEE INTERNATIONAL CONFERENCE ON IMAGING SYSTEMS AND TECHNIQUES (IST), 2017, : 138 - 143
  • [49] FishNet: A Large-scale Dataset and Benchmark for Fish Recognition, Detection, and Functional Trait Prediction
    Khan, Faizan Farooq
    Li, Xiang
    Temple, Andrew J.
    Elhoseiny, Mohamed
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 20439 - 20449
  • [50] EMS: A Large-Scale Eye Movement Dataset, Benchmark, and New Model for Schizophrenia Recognition
    Song, Yingjie
    Liu, Zhi
    Li, Gongyang
    Xie, Jiawei
    Wu, Qiang
    Zeng, Dan
    Xu, Lihua
    Zhang, Tianhong
    Wang, Jijun
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,