ClearPose: Large-scale Transparent Object Dataset and Benchmark

Cited by: 15
Authors
Chen, Xiaotong [1]
Zhang, Huijie [1]
Yu, Zeren [1]
Opipari, Anthony [1]
Jenkins, Odest Chadwicke [1]
Affiliations
[1] Univ Michigan, Ann Arbor, MI 48109 USA
Keywords
Transparent objects; Depth completion; Pose estimation; Dataset and benchmark
DOI
10.1007/978-3-031-20074-8_22
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Transparent objects are ubiquitous in household settings and pose distinct challenges for visual sensing and perception systems. The optical properties of transparent objects make conventional 3D sensors alone unreliable for object depth and pose estimation. These challenges are highlighted by the shortage of large-scale RGB-Depth datasets focusing on transparent objects in real-world settings. In this work, we contribute a large-scale real-world RGB-Depth transparent object dataset named ClearPose to serve as a benchmark dataset for segmentation, scene-level depth completion, and object-centric pose estimation tasks. The ClearPose dataset contains over 350K labeled real-world RGB-Depth frames and 5M instance annotations covering 63 household objects. The dataset includes object categories commonly used in daily life under various lighting and occluding conditions, as well as challenging test scenarios such as occlusion by opaque or translucent objects, non-planar orientations, presence of liquids, etc. We benchmark several state-of-the-art depth completion and object pose estimation deep neural networks on ClearPose. The dataset and benchmarking source code are available at https://github.com/opipari/ClearPose.
Pages: 381-396
Page count: 16