Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data

Cited by: 71
Authors
Yang, Lihe [1]
Kang, Bingyi [2]
Huang, Zilong [2]
Xu, Xiaogang [3, 4]
Feng, Jiashi [2]
Zhao, Hengshuang [1]
Affiliations
[1] HKU, Hong Kong, Peoples R China
[2] Tiktok, Beijing 9, Peoples R China
[3] CUHK, Hong Kong, Peoples R China
[4] ZJU, Hangzhou, Peoples R China
Source
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2024
Funding
National Natural Science Foundation of China
Keywords
DOI
10.1109/CVPR52733.2024.00987
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
This work presents Depth Anything, a highly practical solution for robust monocular depth estimation. Without pursuing novel technical modules, we aim to build a simple yet powerful foundation model dealing with any images under any circumstances. To this end, we scale up the dataset by designing a data engine to collect and automatically annotate large-scale unlabeled data (~62M), which significantly enlarges the data coverage and thus is able to reduce the generalization error. We investigate two simple yet effective strategies that make data scaling-up promising. First, a more challenging optimization target is created by leveraging data augmentation tools. It compels the model to actively seek extra visual knowledge and acquire robust representations. Second, an auxiliary supervision is developed to enforce the model to inherit rich semantic priors from pre-trained encoders. We evaluate its zero-shot capabilities extensively, including six public datasets and randomly captured photos. It demonstrates impressive generalization ability (Figure 1). Further, through fine-tuning it with metric depth information from NYUv2 and KITTI, new SOTAs are set. Our better depth model also results in a better depth-conditioned ControlNet. Our models are released here.
Pages: 10371 - 10381
Page count: 11