The Challenges of Continuous Self-Supervised Learning

被引:12
|
作者
Purushwalkam, Senthil [1 ]
Morgado, Pedro [1 ,2 ]
Gupta, Abhinav [1 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[2] Univ Wisconsin, Madison, WI 53706 USA
来源
关键词
D O I
10.1007/978-3-031-19809-0_40
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Self-supervised learning (SSL) aims to eliminate one of the major bottlenecks in representation learning - the need for human annotations. As a result, SSL holds the promise to learn representations from data in-the-wild, i.e., without the need for finite and static datasets. Instead, SSL should exploit the continuous stream of data being generated on the internet or by agents exploring their environments. In this work, we investigate whether traditional self-supervised learning approaches would be effective deployed in-the-wild by conducting experiments on the continuous self-supervised learning problem. In this setup, models should learn from a continuous (infinite) non-IID data stream that follows a non-stationary distribution of visual concepts. The goal is to learn representations that are robust, adaptive yet not forgetful of concepts seen in the past. We show that a direct application of current methods to continuous SSL is 1) inefficient both computationally and in the amount of data required, 2) leads to inferior representations due to temporal correlations (non-IID data) in the streaming sources and 3) exhibits signs of catastrophic forgetting when trained on sources with non-stationary data distributions. We study the use of replay buffers to alleviate the issues of inefficiency and temporal correlations, and enhance them by actively maintaining the least redundant samples in the buffer. We show that minimum redundancy (MinRed) buffers allow us to learn effective representations even in the most challenging streaming scenarios (e.g., sequential frames obtained from a single embodied agent), and alleviates the problem of catastrophic forgetting.
引用
收藏
页码:702 / 721
页数:20
相关论文
共 50 条
  • [41] On Feature Decorrelation in Self-Supervised Learning
    Hua, Tianyu
    Wang, Wenxiao
    Xue, Zihui
    Ren, Sucheng
    Wang, Yue
    Zhao, Hang
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 9578 - 9588
  • [42] Self-Supervised Adversarial Imitation Learning
    Monteiro, Juarez
    Gavenski, Nathan
    Meneguzzi, Felipe
    Barros, Rodrigo C.
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [43] Self-Supervised Learning: Generative or Contrastive
    Liu, Xiao
    Zhang, Fanjin
    Hou, Zhenyu
    Mian, Li
    Wang, Zhaoyu
    Zhang, Jing
    Tang, Jie
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (01) : 857 - 876
  • [44] Graph Self-Supervised Learning: A Survey
    Liu, Yixin
    Jin, Ming
    Pan, Shirui
    Zhou, Chuan
    Zheng, Yu
    Xia, Feng
    Yu, Philip S.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (06) : 5879 - 5900
  • [45] Self-Supervised Learning Across Domains
    Bucci, Silvia
    D'Innocente, Antonio
    Liao, Yujun
    Carlucci, Fabio Maria
    Caputo, Barbara
    Tommasi, Tatiana
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) : 5516 - 5528
  • [46] Nonequilibrium thermodynamics of self-supervised learning
    Salazar, Domingos S. P.
    PHYSICS LETTERS A, 2021, 419
  • [47] Reverse Engineering Self-Supervised Learning
    Ben-Shaul, Ido
    Shwartz-Ziv, Ravid
    Galanti, Tomer
    Dekel, Shai
    LeCun, Yann
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [48] Self-supervised Learning for CT Deconvolution
    Sudhakar, Prasad
    Langoju, Rajesh
    Agrawal, Utkarsh
    Patil, Bhushan D.
    Narayanan, Ajay
    Chaugule, Vinay
    Amilneni, Vinod
    Cheerankal, Paul
    Das, Bipul
    MEDICAL IMAGING 2021: PHYSICS OF MEDICAL IMAGING, 2021, 11595
  • [49] Self-supervised learning for climate downscaling
    Singh, Karandeep
    Jeong, Chaeyoon
    Park, Sungwon
    Babur, Arjun N.
    Zeller, Elke
    Cha, Meeyoung
    2023 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING, BIGCOMP, 2023, : 13 - 17
  • [50] Self-Supervised Learning across the Spectrum
    Shenoy, Jayanth
    Zhang, Xingjian Davis
    Tao, Bill
    Mehrotra, Shlok
    Yang, Rem
    Zhao, Han
    Vasisht, Deepak
    REMOTE SENSING, 2024, 16 (18)